- 25 Jul, 2024 2 commits
-
-
Tyler Michael Smith authored
Co-authored-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
youkaichao authored
-
- 21 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 19 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 18 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 17 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 16 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 12 Jul, 2024 1 commit
-
-
youkaichao authored
[distributed][misc] keep consistent with how pytorch finds libcudart.so (#6346)
-
- 10 Jul, 2024 1 commit
-
-
youkaichao authored
[core][distributed] add zmq fallback for broadcasting large objects (#6183)
-
- 03 Jul, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 29 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 28 Jun, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by:Xiaowei Jiang <xwjiang2010@gmail.com>
-
xwjiang2010 authored
Signed-off-by:
Xiaowei Jiang <xwjiang2010@gmail.com> Co-authored-by:
Roger Wang <ywang@roblox.com>
-
- 27 Jun, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by:Xiaowei Jiang <xwjiang2010@gmail.com>
-
- 26 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 25 Jun, 2024 2 commits
-
-
Matt Wong authored
-
Woo-Yeon Lee authored
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)
-
- 23 Jun, 2024 1 commit
-
-
Murali Andoorveedu authored
-
- 22 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 21 Jun, 2024 2 commits
-
-
youkaichao authored
Co-authored-by:Cody Yu <hao.yu.cody@gmail.com>
-
youkaichao authored
-
- 18 Jun, 2024 1 commit
-
-
youkaichao authored
[bugfix][distributed] do not error if two processes do not agree on p2p capability (#5612)
-
- 17 Jun, 2024 1 commit
-
-
Kunshang Ji authored
Co-authored-by:
Jiang Li <jiang1.li@intel.com> Co-authored-by:
Abhilash Majumder <abhilash.majumder@intel.com> Co-authored-by:
Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com>
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 14 Jun, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 13 Jun, 2024 2 commits
-
-
Antoni Baum authored
-
youkaichao authored
[Core][Distributed] add coordinator to reduce code duplication in tp and pp (#5293)
-
- 11 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 10 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 09 Jun, 2024 1 commit
-
-
bnellnm authored
-
- 29 May, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 23 May, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by:Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 16 May, 2024 2 commits
-
-
youkaichao authored
-
Cody Yu authored
Co-authored-by:
Cade Daniel <edacih@gmail.com> Co-authored-by:
Cade Daniel <cade@anyscale.com>
-
- 13 May, 2024 1 commit
-
-
youkaichao authored
-