- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 29 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 28 Jun, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
- 27 Jun, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
-
- 26 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 25 Jun, 2024 2 commits
-
-
Matt Wong authored
-
Woo-Yeon Lee authored
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)
-
- 23 Jun, 2024 1 commit
-
-
Murali Andoorveedu authored
-
- 22 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 21 Jun, 2024 2 commits
-
-
youkaichao authored
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
-
youkaichao authored
-
- 18 Jun, 2024 1 commit
-
-
youkaichao authored
[bugfix][distributed] do not error if two processes do not agree on p2p capability (#5612)
-
- 17 Jun, 2024 1 commit
-
-
Kunshang Ji authored
Co-authored-by: Jiang Li <jiang1.li@intel.com>
Co-authored-by: Abhilash Majumder <abhilash.majumder@intel.com>
Co-authored-by: Abhilash Majumder <30946547+abhilash1910@users.noreply.github.com>
-
- 15 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 14 Jun, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 13 Jun, 2024 2 commits
-
-
Antoni Baum authored
-
youkaichao authored
[Core][Distributed] add coordinator to reduce code duplication in tp and pp (#5293)
-
- 11 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 10 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 09 Jun, 2024 1 commit
-
-
bnellnm authored
-
- 29 May, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 23 May, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 16 May, 2024 2 commits
-
-
youkaichao authored
-
Cody Yu authored
Co-authored-by: Cade Daniel <edacih@gmail.com>
Co-authored-by: Cade Daniel <cade@anyscale.com>
-
- 13 May, 2024 1 commit
-
-
youkaichao authored
-
- 10 May, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
[Core][Distributed] refactor pynccl to hold multiple communicators (#4591)
-
- 08 May, 2024 1 commit
-
-
youkaichao authored
[Core][Distributed] support both cpu and device tensor in broadcast tensor dict (#4660)
-
- 07 May, 2024 1 commit
-
-
youkaichao authored
-
- 03 May, 2024 1 commit
-
-
youkaichao authored
-
- 02 May, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
-
- 01 May, 2024 1 commit
-
-
youkaichao authored
-
- 29 Apr, 2024 1 commit
-
-
youkaichao authored
-
- 26 Apr, 2024 1 commit
-
-
SangBin Cho authored
Co-authored-by: Danny Guinther <dguinther@neuralmagic.com>
-
- 24 Apr, 2024 2 commits
-
-
youkaichao authored
[Core][Distributed] use existing torch.cuda.device context manager (#4318)
-
youkaichao authored
-