- 30 Aug, 2024 1 commit
-
-
Richard Liu authored
-
- 26 Aug, 2024 1 commit
-
-
youkaichao authored
-
- 17 Aug, 2024 2 commits
-
-
youkaichao authored
Co-authored-by: cjackal <44624812+cjackal@users.noreply.github.com>
-
bnellnm authored
-
- 14 Aug, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 13 Aug, 2024 2 commits
-
-
Woosuk Kwon authored
-
Cyrus Leung authored
-
- 07 Aug, 2024 1 commit
-
-
youkaichao authored
-
- 05 Aug, 2024 1 commit
-
-
Rui Qiao authored
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
-
- 04 Aug, 2024 1 commit
-
-
youkaichao authored
-
- 01 Aug, 2024 1 commit
-
-
Aurick Qiao authored
Co-authored-by: Aurick Qiao <aurick.qiao@snowflake.com>
-
- 31 Jul, 2024 2 commits
-
-
Cody Yu authored
Co-authored-by: youkaichao <youkaichao@126.com>
-
Cyrus Leung authored
-
- 27 Jul, 2024 2 commits
-
-
Woosuk Kwon authored
-
Woosuk Kwon authored
-
- 26 Jul, 2024 1 commit
-
-
Li, Jiang authored
[Hardware] [Intel] Enable Multiprocessing and tensor parallel in CPU backend and update documentation (#6125)
-
- 25 Jul, 2024 2 commits
-
-
Tyler Michael Smith authored
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
youkaichao authored
-
- 21 Jul, 2024 1 commit
-
-
Woosuk Kwon authored
-
- 19 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 18 Jul, 2024 1 commit
-
-
Nick Hill authored
-
- 17 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 16 Jul, 2024 1 commit
-
-
Cyrus Leung authored
-
- 12 Jul, 2024 1 commit
-
-
youkaichao authored
[distributed][misc] keep consistent with how pytorch finds libcudart.so (#6346)
-
- 10 Jul, 2024 1 commit
-
-
youkaichao authored
[core][distributed] add zmq fallback for broadcasting large objects (#6183)
-
- 03 Jul, 2024 2 commits
-
-
youkaichao authored
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
Murali Andoorveedu authored
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 29 Jun, 2024 1 commit
-
-
Cyrus Leung authored
-
- 28 Jun, 2024 2 commits
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: Roger Wang <ywang@roblox.com>
-
- 27 Jun, 2024 1 commit
-
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
-
- 26 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 25 Jun, 2024 2 commits
-
-
Matt Wong authored
-
Woo-Yeon Lee authored
[Speculative Decoding] Support draft model on different tensor-parallel size than target model (#5414)
-
- 23 Jun, 2024 1 commit
-
-
Murali Andoorveedu authored
-
- 22 Jun, 2024 1 commit
-
-
youkaichao authored
-
- 21 Jun, 2024 2 commits
-
-
youkaichao authored
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>
-
youkaichao authored
-