- 15 Jul, 2024 3 commits
-
-
DefTruth authored
-
zifeitong authored
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Simon Mo authored
-
- 14 Jul, 2024 4 commits
-
-
Ethan Xu authored
Co-authored-by: simon-mo <simon.mo@hey.com>
-
Robert Shaw authored
-
Isotr0py authored
Co-authored-by: Roger Wang <ywang@roblox.com>
-
Robert Shaw authored
-
- 13 Jul, 2024 4 commits
-
-
Woosuk Kwon authored
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
-
youkaichao authored
Signed-off-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
Co-authored-by: Muralidhar Andoorveedu <muralidhar.andoorveedu@centml.ai>
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
- 12 Jul, 2024 11 commits
-
-
Woosuk Kwon authored
-
Michael Goin authored
-
Cody Yu authored
-
Cyrus Leung authored
-
Robert Shaw authored
-
Robert Shaw authored
-
Hongxia Yang authored
-
Michael Goin authored
-
Helena Kloosterman authored
-
youkaichao authored
[distributed][misc] keep consistent with how pytorch finds libcudart.so (#6346)
-
Lily Liu authored
-
- 11 Jul, 2024 8 commits
-
-
Robert Shaw authored
Co-authored-by: Zifei Tong <zifeitong@gmail.com>
-
xwjiang2010 authored
Signed-off-by: Xiaowei Jiang <xwjiang2010@gmail.com>
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
-
Mor Zusman authored
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Travis Johnson <tsjohnso@us.ibm.com>
-
pushan authored
Signed-off-by: yatta zhang <ytzhang01@foxmail.com>
Signed-off-by: zhangyuntao.dev <zhangyuntao.dev@bytedance.com>
Co-authored-by: zhangyuntao.dev <zhangyuntao.dev@bytedance.com>
-
aniaan authored
-
daquexian authored
-
- 10 Jul, 2024 10 commits
-
-
Woosuk Kwon authored
-
sroy745 authored
[Speculative Decoding] Enabling bonus token in speculative decoding for KV cache based models (#5765)
-
sangjune.park authored
Signed-off-by: sangjune.park <sangjune.park@navercorp.com>
-
Benjamin Muskalla authored
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
Woosuk Kwon authored
-
Cyrus Leung authored
-
Woosuk Kwon authored
-
youkaichao authored
[core][distributed] add zmq fallback for broadcasting large objects (#6183)
-
Abhinav Goyal authored
-