- 01 Jul, 2024 5 commits
-
-
zhyncs authored
-
youkaichao authored
-
Robert Shaw authored
-
sroy745 authored
-
youkaichao authored
-
- 30 Jun, 2024 9 commits
-
-
Robert Shaw authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Dipika Sikka authored
-
Robert Shaw authored
Co-authored-by:rshaw@neuralmagic.com <rshaw@neuralmagic>
-
SangBin Cho authored
Co-authored-by:sang <sangcho@anyscale.com>
-
llmpros authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
youkaichao authored
[ci][distributed] fix some cuda init that makes it necessary to use spawn (#5991)
-
Cyrus Leung authored
-
Cyrus Leung authored
-
Roger Wang authored
-
- 29 Jun, 2024 14 commits
-
-
Matt Wong authored
-
Robert Shaw authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Cyrus Leung authored
-
Robert Shaw authored
Co-authored-by:Michael Goin <michael@neuralmagic.com>
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Cody Yu authored
-
Antoni Baum authored
-
Cyrus Leung authored
-
Roger Wang authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Woosuk Kwon authored
-
Joe Runde authored
Signed-off-by:Joe Runde <joe@joerun.de>
-
William Lin authored
Co-authored-by:Antoni Baum <antoni.baum@protonmail.com>
-
mcalman authored
-
Woosuk Kwon authored
-
Woosuk Kwon authored
-
- 28 Jun, 2024 12 commits
-
-
Lily Liu authored
Co-authored-by:LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
-
Robert Shaw authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Tyler Michael Smith authored
-
Michael Goin authored
-
wangding zeng authored
Co-authored-by:Philipp Moritz <pcmoritz@gmail.com>
-
Robert Shaw authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Robert Shaw authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic>
-
Tyler Michael Smith authored
-
Cody Yu authored
-
xwjiang2010 authored
Signed-off-by:Xiaowei Jiang <xwjiang2010@gmail.com>
-
Cyrus Leung authored
-
Thomas Parnell authored
[Bugfix] Better error message for MLPSpeculator when `num_speculative_tokens` is set too high (#5894)
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-