- 07 May, 2024 1 commit
-
-
youkaichao authored
-
- 04 May, 2024 1 commit
-
-
Cody Yu authored
-
- 02 May, 2024 3 commits
-
-
SangBin Cho authored
-
SangBin Cho authored
Co-authored-by:Cade Daniel <edacih@gmail.com>
-
SangBin Cho authored
[Bug fix][Core] assert num_new_tokens == 1 fails when SamplingParams.n is not 1 and max_tokens is large & Add tests for preemption (#4451)
-
- 01 May, 2024 2 commits
-
-
leiwen83 authored
Co-authored-by:
Lei Wen <wenlei03@qiyi.com> Co-authored-by:
Sage Moore <sagemoore@utexas.edu>
-
Pastel! authored
-
- 28 Apr, 2024 1 commit
-
-
Ronen Schaffer authored
Co-authored-by:
Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com> Co-authored-by:
Robert Shaw <rshaw@neuralmagic.com>
-
- 27 Apr, 2024 1 commit
-
-
Caio Mendes authored
-
- 26 Apr, 2024 2 commits
-
-
SangBin Cho authored
-
SangBin Cho authored
Co-authored-by:Danny Guinther <dguinther@neuralmagic.com>
-
- 23 Apr, 2024 2 commits
-
-
SangBin Cho authored
-
SangBin Cho authored
-
- 22 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 16 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 15 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 12 Apr, 2024 3 commits
-
-
SangBin Cho authored
-
Zhuohan Li authored
-
Michael Feil authored
Co-authored-by:Roger Wang <136131678+ywang96@users.noreply.github.com>
-
- 11 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 07 Apr, 2024 1 commit
-
-
youkaichao authored
-
- 05 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 03 Apr, 2024 1 commit
-
-
SangBin Cho authored
-
- 02 Apr, 2024 1 commit
-
-
Michael Goin authored
-
- 01 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 28 Mar, 2024 3 commits
-
-
Simon Mo authored
-
SangBin Cho authored
-
Cade Daniel authored
-
- 25 Mar, 2024 3 commits
-
-
xwjiang2010 authored
-
SangBin Cho authored
-
TianYu GUO authored
-
- 22 Mar, 2024 1 commit
-
-
Thomas Parnell authored
Co-authored-by:Jan van Lunteren <jvl@zurich.ibm.com>
-
- 21 Mar, 2024 1 commit
-
-
ElizaWszola authored
Co-authored-by:
rsnm2 <rshaw@neuralmagic.com> Co-authored-by:
Luka <luka@paperspace>
-
- 20 Mar, 2024 2 commits
-
-
SangBin Cho authored
-
ElizaWszola authored
[PREFIX CACHING FOLLOW UP] A bunch of fixes to block allocator performance when automatic prefix caching is disabled (#3357) Co-authored-by:Zhuohan Li <zhuohan123@gmail.com>
-
- 15 Mar, 2024 1 commit
-
-
Tao He authored
Signed-off-by:
Tao He <sighingnow@gmail.com> Co-authored-by:
simon-mo <simon.mo@hey.com>
-
- 13 Mar, 2024 1 commit
-
-
Breno Faria authored
-
- 11 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 08 Mar, 2024 1 commit
-
-
ElizaWszola authored
-
- 05 Mar, 2024 1 commit
-
-
Nick Hill authored
-