- 07 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:
Lei Wen <wenlei03@qiyi.com> Co-authored-by:
Cade Daniel <edacih@gmail.com> Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
-
- 04 May, 2024 1 commit
-
-
Cody Yu authored
-
- 03 May, 2024 2 commits
-
-
Cade Daniel authored
-
SangBin Cho authored
-
- 02 May, 2024 1 commit
-
-
SangBin Cho authored
[Bug fix][Core] assert num_new_tokens == 1 fails when SamplingParams.n is not 1 and max_tokens is large & Add tests for preemption (#4451)
-
- 01 May, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:Lei Wen <wenlei03@qiyi.com>
-
- 30 Apr, 2024 1 commit
-
-
leiwen83 authored
Co-authored-by:Lei Wen <wenlei03@qiyi.com>
-
- 23 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 16 Apr, 2024 2 commits
-
-
Cade Daniel authored
-
Antoni Baum authored
-
- 09 Apr, 2024 1 commit
-
-
Cade Daniel authored
[Misc] [Core] Implement RFC "Augment BaseExecutor interfaces to enable hardware-agnostic speculative decoding" (#3837)
-
- 05 Apr, 2024 1 commit
-
-
Cade Daniel authored
-
- 03 Apr, 2024 1 commit
-
-
Cade Daniel authored
Co-authored-by:Lily Liu <lilyliupku@gmail.com>
-
- 25 Mar, 2024 2 commits
-
-
xwjiang2010 authored
-
SangBin Cho authored
-
- 22 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 20 Mar, 2024 1 commit
-
-
SangBin Cho authored
-
- 11 Mar, 2024 1 commit
-
-
Zhuohan Li authored
-
- 09 Mar, 2024 1 commit
-
-
Cade Daniel authored
-