- 04 Mar, 2025 16 commits
-
Lianmin Zheng authored
-
William authored
-
Liu Jinjie authored
Signed-off-by: Jinjie Liu <jinjie.liu@usc.edu>
-
Xiuyu Li authored
-
Michael Feil authored
-
Chen Shengzhi authored
-
DarkSharpness authored
-
Kebe authored
-
Qubitium-ModelCloud authored
-
HAI authored
-
kk authored
-
Xihuai Wang authored
-
Xihuai Wang authored
Co-authored-by: Lucas Pickup <lupickup@microsoft.com>
-
kk authored
Co-authored-by: wunhuang <wunhuang@amd.com>
-
Yineng Zhang authored
-
Lianmin Zheng authored
-
- 03 Mar, 2025 19 commits
-
Ke Bao authored
-
Chayenne authored
-
Qiaolin Yu authored
-
Chayenne authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Yudi Xue authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988)
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: dhou-xai <dhou@x.ai>
Co-authored-by: Hanming Lu <hanming_lu@berkeley.edu>
-
Stefan He authored
-
yinfan98 authored
-
Chayenne authored
-
Lianmin Zheng authored
-
Baizhou Zhang authored
-
Zhousx authored
Co-authored-by: Achazwl <323163497@qq.com>
Co-authored-by: Chayenne <zhaochen20@outlook.com>
-
Stefan He authored
-
- 02 Mar, 2025 5 commits
-
Hubert Lu authored
-
Ke Bao authored
-
Ke Bao authored
Co-authored-by: yizhang2077 <1109276519@qq.com>
-
Xiaoyu Zhang authored
-
Xiaoyu Zhang authored
-