- 06 Mar, 2025 11 commits
-
-
yinfan98 authored
-
Lianmin Zheng authored
Co-authored-by:
Sehoon Kim <kssteven418@gmail.com> Co-authored-by:
SangBin Cho <rkooo567@gmail.com> Co-authored-by:
Sehoon Kim <sehoon@x.ai>
-
Lzhang-hub authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Zhiqiang Xie authored
-
Lianmin Zheng authored
-
Wenxuan Tan authored
-
Yineng Zhang authored
-
Yueyang Pan authored
Co-authored-by:Zhiqiang Xie <xiezhq@stanford.edu>
-
luzengxiangcn authored
Co-authored-by:Zhiqiang Xie <xiezhq@stanford.edu>
-
- 05 Mar, 2025 9 commits
-
-
Jhin authored
Co-authored-by:zhaochenyang20 <zhaochen20@outlook.com>
-
Ke Bao authored
-
Baizhou Zhang authored
-
HAI authored
-
Ying Sheng authored
Co-authored-by:Ke Bao <ISPObaoke@163.com>
-
yigex authored
-
Qubitium-ModelCloud authored
Signed-off-by:
ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by:
ZX-ModelCloud <zx@modelcloud.ai>
-
Mick authored
Co-authored-by:zhaochenyang20 <zhaochen20@outlook.com>
-
Lianmin Zheng authored
-
- 04 Mar, 2025 12 commits
-
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
William authored
-
Xiuyu Li authored
-
Chen Shengzhi authored
-
DarkSharpness authored
-
Qubitium-ModelCloud authored
-
HAI authored
-
kk authored
-
Xihuai Wang authored
-
Xihuai Wang authored
Co-authored-by:Lucas Pickup <lupickup@microsoft.com>
-
kk authored
Co-authored-by:wunhuang <wunhuang@amd.com>
-
- 03 Mar, 2025 8 commits
-
-
Ke Bao authored
-
Qiaolin Yu authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988) Co-authored-by:
SangBin Cho <rkooo567@gmail.com> Co-authored-by:
dhou-xai <dhou@x.ai> Co-authored-by:
Hanming Lu <hanming_lu@berkeley.edu>
-
yinfan98 authored
-