- 06 Oct, 2025 1 commit
-
-
Mick authored
-
- 29 Sep, 2025 2 commits
-
-
Lianmin Zheng authored
Co-authored-by:sglang-bot <sglangbot@gmail.com>
-
Mick authored
-
- 11 Sep, 2025 1 commit
-
-
Yi Zhang authored
Co-authored-by:
cao1zhg <114661107+cao1zhg@users.noreply.github.com> Co-authored-by:
ispobock <ispobaoke@gmail.com> Co-authored-by:
Binyao Jiang <byjiang1996@gmail.com> Co-authored-by:
hebiao064 <hebiaobuaa@gmail.com> Co-authored-by:
Lifu Huang <lifu.hlf@gmail.com> Co-authored-by:
qingquansong <ustcsqq@gmail.com> Co-authored-by:
Yaoyao Ding <dingyaoyao.cs@gmail.com> Co-authored-by:
Ke Bao <ISPObaoke@163.com> Co-authored-by:
Minglei Zhu <mingleizhu1122@gmail.com>
-
- 05 Aug, 2025 1 commit
-
-
kk authored
Co-authored-by:
wunhuang <wunhuang@amd.com> Co-authored-by:
Hubert Lu <Hubert.Lu@amd.com>
-
- 01 Aug, 2025 1 commit
-
-
Even Zhou authored
Co-authored-by:ronnie_zheng <zl19940307@163.com>
-
- 03 Jul, 2025 1 commit
-
-
Chunyuan WU authored
[CPU] support the case where num_attention_heads or intermediate_size is not divisible by the TP size (#6771)
-
- 01 Jul, 2025 2 commits
-
-
Simon_CQK authored
-
lukec authored
Co-authored-by:
shuaills <shishuaiuoe@gmail.com> Co-authored-by:
Shenggui Li <somerlee.9@gmail.com> Co-authored-by:
Yingyi Huang <yingyihuang2000@outlook.com> Co-authored-by:
yizhang2077 <1109276519@qq.com>
-
- 24 Jun, 2025 1 commit
-
-
xianzhiT authored
-
- 23 Jun, 2025 1 commit
-
-
Yuhong Guo authored
-
- 17 Jun, 2025 1 commit
-
-
Charles Chen authored
-
- 09 Apr, 2025 1 commit
-
-
HandH1998 authored
Co-authored-by:
laixinn <xielx@shanghaitech.edu.cn> Co-authored-by:
sleepcoo <sleepcoo@gmail.com> Co-authored-by:
zhyncs <me@zhyncs.com>
-
- 08 Apr, 2025 1 commit
-
-
DangKai authored
Co-authored-by:dangkai.dk <dangkai.dk@alibaba-inc.com>
-
- 28 Mar, 2025 1 commit
-
-
Brayden Zhong authored
-
- 27 Mar, 2025 1 commit
-
-
Juwan Yoo authored
-
- 14 Mar, 2025 1 commit
-
-
wangyu authored
Signed-off-by:wangyu <wangyu.steph@bytedance.com>
-
- 11 Mar, 2025 1 commit
-
-
Mick authored
-
- 03 Mar, 2025 1 commit
-
-
Lianmin Zheng authored
Support penalty in overlap mode; return logprob with chunked prefill; improve benchmark scripts (#3988) Co-authored-by:
SangBin Cho <rkooo567@gmail.com> Co-authored-by:
dhou-xai <dhou@x.ai> Co-authored-by:
Hanming Lu <hanming_lu@berkeley.edu>
-
- 21 Feb, 2025 1 commit
-
-
Zhiyu authored
-
- 27 Jan, 2025 1 commit
-
-
Lianmin Zheng authored
-
- 22 Jan, 2025 1 commit
-
-
lukec authored
-
- 18 Jan, 2025 1 commit
-
-
bjmsong authored
Co-authored-by:bjmsong <bjmsong@126.com>
-
- 17 Jan, 2025 1 commit
-
-
Yineng Zhang authored
Co-authored-by:Zhangyi <1109276519@qq.com>
-
- 02 Dec, 2024 1 commit
-
-
Yineng Zhang authored
Co-authored-by:HandH1998 <1335248067@qq.com>
-