- 06 Nov, 2025 12 commits
-
-
sglang-bot authored
-
Ke Bao authored
-
jiapingW authored
Co-authored-by:canghua <canghua.wjp@alibaba-inc.com>
-
yinghui authored
Co-authored-by:Scott Lee <scottjlee@users.noreply.github.com>
-
Yuan Luo authored
Co-authored-by:
luoyuan.luo <luoyuan.luo@antgroup.com> Co-authored-by:
羽癫 <yudian.zy@antgroup.com>
-
Binyao Jiang authored
[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to 0.85% e2e speedup (#12508)
-
Yi Zhang authored
-
Yi Zhang authored
-
Mick authored
-
Atream authored
-
YAMY authored
-
gongwei-130 authored
-
- 05 Nov, 2025 21 commits
-
-
Lianmin Zheng authored
-
Kaixi Hou authored
-
Shu Wang authored
-
Atream authored
Co-authored-by:
Chen Hongtao <56470055+chenht2022@users.noreply.github.com> Co-authored-by:
chenht2022 <cht22@mails.tsinghua.edu.cn>
-
Morpheus Guo authored
Co-authored-by:yuechguo <yuechguo@amd.com>
-
Mick authored
Co-authored-by:
yhyang201 <yhyang201@gmail.com> Co-authored-by:
yizhang2077 <1109276519@qq.com> Co-authored-by:
Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by:
ispobock <ispobaoke@gmail.com> Co-authored-by:
JiLi <leege233@gmail.com> Co-authored-by:
CHEN Xi <78632976+RubiaCx@users.noreply.github.com> Co-authored-by:
laixin <xielx@shanghaitech.edu.cn> Co-authored-by:
SolitaryThinker <wlsaidhi@gmail.com> Co-authored-by:
jzhang38 <a1286225768@gmail.com> Co-authored-by:
BrianChen1129 <yongqichcd@gmail.com> Co-authored-by:
Kevin Lin <42618777+kevin314@users.noreply.github.com> Co-authored-by:
Edenzzzz <wtan45@wisc.edu> Co-authored-by:
rlsu9 <r3su@ucsd.edu> Co-authored-by:
Jinzhe Pan <48981407+eigensystem@users.noreply.github.com> Co-authored-by:
foreverpiano <pianoqwz@qq.com> Co-authored-by:
RandNMR73 <notomatthew31@gmail.com> Co-authored-by:
PorridgeSwim <yz3883@columbia.edu> Co-authored-by:
Jiali Chen <90408393+gary-chenjl@users.noreply.github.com>
-
Lianmin Zheng authored
-
bigmoyan authored
Signed-off-by:wangzhengtao <wangzhengtao@msh.team>
-
Yuxuan Zhang authored
-
Yuhong Guo authored
-
yinghui authored
-
zejunchen-zejun authored
Signed-off-by:
zejunchen-zejun <zejun.chen@amd.com> Co-authored-by:
HAI <hixiao@gmail.com>
-
yinghui authored
-
Glen Liu authored
[Feature] add --lora-request-distribution arg to bench_serving.py and support skewed and distinct workloads (#12175)
-
ai-easy-cpu authored
Co-authored-by:
AI-bot-easy <litchys0123@outlook.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Liangsheng Yin authored
-
Kaixi Hou authored
-
sglang-bot authored
-
Nicolas Castet authored
-
soaringk authored
-
Baizhou Zhang authored
-
- 04 Nov, 2025 7 commits
-
-
Kaixi Hou authored
-
Johnsonms authored
-
Lianmin Zheng authored
Co-authored-by:
github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by:
SangBin Cho <rkooo567@gmail.com>
-
Baizhou Zhang authored
-
Trevor Morris authored
-
Liangsheng Yin authored
-
Liangsheng Yin authored
-