- 06 Nov, 2025 27 commits
-
-
Ke Bao authored
-
Xiaoyu Zhang authored
[Refactor] Refactor fused_moe_triton tuning tools: extract shared utils, add EP/MLLM support, reduce overhead (#12440) Co-authored-by:
xu-yfei <xu-yfei@users.noreply.github.com> Co-authored-by:
Yongfei Xu <xuyongfei.xyf@antgroup.com>
-
Keyang Ru authored
-
Amit Prakash authored
-
jiapingW authored
Co-authored-by:canghua <canghua.wjp@alibaba-inc.com>
-
yinghui authored
Co-authored-by:Scott Lee <scottjlee@users.noreply.github.com>
-
Yuan Luo authored
Co-authored-by:
luoyuan.luo <luoyuan.luo@antgroup.com> Co-authored-by:
羽癫 <yudian.zy@antgroup.com>
-
Binyao Jiang authored
[GDN] Fuse b.sigmoid(), fused_gdn_gating and unsqueeze into one kernel: up to 0.85% e2e speedup (#12508)
-
Yi Zhang authored
-
Yi Zhang authored
-
Keyang Ru authored
-
Mick authored
-
Chang Su authored
-
Baizhou Zhang authored
-
b8zhong authored
-
Baizhou Zhang authored
-
Atream authored
-
YAMY authored
-
alisonshao authored
-
Baizhou Zhang authored
-
gongwei-130 authored
-
Keyang Ru authored
-
Keyang Ru authored
-
Zaili Wang authored
-
Baizhou Zhang authored
-
Keyang Ru authored
-
Keyang Ru authored
-
- 05 Nov, 2025 13 commits
-
-
Keyang Ru authored
Co-authored-by:Chang Su <chang.s.su@oracle.com>
-
Keyang Ru authored
-
Chang Su authored
-
Chang Su authored
-
Kangyan-Zhou authored
-
Lianmin Zheng authored
-
Keyang Ru authored
-
Chang Su authored
-
Kaixi Hou authored
-
Shu Wang authored
-
Atream authored
Co-authored-by:
Chen Hongtao <56470055+chenht2022@users.noreply.github.com> Co-authored-by:
chenht2022 <cht22@mails.tsinghua.edu.cn>
-
wyx authored
-
Morpheus Guo authored
Co-authored-by:yuechguo <yuechguo@amd.com>
-