- 22 Apr, 2026 1 commit
-
-
wangmin6 authored
-
- 18 Apr, 2026 1 commit
-
-
wanglong3 authored
-
- 16 Mar, 2026 1 commit
-
-
王敏 authored
-
- 12 Mar, 2026 1 commit
-
-
wujl5 authored
-
- 06 Mar, 2026 1 commit
-
-
王敏 authored
-
- 02 Mar, 2026 1 commit
-
-
zhuwenwen authored
-
- 16 Feb, 2026 1 commit
-
-
Rayyyyy authored
-
- 06 Feb, 2026 2 commits
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 04 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 03 Feb, 2026 3 commits
-
-
Richard Zou authored
[torch.compile] Don't do the fast moe cold start optimization if there is speculative decoding (#33624) Signed-off-by:
Richard Zou <zou3519@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> (cherry picked from commit 5eac9a1b)
-
Richard Zou authored
Signed-off-by:
Richard Zou <zou3519@gmail.com> (cherry picked from commit d9aa39a3)
-
zhuwenwen authored
-
- 02 Feb, 2026 2 commits
-
-
csy0225 authored
Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
i-zhangmingming <i-zhangmingming@stepfun.com> Co-authored-by:
xiewuxun <xiewuxun@stepfun.com> Co-authored-by:
zetaohong <i-hongzetao@stepfun.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> (cherry picked from commit c3b40dc3)
-
Luka Govedič authored
[fix][torch.compile] Fix cold-start compilation time increase by adding kv cache update to splitting ops (#33441) Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Richard Zou <zou3519@gmail.com> (cherry picked from commit 15f40b20)
-
- 27 Jan, 2026 1 commit
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
- 26 Jan, 2026 1 commit
-
-
Yuxuan Zhang authored
Signed-off-by:
zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 24 Jan, 2026 2 commits
-
-
Hiroken. authored
Signed-off-by:
Hongjian Zhang <zhanghongjian@xiaohongshu.com> Signed-off-by:
Xingran Wang <wangxingran123456@outlook.com> Co-authored-by:
Xingran Wang <wangxingran123456@outlook.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 23 Jan, 2026 2 commits
-
-
Harry Huang authored
Signed-off-by:
huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 22 Jan, 2026 4 commits
-
-
David Ramon Prados authored
-
Alex Sun authored
Signed-off-by:Alex Sun <alex.s@amd.com>
-
Ifta khairul Alam Adil authored
Signed-off-by:
Ifta Khairul Alam Adil <ikaadil007@gmail.com> Signed-off-by:
Ifta khairul Alam Adil <25082512+ikaadil@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 21 Jan, 2026 3 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
whx authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 20 Jan, 2026 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 19 Jan, 2026 3 commits
-
-
Matthew Bonanni authored
[Attention][MLA] Make FLASHINFER_MLA the default MLA backend on Blackwell, and TRTLLM the default prefill (#32615) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
Yuxuan Zhang authored
Signed-off-by:
zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by:
Yuxuan Zhang <2448370773@qq.com>
-
- 17 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 16 Jan, 2026 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> (cherry picked from commit 1be5a735)
-
- 15 Jan, 2026 4 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
[Attention][MLA] Make `FLASHINFER_MLA` the default MLA backend on Blackwell, and TRTLLM the default prefill (#32339) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
dtc authored
Signed-off-by:Tianchen Ding <dtcccc@linux.alibaba.com>
-