- 03 Apr, 2026 1 commit
-
-
shunting314 authored
Signed-off-by: shunting314 <shunting@meta.com>
-
- 01 Apr, 2026 2 commits
-
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 31 Mar, 2026 3 commits
-
-
Vedant V Jhaveri authored
Signed-off-by: Vedant Jhaveri <vjhaveri@linkedin.com>
Co-authored-by: Vedant Jhaveri <vjhaveri@linkedin.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
Signed-off-by: SandishKumarHN <sandishkumarhn@gmail.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: SandishKumarHN <sandishkumarhn@gmail.com>
-
- 30 Mar, 2026 4 commits
-
-
SandishKumarHN authored
Signed-off-by: SandishKumarHN <sandish@fb.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Collin McCarthy authored
Signed-off-by: Collin McCarthy <cmccarthy@nvidia.com>
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
Co-authored-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Nicolò Lucchesi authored
[Mamba][Bugfix] Raise on insufficient cache blocks instead of silently capping cudagraph sizes (#38270)
Signed-off-by: NickLucche <nlucches@redhat.com>
-
- 29 Mar, 2026 2 commits
-
-
Kyle Sayers authored
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 26 Mar, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
- 24 Mar, 2026 3 commits
-
-
Ming Yang authored
-
Sungjae Lee authored
Signed-off-by: Sungjae Lee <33976427+llsj14@users.noreply.github.com>
Signed-off-by: Sungjae Lee <sung-jae.lee@navercorp.com>
Signed-off-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 23 Mar, 2026 3 commits
-
-
Matthew Bonanni authored
Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Co-authored-by: zhrrr <43847754+izhuhaoran@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Benjamin Chislett <chislett.ben@gmail.com>
-
yanghui1-arch authored
Signed-off-by: dass90 <3053034939@qq.com>
-
Baorun (Lauren) Mu authored
Signed-off-by: Baorun Mu <bmu@nvidia.com>
-
- 20 Mar, 2026 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
-
- 19 Mar, 2026 2 commits
-
-
Collin McCarthy authored
Signed-off-by: Collin McCarthy <cmccarthy@nvidia.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 18 Mar, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
-
- 17 Mar, 2026 1 commit
-
-
Benjamin Chislett authored
-
- 16 Mar, 2026 1 commit
-
-
Fynn Schmitt-Ulms authored
-
- 13 Mar, 2026 2 commits
-
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Ekagra Ranjan authored
Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
- 11 Mar, 2026 2 commits
-
-
Hongxin Xu authored
Signed-off-by: arlenxu <arlenxu@tencent.com>
Signed-off-by: xhx1022 <1737006628@qq.com>
Co-authored-by: arlenxu <arlenxu@tencent.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 10 Mar, 2026 2 commits
-
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Wentao Ye authored
[Perf] Compute maxsim in worker side, reducing redundant copies, 2.7% E2E throughput improvement (#36159)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 09 Mar, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Mar, 2026 2 commits
-
-
PatchyTIS authored
-
Matthew Bonanni authored
-
- 06 Mar, 2026 2 commits
-
-
zhanqiuhu authored
Signed-off-by: Claude <noreply@anthropic.com>
Signed-off-by: Zhanqiu Hu <zh338@cornell.edu>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 05 Mar, 2026 1 commit
-
-
Jiayi Yan authored
Signed-off-by: 1195343015 <1195343015@qq.com>
Signed-off-by: Jiayi Yan <66017932+1195343015@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-