- 25 Apr, 2026 1 commit
-
-
Andreas Karatzas authored
[ROCm][Engine] Fix GPU memory leaks in engine shutdown and test workaround for async KV prefix cache reset (#38503)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
- 24 Apr, 2026 4 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Luciano Martins authored
Signed-off-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Signed-off-by: Luciano Martins <lucianomartins@google.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: Benjamin Chislett <chislett.ben@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
- 22 Apr, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 20 Apr, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Ilya Markov authored
Signed-off-by: ilmarkov <markovilya197@gmail.com>
Signed-off-by: Markov Ilya <markovilya19@gmail.com>
-
- 17 Apr, 2026 1 commit
-
-
Jing Wang authored
Signed-off-by: Jing Wang <jingwang96@qq.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
-
- 16 Apr, 2026 1 commit
-
-
roikoren755 authored
Signed-off-by: Roi Koren <roik@nvidia.com>
-
- 14 Apr, 2026 2 commits
-
-
Francesco Fusco authored
-
roikoren755 authored
Signed-off-by: Roi Koren <roik@nvidia.com>
-
- 10 Apr, 2026 4 commits
-
-
zhrrr authored
Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
-
Peter Nguyen authored
Signed-off-by: Peter Nguyen <petern0408@gmail.com>
-
Elvir Crnčević authored
Signed-off-by: Elvir Crncevic <elvircrn@gmail.com>
Co-authored-by: Claude Sonnet 4 <noreply@anthropic.com>
-
jackwang2120 authored
Signed-off-by: jackcfwang <jackcfwang@tencent.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 09 Apr, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Ajay Anubolu authored
Signed-off-by: AjAnubolu <anuboluajay@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 08 Apr, 2026 2 commits
-
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
JartX authored
Signed-off-by: JartX <sagformas@epdcenter.es>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 05 Apr, 2026 1 commit
-
-
Greg Pereira authored
Signed-off-by: greg pereira <grpereir@redhat.com>
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 03 Apr, 2026 3 commits
-
-
danisereb authored
Signed-off-by: Daniel Serebrenik <daserebrenik@nvidia.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
shunting314 authored
Signed-off-by: shunting314 <shunting@meta.com>
-
- 01 Apr, 2026 2 commits
-
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 31 Mar, 2026 3 commits
-
-
Vedant V Jhaveri authored
Signed-off-by: Vedant Jhaveri <vjhaveri@linkedin.com>
Co-authored-by: Vedant Jhaveri <vjhaveri@linkedin.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
Signed-off-by: SandishKumarHN <sandishkumarhn@gmail.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: SandishKumarHN <sandishkumarhn@gmail.com>
-
- 30 Mar, 2026 4 commits
-
-
SandishKumarHN authored
Signed-off-by: SandishKumarHN <sandish@fb.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
-
Collin McCarthy authored
Signed-off-by: Collin McCarthy <cmccarthy@nvidia.com>
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
Co-authored-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Nicolò Lucchesi authored
[Mamba][Bugfix] Raise on insufficient cache blocks instead of silently capping cudagraph sizes (#38270)
Signed-off-by: NickLucche <nlucches@redhat.com>
-
- 29 Mar, 2026 2 commits
-
-
Kyle Sayers authored
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 26 Mar, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
- 24 Mar, 2026 3 commits
-
-
Ming Yang authored
-
Sungjae Lee authored
Signed-off-by: Sungjae Lee <33976427+llsj14@users.noreply.github.com>
Signed-off-by: Sungjae Lee <sung-jae.lee@navercorp.com>
Signed-off-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 23 Mar, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
Co-authored-by: zhrrr <43847754+izhuhaoran@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Benjamin Chislett <chislett.ben@gmail.com>
-