- 27 Apr, 2026 1 commit
-
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@inferact.ai> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Signed-off-by:
qizixi <zixi@inferact.ai> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Yongye Zhu <yongye@inferact.ai> Co-authored-by:
Simon Mo <simon@inferact.ai> Co-authored-by:
Bugen Zhao <i@bugenzhao.com> Co-authored-by:
Giancarlo Delfin <gdelfin@inferact.ai> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roy Wang <yasong.wang@inferact.ai> Co-authored-by:
Woosuk Kwon <woosuk@inferact.ai> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
Zhewen Li <jerven.vllm@gmail.com> Co-authored-by:
Zijing Liu <liuzijing2014@gmail.com> Co-authored-by:
khluu <khluu000@gmail.com> Co-authored-by:
qizixi <zixi@inferact.ai> Co-authored-by:
Zhewen Li <zhewenli@inferact.ai>
-
- 25 Apr, 2026 1 commit
-
-
Andreas Karatzas authored
[ROCm][Engine] Fix GPU memory leaks in engine shutdown and test workaround for async KV prefix cache reset (#38503) Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 24 Apr, 2026 4 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Luciano Martins authored
Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Signed-off-by:
Luciano Martins <lucianomartins@google.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Benjamin Chislett authored
Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Signed-off-by:
Benjamin Chislett <chislett.ben@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 22 Apr, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 20 Apr, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Markov Ilya <markovilya19@gmail.com>
-
- 17 Apr, 2026 1 commit
-
-
Jing Wang authored
Signed-off-by:
Jing Wang <jingwang96@qq.com> Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com>
-
- 16 Apr, 2026 1 commit
-
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
- 14 Apr, 2026 2 commits
-
-
Francesco Fusco authored
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
- 10 Apr, 2026 4 commits
-
-
zhrrr authored
Signed-off-by:zhuhaoran <zhuhaoran.zhr@alibaba-inc.com>
-
Peter Nguyen authored
Signed-off-by:Peter Nguyen <petern0408@gmail.com>
-
Elvir Crnčević authored
Signed-off-by:
Elvir Crncevic <elvircrn@gmail.com> Co-authored-by:
Claude Sonnet 4 <noreply@anthropic.com>
-
jackwang2120 authored
Signed-off-by:
jackcfwang <jackcfwang@tencent.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 09 Apr, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Ajay Anubolu authored
Signed-off-by:
AjAnubolu <anuboluajay@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 08 Apr, 2026 2 commits
-
-
Richard Zou authored
Signed-off-by:Richard Zou <zou3519@gmail.com>
-
JartX authored
Signed-off-by:
JartX <sagformas@epdcenter.es> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 05 Apr, 2026 1 commit
-
-
Greg Pereira authored
Signed-off-by:
greg pereira <grpereir@redhat.com> Signed-off-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 03 Apr, 2026 3 commits
-
-
danisereb authored
Signed-off-by:Daniel Serebrenik <daserebrenik@nvidia.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
shunting314 authored
Signed-off-by:shunting314 <shunting@meta.com>
-
- 01 Apr, 2026 2 commits
-
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 31 Mar, 2026 3 commits
-
-
Vedant V Jhaveri authored
Signed-off-by:
Vedant Jhaveri <vjhaveri@linkedin.com> Co-authored-by:
Vedant Jhaveri <vjhaveri@linkedin.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Matthew Bonanni authored
Signed-off-by:
SandishKumarHN <sandishkumarhn@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
SandishKumarHN <sandishkumarhn@gmail.com>
-
- 30 Mar, 2026 4 commits
-
-
SandishKumarHN authored
Signed-off-by:
SandishKumarHN <sandish@fb.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Collin McCarthy authored
Signed-off-by:
Collin McCarthy <cmccarthy@nvidia.com> Signed-off-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com> Co-authored-by:
Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Nicolò Lucchesi authored
[Mamba][Bugfix] Raise on insufficient cache blocks instead of silently capping cudagraph sizes (#38270) Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 29 Mar, 2026 2 commits
-
-
Kyle Sayers authored
Signed-off-by:Kyle Sayers <kylesayrs@gmail.com>
-
Wentao Ye authored
[Perf] Remove redundant device copies for CPU-only pooling token IDs, 48.9% E2E throughput improvement (#38139) Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 26 Mar, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai> Signed-off-by:
Nick Hill <nickhill123@gmail.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
- 24 Mar, 2026 3 commits
-
-
Ming Yang authored
-
Sungjae Lee authored
Signed-off-by:
Sungjae Lee <33976427+llsj14@users.noreply.github.com> Signed-off-by:
Sungjae Lee <sung-jae.lee@navercorp.com> Signed-off-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-