- 27 Apr, 2026 2 commits
-
-
Dao007forever authored
Signed-off-by: Dao Le <Dao007forever@gmail.com>
Signed-off-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
(cherry picked from commit 7b1bc0a3eb01a6bc2650eda9970049f7825240d7)
-
Yifan Qiao authored
Signed-off-by: Yifan Qiao <yifanqiao@inferact.ai>
Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>
Signed-off-by: qizixi <zixi@inferact.ai>
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: Yongye Zhu <yongye@inferact.ai>
Co-authored-by: Simon Mo <simon@inferact.ai>
Co-authored-by: Bugen Zhao <i@bugenzhao.com>
Co-authored-by: Giancarlo Delfin <gdelfin@inferact.ai>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
Co-authored-by: Roy Wang <yasong.wang@inferact.ai>
Co-authored-by: Woosuk Kwon <woosuk@inferact.ai>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Zhewen Li <jerven.vllm@gmail.com>
Co-authored-by: Zijing Liu <liuzijing2014@gmail.com>
Co-authored-by: khluu <khluu000@gmail.com>
Co-authored-by: qizixi <zixi@inferact.ai>
Co-authored-by: Zhewen Li <zhewenli@inferact.ai>
-
- 26 Apr, 2026 1 commit
-
-
Xinan Miao authored
Signed-off-by: SouthWest7 <am1ao@qq.com>
Signed-off-by: Xinan Miao <1403572259@qq.com>
Co-authored-by: SouthWest7 <am1ao@qq.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: OpenAI Codex <codex@openai.com>
Co-authored-by: Wang Xingran <72983099+wangxingran222@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
- 25 Apr, 2026 1 commit
-
-
Andreas Karatzas authored
[ROCm][Engine] Fix GPU memory leaks in engine shutdown and test workaround for async KV prefix cache reset (#38503)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
- 24 Apr, 2026 8 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
qli88 authored
Signed-off-by: Qiang Li <qiang.li2@amd.com>
-
JartX authored
Signed-off-by: JartX <sagformas@epdcenter.es>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
Luciano Martins authored
Signed-off-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Signed-off-by: Luciano Martins <lucianomartins@google.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: Benjamin Chislett <chislett.ben@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nickhill123@gmail.com>
-
- 23 Apr, 2026 5 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
czhu-cohere authored
Signed-off-by: root <conway.zhu@cohere.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
shaharmor98 authored
Signed-off-by: Shahar Mor <smor@nvidia.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Srreyansh Sethi authored
Signed-off-by: Srreyansh Sethi <srreyansh.sethi@gmail.com>
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Julian Huang authored
Signed-off-by: 墨楼 <huangzhilin.hzl@antgroup.com>
Co-authored-by: 墨楼 <huangzhilin.hzl@antgroup.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Codex <codex@openai.com>
-
- 22 Apr, 2026 7 commits
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Fynn Schmitt-Ulms authored
Signed-off-by: Fynn Schmitt-Ulms <fschmitt@redhat.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
philip-essential authored
Signed-off-by: Philip Monk <169196560+philip-essential@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Micah Williamson authored
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
-
Carl Y authored
Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
- 21 Apr, 2026 9 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Rishi Puri authored
Signed-off-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Signed-off-by: Rishi Puri <puririshi98@berkeley.edu>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Benjamin Chislett <bchislett@nvidia.com>
-
Zijing Liu authored
[MRv2] fix: model accuracy regression caused by reusing the stale last_sampled_tokens and draft_tokens (#39833)
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445)
Signed-off-by: shen-shanshan <467638484@qq.com>
-
Kris Hung authored
Signed-off-by: Krish Hung <krishung5@gmail.com>
Signed-off-by: krishung5 <krish@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
hangy-amd authored
Signed-off-by: Hang Yang <hangy@amd.com>
-
- 20 Apr, 2026 6 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
larryli2-amd authored
[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616)
Signed-off-by: larryli2-amd <larryli2@amd.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>
-
Ilya Markov authored
Signed-off-by: ilmarkov <markovilya197@gmail.com>
Signed-off-by: Markov Ilya <markovilya19@gmail.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
Co-authored-by: Nicolò Lucchesi <nlucches@redhat.com>
-
Chaojun Zhang authored
Signed-off-by: chaojun-zhang <chaojun.zhang@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
- 19 Apr, 2026 1 commit
-
-
Andrew Barnes authored
Signed-off-by: Bortlesboat <bortstheboat@gmail.com>
-