- 23 Apr, 2026 3 commits
-
-
shaharmor98 authored
Signed-off-by:
Shahar Mor <smor@nvidia.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Srreyansh Sethi authored
Signed-off-by:
Srreyansh Sethi <srreyansh.sethi@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Julian Huang authored
Signed-off-by:
墨楼 <huangzhilin.hzl@antgroup.com> Co-authored-by:
墨楼 <huangzhilin.hzl@antgroup.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
Codex <codex@openai.com>
-
- 22 Apr, 2026 7 commits
-
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Fynn Schmitt-Ulms authored
Signed-off-by:
Fynn Schmitt-Ulms <fschmitt@redhat.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:
Hollow Man <hollowman@opensuse.org> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com>
-
philip-essential authored
Signed-off-by:Philip Monk <169196560+philip-essential@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Micah Williamson authored
Signed-off-by:Micah Williamson <micah.williamson@amd.com>
-
Carl Y authored
Signed-off-by:
Carl You <4531192+carlyou@users.noreply.github.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
- 21 Apr, 2026 9 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Rishi Puri authored
Signed-off-by:
Stefano Castagnetta <scastagnetta@nvidia.com> Signed-off-by:
Rishi Puri <puririshi98@berkeley.edu> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
Stefano Castagnetta <scastagnetta@nvidia.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
Zijing Liu authored
[MRv2]fix: model accuracy regression caused by reusing the stale last_sampled_tokens and draft_tokens (#39833) Signed-off-by:Zijing Liu <liuzijing2014@gmail.com>
-
Wentao Ye authored
Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Nick Hill <nickhill123@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445) Signed-off-by:shen-shanshan <467638484@qq.com>
-
Kris Hung authored
Signed-off-by:
Krish Hung <krishung5@gmail.com> Signed-off-by:
krishung5 <krish@nvidia.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
hangy-amd authored
Signed-off-by:Hang Yang <hangy@amd.com>
-
- 20 Apr, 2026 6 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
larryli2-amd authored
[ROCm][Feature] Enable AITER MLA attention backend to work with Eagle3 speculative decoding on ROCm (#39616) Signed-off-by:
larryli2-amd <larryli2@amd.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Markov Ilya <markovilya19@gmail.com>
-
Or Ozeri authored
Signed-off-by:
Or Ozeri <oro@il.ibm.com> Co-authored-by:
Nicolò Lucchesi <nlucches@redhat.com>
-
Chaojun Zhang authored
Signed-off-by:
chaojun-zhang <chaojun.zhang@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 19 Apr, 2026 2 commits
-
-
Andrew Barnes authored
Signed-off-by:Bortlesboat <bortstheboat@gmail.com>
-
omerpaz95 authored
Signed-off-by:omerpaz95 <omerpaz95@gmail.com>
-
- 18 Apr, 2026 1 commit
-
-
Dan Alistarh authored
Signed-off-by:Dan Alistarh <d.alistarh@gmail.com>
-
- 17 Apr, 2026 6 commits
-
-
aditi-amd authored
Signed-off-by:aditi <aditi.rana@amd.com>
-
Xinyu Chen authored
Signed-off-by:
Xinyu Chen <xinyu1.chen@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Jing Wang authored
Signed-off-by:
Jing Wang <jingwang96@qq.com> Co-authored-by:
Copilot <175728472+Copilot@users.noreply.github.com>
-
sychen52 authored
Signed-off-by:Shiyang Chen <shiychen@nvidia.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 16 Apr, 2026 4 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Nikita Shapovalov authored
[Bugfix] Fix Ray compiled-DAG SHM channel stalls by detaching zero-copy `np.ndarray` logprobs buffers (#35736) Signed-off-by:Nikita Shapovalov <nikita@poolside.ai>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
- 15 Apr, 2026 2 commits
-
-
Yan Ma authored
Signed-off-by:
Yan Ma <yan.ma@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Csrayz authored
[Metrics] Add request_id to FinishedRequestStats to enable correlation between metrics and requests (#39710) Enables external `StatLogger` plugins to correlate per-request metrics with request-level context. Also, this is a pre-requisite for Prometheus exemplars in #30972. Signed-off-by:Csrayz <33659823+Csrayz@users.noreply.github.com>
-