- 22 Apr, 2026 10 commits
-
-
Ekagra Ranjan authored
Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Micah Williamson authored
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
-
Carl Y authored
Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Rishapveer Singh authored
Signed-off-by: Rishapveer Singh <singhrishapveer@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Martin Hickey authored
Signed-off-by: Martin Hickey <martin.hickey@ie.ibm.com>
-
Jaseel Muhammad authored
Signed-off-by: Jaseel Muhammad <jaseel.muhammad@mbzuai.ac.ae>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
EdalatiAli authored
Signed-off-by: EdalatiAli <aliedalati@cohere.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Soila Kavulya authored
Signed-off-by: Soila Kavulya <soila.p.kavulya.intel.com>
-
rasmith authored
[ROCm][P/D][MORI][BugFix] Ensure correct api is used when making requests to prefill / decode nodes (#39835)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Khushali Desai authored
Signed-off-by: khushali9 <khushali.desai9@gmail.com>
-
- 21 Apr, 2026 30 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Fergus authored
Signed-off-by: Fergus <fergus.barratt00@gmail.com>
Signed-off-by: fergus barratt <fergus.barratt00@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
Rishi Puri authored
Signed-off-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Signed-off-by: Rishi Puri <puririshi98@berkeley.edu>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Benjamin Chislett <bchislett@nvidia.com>
-
Zijing Liu authored
[MRv2] fix: model accuracy regression caused by reusing the stale last_sampled_tokens and draft_tokens (#39833)
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
roikoren755 authored
Default to 'align' mamba cache mode for Mamba-based models when speculative decoding is enabled (#40454)
Signed-off-by: Roi Koren <roik@nvidia.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445)
Signed-off-by: shen-shanshan <467638484@qq.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
-
Talor Abramovich authored
Signed-off-by: talora <talora@nvidia.com>
Signed-off-by: Talor Abramovich <talor19@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
artem-spector authored
Signed-off-by: Artem Spector <artems@il.ibm.com>
Signed-off-by: artemspector <artems@il.ibm.com>
Co-authored-by: artemspector <artems@il.ibm.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
wang.yuqi authored
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Kris Hung authored
Signed-off-by: Krish Hung <krishung5@gmail.com>
Signed-off-by: krishung5 <krish@nvidia.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
wang.yuqi authored
Revert "[Startup] Parallelize torch/transformers import + weight prefetch + forkserver prewarm" (#40438)
Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
-
Zeyu Zhang authored
Signed-off-by: Alchuang22-dev <2584829494@qq.com>
-
hangy-amd authored
Signed-off-by: Hang Yang <hangy@amd.com>
-
milesial authored
-
SeongJun Lee authored
Signed-off-by: lesj0610 <lesj0610@gmail.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Simon Mo authored
Signed-off-by: simon-mo <simon@inferact.ai>
-
Luciano Martins authored
Signed-off-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Co-authored-by: Luciano Martins <lucianommartins@users.noreply.github.com>
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
Misa authored
[Bugfix] Fix `_CONFIG_REGISTRY` types getting wrong config class when on-disk model_type differs (#39554)
Signed-off-by: Misa <misaAle@users.noreply.github.com>
Signed-off-by: Misael Casarez <misacasa@amazon.com>
Co-authored-by: Misael Casarez <misacasa@amazon.com>
-
Yanan Cao authored
Signed-off-by: Yanan Cao <gmagogsfm@gmail.com>
Co-authored-by: Theresa Shan <Theresa.Shan@amd.com>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-