- 22 Apr, 2026 19 commits
-
storyicon authored
Signed-off-by: storyicon <storyicon@foxmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Chauncey authored
[Bugfix] [Reasoning] Add reasoning_start_str/reasoning_end_str properties to reasoning parsers (#40566)
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
philip-essential authored
Signed-off-by: Philip Monk <169196560+philip-essential@users.noreply.github.com>
-
Carl Y authored
Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Bugen Zhao authored
Signed-off-by: Bugen Zhao <i@bugenzhao.com>
-
Ekagra Ranjan authored
Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Micah Williamson authored
Signed-off-by: Micah Williamson <micah.williamson@amd.com>
-
Carl Y authored
Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Rishapveer Singh authored
Signed-off-by: Rishapveer Singh <singhrishapveer@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Martin Hickey authored
Signed-off-by: Martin Hickey <martin.hickey@ie.ibm.com>
-
Jaseel Muhammad authored
Signed-off-by: Jaseel Muhammad <jaseel.muhammad@mbzuai.ac.ae>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
rasmith authored
[AMD][CI][BugFix] Override normalize_e4m3fn_to_e4m3fnuz for fnuz machines in test_moe_layer_no_parallel (#40550)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
EdalatiAli authored
Signed-off-by: EdalatiAli <aliedalati@cohere.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Soila Kavulya authored
Signed-off-by: Soila Kavulya <soila.p.kavulya.intel.com>
-
rasmith authored
[ROCm][P/D][MORI][BugFix] Ensure correct api is used when making requests to prefill / decode nodes (#39835)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Jhao-Ting Chen authored
Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>
-
Khushali Desai authored
Signed-off-by: khushali9 <khushali.desai9@gmail.com>
-
TJian authored
[ROCm] [Wheel] [Bugfix] [Critical] Remove any packages installed from github from rocm.txt e.g `fastsafetensors` as it is incompatible with `uv pip` (#40461)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 21 Apr, 2026 21 commits
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Jakub Zakrzewski authored
[Bugfix][Kernel] nvfp4 cutlass MoE: fix nvfp4 experts quant out-of-bounds read for expert counts not divisible by 4 or 16 (#40351)
Signed-off-by: Jakub Zakrzewski <jzakrzewski@nvidia.com>
-
Fergus authored
Signed-off-by: Fergus <fergus.barratt00@gmail.com>
Signed-off-by: fergus barratt <fergus.barratt00@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
-
Rishi Puri authored
Signed-off-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Signed-off-by: Rishi Puri <puririshi98@berkeley.edu>
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-authored-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Benjamin Chislett <bchislett@nvidia.com>
-
Zijing Liu authored
[MRv2]fix: model accuracy regression caused by reusing the stale last_sampled_tokens and draft_tokens (#39833)
Signed-off-by: Zijing Liu <liuzijing2014@gmail.com>
-
Isotr0py authored
Co-authored-by: Roger Wang <hey@rogerw.io>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: Nick Hill <nickhill123@gmail.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
roikoren755 authored
Default to 'align' mamba cache mode for Mamba-based models when speculative decoding is enabled (#40454)
Signed-off-by: Roi Koren <roik@nvidia.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445)
Signed-off-by: shen-shanshan <467638484@qq.com>
-
xiangdong authored
Signed-off-by: zengxian <xiangdong.zeng@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by: Hollow Man <hollowman@opensuse.org>
-
Yusuf Mohammad authored
Signed-off-by: Yusuf <yusufmohammad@live.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Talor Abramovich authored
Signed-off-by: talora <talora@nvidia.com>
Signed-off-by: Talor Abramovich <talor19@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
artem-spector authored
Signed-off-by: Artem Spector <artems@il.ibm.com>
Signed-off-by: artemspector <artems@il.ibm.com>
Co-authored-by: artemspector <artems@il.ibm.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-