- 22 Apr, 2026 1 commit
-
-
Ekagra Ranjan authored
Signed-off-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 21 Apr, 2026 8 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
roikoren755 authored
Default to 'align' mamba cache mode for Mamba-based models when speculative decoding is enabled (#40454) Signed-off-by:Roi Koren <roik@nvidia.com>
-
Shanshan Shen authored
[MM][CG] Optimize default `max_frames_per_batch` auto-infer for ViT CUDA graph video inference (#40445) Signed-off-by:shen-shanshan <467638484@qq.com>
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:Hollow Man <hollowman@opensuse.org>
-
artem-spector authored
Signed-off-by:
Artem Spector <artems@il.ibm.com> Signed-off-by:
artemspector <artems@il.ibm.com> Co-authored-by:
artemspector <artems@il.ibm.com> Co-authored-by:
Claude Opus 4.6 <noreply@anthropic.com>
-
Luciano Martins authored
Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com>
-
- 20 Apr, 2026 2 commits
-
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Julien Denize authored
Signed-off-by:
Julien Denize <julien.denize@mistral.ai> Signed-off-by:
juliendenize <julien.denize@mistral.ai>
-
- 19 Apr, 2026 1 commit
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 18 Apr, 2026 1 commit
-
-
Rishapveer Singh authored
Signed-off-by:Rishapveer Singh <singhrishapveer@gmail.com>
-
- 17 Apr, 2026 2 commits
-
-
allgather authored
Signed-off-by:
allgather <all2allops@gmail.com> Co-authored-by:
mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Lukas Geiger authored
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
- 16 Apr, 2026 4 commits
-
-
Netanel Haber authored
Bugfix: Parakeet: `.conv.pointwise/depthwise_conv1/2.bias weigths` can exist even if `convolution_bias=False` (#40007) Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
grYe99 authored
Signed-off-by:
grYe99 <guorongye99@gmail.com> Co-authored-by:
grYe99 <guorongye99@gmail.com>
-
lalit10 authored
Signed-off-by:Lalit Laxminarayan Bangad <lalitbangad@gmail.com>
-
Abhijit Roy authored
Signed-off-by:
Abhijit <abroy@redhat.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 15 Apr, 2026 7 commits
-
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
khluu <khluu000@gmail.com> Signed-off-by:
Kevin H. Luu <khluu000@gmail.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
khluu <khluu000@gmail.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
jiang1.li <jiang1.li@intel.com>
-
Luciano Martins authored
Signed-off-by:
Luciano Martins <lucianommartins@users.noreply.github.com> Co-authored-by:
Luciano Martins <lucianommartins@users.noreply.github.com>
-
Collin McCarthy authored
Signed-off-by:Collin McCarthy <cmccarthy@nvidia.com>
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
zhanqiuhu authored
Signed-off-by:Zhanqiu Hu <zhu@redhat.com>
-
danielafrimi authored
Signed-off-by: <> Co-authored-by:root <root@lyris0144.lyris.clusters.nvidia.com>
-
Yan Ma authored
Signed-off-by:Yan Ma <yan.ma@intel.com>
-
- 14 Apr, 2026 7 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
roikoren755 authored
Signed-off-by:Roi Koren <roik@nvidia.com>
-
Hexiang Wang authored
Signed-off-by:whx-sjtu <2952154980@qq.com>
-
bhargav-patel-29 authored
[Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model (#39707) Signed-off-by:
bhargav-patel-29 <bhargav.patel@tihiitb.org> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Thomas authored
Signed-off-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
thomasmaindron <thomasmaindron@users.noreply.github.com> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com>
-
Shanshan Shen authored
Signed-off-by:
shen-shanshan <467638484@qq.com> Signed-off-by:
Shanshan Shen <87969357+shen-shanshan@users.noreply.github.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
lalit10 authored
Signed-off-by:
Lalit Laxminarayan Bangad <lalitbangad@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 13 Apr, 2026 4 commits
-
-
Netanel Haber authored
Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Pedram Razavi authored
Signed-off-by:Pedram Razavi <pedram.razavi@gmail.com>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Jesus Federico authored
Signed-off-by:
Jesus Federico <jefp@amazon.com> Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Co-authored-by:
Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by:
wang.yuqi <yuqi.wang@daocloud.io>
-
- 12 Apr, 2026 1 commit
-
-
r266-tech authored
Signed-off-by:
r266-tech <r266.tech@gmail.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
- 11 Apr, 2026 2 commits
-
-
ShubyM authored
Signed-off-by:ShubyM <shubymishra20@gmail.com>
-
Lee Yongjun authored
Signed-off-by:leeyongjun <jqueen.astro@gmail.com>
-