- 06 Apr, 2026 11 commits
-
-
fxmarty-amd authored
[NVFP4] Support NVFP4 dense models from `modelopt` and `compressed-tensors` on AMD Instinct MI300, MI355X and Hopper through emulation (#35733)
Signed-off-by: Felix Marty <Felix.Marty@amd.com>
Signed-off-by: fxmarty-amd <felmarty@amd.com>
Co-authored-by: Kyle Sayers <kylesayrs@gmail.com>
-
Netanel Haber authored
NemotronH default mamba_ssm_cache_dtype=float32; enable auto-hook for NemotronHNanoVLV2Config (#39032)
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Yongye Zhu authored
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.io>
-
namgyu-youn authored
Signed-off-by: namgyu-youn <namgyu.dev@gmail.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
bhargav-patel-29 authored
Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
liuchenbing2026 authored
Signed-off-by: liuchenbing <chenliumail@163.com>
Signed-off-by: liucb <liuchengbao_work@163.com>
Co-authored-by: liuchenbing <chenliumail@163.com>
-
Andreas Karatzas authored
[ROCm][Quantization] Add asymmetric INT8 quantization support to TritonInt8ScaledMMLinearKernel (#38501)
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
- 05 Apr, 2026 5 commits
-
-
Netanel Haber authored
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Greg Pereira authored
Signed-off-by: greg pereira <grpereir@redhat.com>
Signed-off-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Wei Zhao authored
Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Martin Vit authored
Signed-off-by: Martin Vit <martin@voipmonitor.org>
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
- 04 Apr, 2026 6 commits
-
-
Robert Shaw authored
-
Xiaoshuang Wang authored
Signed-off-by: Icey <1790571317@qq.com>
-
Artem Perevedentsev authored
Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Linkun authored
Signed-off-by: Linkun Chen <github@lkchen.net>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
lalit10 authored
Signed-off-by: Lalit Laxminarayan Bangad <lalitbangad@gmail.com>
Co-authored-by: Lalit Laxminarayan Bangad <lalitbangad@meta.com>
-
Yongye Zhu authored
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
-
- 03 Apr, 2026 16 commits
-
-
elenalil-aws authored
Signed-off-by: elenalil-aws <elenalil@amazon.com>
-
yzong-rh authored
Signed-off-by: Yifan Zong <yzong@redhat.com>
-
Vasiliy Kuznetsov authored
Signed-off-by: Vasiliy Kuznetsov <vasiliy@meta.com>
-
Yusuf Mohammad authored
Signed-off-by: yusuf <yusuf@deeplearningmachine.mynet>
Signed-off-by: <>
Co-authored-by: yusuf <yusuf@deeplearningmachine.mynet>
-
Qiming Zhang authored
Signed-off-by: mayuyuace <qiming1.zhang@intel.com>
Signed-off-by: Qiming Zhang <qiming1.zhang@intel.com>
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
Artem Perevedentsev authored
Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
Signed-off-by: Vadim Gimpelson <156319763+vadiklyutiy@users.noreply.github.com>
-
Mieszko Dziadowiec authored
Signed-off-by: Mieszko Dziadowiec <mdziadowiec@habana.ai>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Stefano Castagnetta <scastagnetta@nvidia.com>
Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
Netanel Haber authored
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Aaron Hao authored
Signed-off-by: ahao-anyscale <ahao@anyscale.com>
-
Varun Sundar Rabindranath authored
Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Bowen Bao authored
Signed-off-by: Bowen Bao <bowenbao@amd.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Carl Y authored
Signed-off-by: Carl You <4531192+carlyou@users.noreply.github.com>
Signed-off-by: Carl Y <4531192+carlyou@users.noreply.github.com>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
-
Yan Ma authored
Signed-off-by: Yan Ma <yan.ma@intel.com>
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
Co-authored-by: Chendi Xue <chendi.xue@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
1096125073 authored
Signed-off-by: xiayongqiang <xiayq1@chinatelecom.cn>
Co-authored-by: xiayongqiang <xiayq1@chinatelecom.cn>
-
- 02 Apr, 2026 2 commits
-
-
Nicolò Lucchesi authored
-
Luciano Martins authored
feat(models): implement Google Gemma 4 architecture support (MoE, Multimodal, Reasoning, Tool-Use) (#38826)
Signed-off-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Signed-off-by: Luciano Martins <lucianomartins@google.com>
Co-authored-by: Luciano Martins <lucianommartins@users.noreply.github.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-