- 08 Apr, 2026 4 commits
-
-
Andrey Talman authored
-
Md. Mekayel Anik authored
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@inferact.ai>
-
zofia authored
Signed-off-by:
Zhu, Zufang <zufang.zhu@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 07 Apr, 2026 20 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Yubo Wang authored
Signed-off-by:Yubo Wang <yubowang2019@gmail.com>
-
Flora Feng authored
Signed-off-by:sfeng33 <4florafeng@gmail.com>
-
rasmith authored
Signed-off-by:Randall Smith <Randall.Smith@amd.com>
-
Chendi.Xue authored
Signed-off-by:
Chendi Xue <chendi.xue@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <jinzhen.ljz@antgroup.com>
-
ibifrost authored
Signed-off-by:
wuchenxin <wuchenxin.wcx@alibaba-inc.com> Signed-off-by:
ibifrost <47308427+ibifrost@users.noreply.github.com> Co-authored-by:
Simon Mo <simon.mo@hey.com>
-
kkyyxhll authored
[Bugfix][Quantization] Fix PerTensorScale loading with tuple shard_id in MergedColumnParallelLinear (#38517) Signed-off-by:loukang <loukang@xiaohongshu.com>
-
maobaolong authored
Signed-off-by:
baoloongmao <baoloongmao@tencent.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com>
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Julien Denize <40604584+juliendenize@users.noreply.github.com>
-
Ronen Schaffer authored
Signed-off-by:Ronen Schaffer <ronen.schaffer@ibm.com>
-
Wei Zhao authored
Signed-off-by:wzhao18 <wzhao18.sz@gmail.com>
-
Rohan Potdar authored
Signed-off-by:Rohan138 <rohanpotdar138@gmail.com>
-
Jiangyun Zhu authored
Signed-off-by:
zjy0516 <riverclouds.zhu@qq.com> Signed-off-by:
Jiangyun Zhu <riverclouds.zhu@qq.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Rishapveer Singh authored
Signed-off-by:Rishapveer Singh <215205492+rishaps@users.noreply.github.com>
-
Netanel Haber authored
Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Andrew Barnes authored
Signed-off-by:Bortlesboat <bortstheboat@gmail.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
- 06 Apr, 2026 16 commits
-
-
fxmarty-amd authored
[NVFP4] Support NVFP4 dense models from `modelopt` and `compressed-tensors` on AMD Instinct MI300, MI355X and Hopper through emulation (#35733) Signed-off-by:
Felix Marty <Felix.Marty@amd.com> Signed-off-by:
fxmarty-amd <felmarty@amd.com> Co-authored-by:
Kyle Sayers <kylesayrs@gmail.com>
-
Matthew Bonanni authored
-
Woosuk Kwon authored
Signed-off-by:
WoosukKwon <woosuk.kwon@berkeley.edu> Signed-off-by:
Woosuk Kwon <woosuk@inferact.ai>
-
Netanel Haber authored
NemotronH default mamba_ssm_cache_dtype=float32; enable auto-hook for NemotronHNanoVLV2Config (#39032) Signed-off-by:Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Yongye Zhu authored
Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
namgyu-youn authored
Signed-off-by:namgyu-youn <namgyu.dev@gmail.com>
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
zhanqiuhu authored
-
bnellnm authored
Signed-off-by:
Bill Nell <bnell@redhat.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Frederik Gossen authored
Signed-off-by:Frederik Gossen <frgossen@meta.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Lukas Geiger authored
[Models][GDN] Remove GPU/CPU syncs in `GDNAttentionMetadata.build` during speculative decoding (#38047) Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Walter Beller-Morales authored
Signed-off-by:walterbm <walter.beller.morales@gmail.com>
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
bhargav-patel-29 authored
Signed-off-by:
bhargav-patel-29 <bhargav.patel@tihiitb.org> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-