- 10 Apr, 2026 8 commits
-
-
PatchyTIS authored
Signed-off-by: PatchouliTaisa <patchychen@tencent.com>
Co-authored-by: PatchouliTaisa <patchychen@tencent.com>
-
Hexiang Wang authored
Signed-off-by: whx-sjtu <2952154980@qq.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
milesial authored
Signed-off-by: milesial <milesial@users.noreply.github.com>
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
Kyungmin Lee authored
Signed-off-by: lkm2835 <lkm2835@gmail.com>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
Signed-off-by: Kunshang Ji <jikunshang95@gmail.com>
-
Ibrahim Arshad authored
Signed-off-by: Ibrahim Arshad <38925737+ibrahim1023@users.noreply.github.com>
-
Artem Perevedentsev authored
Signed-off-by: Artem Perevedentsev <aperevedents@nvidia.com>
-
- 09 Apr, 2026 11 commits
-
-
Ekagra Ranjan authored
Signed-off-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
PikaPikachu authored
Signed-off-by: kangletian <Letian.Kang@amd.com>
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
Andrii Skliar authored
Signed-off-by: Andrii Skliar <askliar@nvidia.com>
Co-authored-by: Andrii Skliar <askliar@nvidia.com>
-
Yongye Zhu authored
-
Ilya Boytsov authored
Signed-off-by: Ilya Boytsov <ilyaboytsov1805@gmail.com>
-
Wei Zhao authored
Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
Dipika Sikka authored
Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
-
Maral authored
[W8A8 Block Linear Refactor][2/N] Remove W8A8Fp8BlockLinearOp and adopt Fp8 block linear kernel selections. (#33892)
Signed-off-by: maral <maralbahari.98@gmail.com>
Signed-off-by: Maral <maralbahari.98@gmail.com>
-
Benjamin Chislett authored
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: Benjamin Chislett <chislett.ben@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 08 Apr, 2026 7 commits
-
-
Kai Song authored
Signed-off-by: Song Kai <songkai05@baidu.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Jackmin801 authored
Signed-off-by: Jackmin801 <ongjackm@gmail.com>
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Roberto L. Castro authored
[Perf][Kernel] Persistent TopK scheduler: unified CUDAGraph-safe kernel with dynamic per-row dispatch - DeepSeek-V3.2 DSA decode (#37421)
Signed-off-by: LopezCastroRoberto <rocastro@redhat.com>
Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
rasmith authored
[CI][AMD][BugFix][Kernel] Cast induction variable to int64 on MI350 for chunk_gated_delta_rule_fwd_kernel_h_blockdim64 to avoid illegal memory access (#39087)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Andrey Talman authored
-
zofia authored
Signed-off-by: Zhu, Zufang <zufang.zhu@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
- 07 Apr, 2026 12 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Yubo Wang authored
Signed-off-by: Yubo Wang <yubowang2019@gmail.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Chendi.Xue authored
Signed-off-by: Chendi Xue <chendi.xue@intel.com>
Co-authored-by: Kunshang Ji <kunshang.ji@intel.com>
-
Jinzhen Lin authored
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
-
kkyyxhll authored
[Bugfix][Quantization] Fix PerTensorScale loading with tuple shard_id in MergedColumnParallelLinear (#38517)
Signed-off-by: loukang <loukang@xiaohongshu.com>
-
Wei Zhao authored
Signed-off-by: wzhao18 <wzhao18.sz@gmail.com>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
Signed-off-by: Jiangyun Zhu <riverclouds.zhu@qq.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Rishapveer Singh authored
Signed-off-by: Rishapveer Singh <215205492+rishaps@users.noreply.github.com>
-
Netanel Haber authored
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
- 06 Apr, 2026 2 commits
-
-
fxmarty-amd authored
[NVFP4] Support NVFP4 dense models from `modelopt` and `compressed-tensors` on AMD Instinct MI300, MI355X and Hopper through emulation (#35733)
Signed-off-by: Felix Marty <Felix.Marty@amd.com>
Signed-off-by: fxmarty-amd <felmarty@amd.com>
Co-authored-by: Kyle Sayers <kylesayrs@gmail.com>
-
Netanel Haber authored
NemotronH default mamba_ssm_cache_dtype=float32; enable auto-hook for NemotronHNanoVLV2Config (#39032)
Signed-off-by: Netanel Haber <58652339+netanel-haber@users.noreply.github.com>
-