- 19 Jan, 2026 3 commits
-
-
Matthew Bonanni authored
[Attention][MLA] Make FLASHINFER_MLA the default MLA backend on Blackwell, and TRTLLM the default prefill (#32615) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
jiahanc authored
Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
danisereb authored
Signed-off-by:Daniel Serebrenik <daserebrenik@nvidia.com>
-
- 18 Jan, 2026 5 commits
-
-
Iryna Boiko authored
Signed-off-by:Iryna Boiko <iboiko@habana.ai>
-
Andrey Khalyavin authored
Signed-off-by:Andrey Khalyavin <halyavin@yandex-team.ru>
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
tjp_zju authored
Signed-off-by:
tom-zju <tanjianpingzju1990@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <rshaw@neuralmagic.com> Co-authored-by:
Robert Shaw <rshaw@neuralmagic.com>
-
- 17 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 16 Jan, 2026 7 commits
-
-
Hashem Hashemi authored
Signed-off-by:Hashem Hashemi <hashem.hashemi@amd.com>
-
Rabi Mishra authored
Signed-off-by:rabi <ramishra@redhat.com>
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
XiongfeiWei authored
Signed-off-by:Xiongfei Wei <isaacwxf23@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
TomerBN-Nvidia authored
Signed-off-by:
Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Tomer Natan <tbarnatan@ipp1-1429.ipp1a1.colossus.nvidia.com>
-
- 15 Jan, 2026 8 commits
-
-
Yongye Zhu authored
Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix `assert x_s.shape[-1] == x_q.shape[-1] // group_shape[1]` in Blackwell Quantized MoE Test (#32362) Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Dipika Sikka authored
Signed-off-by:
Dipika Sikka <dipikasikka1@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Matthew Bonanni authored
[Attention][MLA] Make `FLASHINFER_MLA` the default MLA backend on Blackwell, and TRTLLM the default prefill (#32339) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
brian033 authored
Signed-off-by:
brian033 <85883730+brian033@users.noreply.github.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Eldar Kurtić <8884008+eldarkurtic@users.noreply.github.com>
-
rasmith authored
[CI][AMD][Quantization][BugFix] Fix fp8 max in quant_utils.py and update test_fp8_quant.::test_static_fp8_quant_group_2d to use correct fp8 dtype and adjust atol/rtol (#32201) Signed-off-by:Randall Smith <ransmith@amd.com>
-
- 14 Jan, 2026 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Yi Liu authored
Signed-off-by:yiliu30 <yi4.liu@intel.com>
-
- 13 Jan, 2026 6 commits
-
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Mickaël Seznec authored
Signed-off-by:
Mickael Seznec <mickael@mistral.ai> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 12 Jan, 2026 6 commits
-
-
xuebwang-amd authored
Signed-off-by:xuebwang-amd <xuebwang@amd.com>
-
Lucas Kabela authored
Signed-off-by:Lucas Kabela <lucaskabela@meta.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
danielafrimi authored
Signed-off-by:
dafrimi <dafrimi@nvidia.com> Signed-off-by: <> Co-authored-by:
root <root@gpu-267.slurm-workers-slurm.slurm.svc.cluster.local> Co-authored-by:
root <root@gpu-537.slurm-workers-slurm.slurm.svc.cluster.local> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
root <root@pool0-01777.cm.cluster>
-
Hongxin Xu authored
Signed-off-by:
xhx1022 <1737006628@qq.com> Signed-off-by:
Hongxin Xu <70438206+xhx1022@users.noreply.github.com> Signed-off-by:
arlenxu <arlenxu@tencent.com> Co-authored-by:
22quinn <33176974+22quinn@users.noreply.github.com> Co-authored-by:
arlenxu <arlenxu@tencent.com>
-
- 11 Jan, 2026 1 commit
-
-
Matt authored
Signed-off-by:Matthew Wong <Matthew.Wong2@amd.com>
-
- 10 Jan, 2026 1 commit
-
-
Vensen authored
Signed-off-by:vensen <vensenmu@gmail.com>
-