- 16 Jan, 2026 1 commit
-
-
TomerBN-Nvidia authored
Signed-off-by: Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Tomer Natan <tbarnatan@ipp1-1429.ipp1a1.colossus.nvidia.com>
-
- 15 Jan, 2026 9 commits
-
-
Yongye Zhu authored
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
Aleksandr Malyshev authored
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix `assert x_s.shape[-1] == x_q.shape[-1] // group_shape[1]` in Blackwell Quantized MoE Test (#32362)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Dipika Sikka authored
Signed-off-by: Dipika Sikka <dipikasikka1@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Matthew Bonanni authored
[Attention][MLA] Make `FLASHINFER_MLA` the default MLA backend on Blackwell, and TRTLLM the default prefill (#32339)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
brian033 authored
Signed-off-by: brian033 <85883730+brian033@users.noreply.github.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Eldar Kurtić <8884008+eldarkurtic@users.noreply.github.com>
-
rasmith authored
[CI][AMD][Quantization][BugFix] Fix fp8 max in quant_utils.py and update test_fp8_quant.::test_static_fp8_quant_group_2d to use correct fp8 dtype and adjust atol/rtol (#32201)
Signed-off-by: Randall Smith <ransmith@amd.com>
-
- 14 Jan, 2026 5 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
sangho.lee authored
Signed-off-by: sanghol <sanghol@allenai.org>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Yi Liu authored
Signed-off-by: yiliu30 <yi4.liu@intel.com>
-
- 13 Jan, 2026 9 commits
-
-
Roberto L. Castro authored
Signed-off-by: LopezCastroRoberto <roberto.lopez.castro@udc.es>
Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
Signed-off-by: LopezCastroRoberto <rocastro@redhat.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Mickaël Seznec authored
Signed-off-by: Mickael Seznec <mickael@mistral.ai>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
YunzhuLu authored
Signed-off-by: YunzhuLu <lucia.yunzhu@gmail.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 12 Jan, 2026 13 commits
-
-
xuebwang-amd authored
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
-
Vadim Gimpelson authored
Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
Lucas Kabela authored
Signed-off-by: Lucas Kabela <lucaskabela@meta.com>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
Kyungmin Lee authored
Signed-off-by: lkm2835 <lkm2835@gmail.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
danielafrimi authored
Signed-off-by: dafrimi <dafrimi@nvidia.com>
Signed-off-by: <>
Co-authored-by: root <root@gpu-267.slurm-workers-slurm.slurm.svc.cluster.local>
Co-authored-by: root <root@gpu-537.slurm-workers-slurm.slurm.svc.cluster.local>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: root <root@pool0-01777.cm.cluster>
-
Jaehyun An authored
Signed-off-by: Jaehyun An <steve.ai@kakaocorp.com>
-
Kyungmin Lee authored
Signed-off-by: lkm2835 <lkm2835@gmail.com>
Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: lgai-exaone <exaonemodels@lgresearch.ai>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Hongxin Xu authored
Signed-off-by: xhx1022 <1737006628@qq.com>
Signed-off-by: Hongxin Xu <70438206+xhx1022@users.noreply.github.com>
Signed-off-by: arlenxu <arlenxu@tencent.com>
Co-authored-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: arlenxu <arlenxu@tencent.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 11 Jan, 2026 3 commits
-
-
maang authored
Signed-off-by: maang <maang_h@163.com>
Signed-off-by: maang-h <55082429+maang-h@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Matt authored
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>
-
Andy Liu authored
[MTP][GLM][Bugfix] Fixed .weight_scale loading logic that dropped MTP prediction accuracy with fp8+mtp (#32101)
Signed-off-by: Andy Liu <andyliu@roblox.com>
-