- 23 Jan, 2026 4 commits
-
-
Matt authored
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>
-
Fadi Arafeh authored
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
-
Karan Bansal authored
Signed-off-by: Karan Bansal <karanb192@gmail.com>
-
Luka Govedič authored
[torch.compile] Compile `CustomOp.forward_native` for `SiluAndMul` and `QuantFP8` to avoid raw torch ops inside opaque custom ops (#32806)
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Signed-off-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 22 Jan, 2026 6 commits
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Xin Yang authored
Signed-off-by: Xin Yang <xyangx@amazon.com>
-
Eldar Kurtić authored
Signed-off-by: Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com>
Signed-off-by: eldarkurtic <8884008+eldarkurtic@users.noreply.github.com>
-
Richard Zou authored
Signed-off-by: Richard Zou <zou3519@gmail.com>
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
Alex Sun authored
Signed-off-by: Alex Sun <alex.s@amd.com>
-
- 21 Jan, 2026 6 commits
-
-
Xin Yang authored
Signed-off-by: Xin Yang <xyangx@amazon.com>
-
elvischenv authored
Signed-off-by: elvischenv <219235043+elvischenv@users.noreply.github.com>
-
whx authored
Signed-off-by: whx-sjtu <2952154980@qq.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <robshaw@redhat.com>
Co-authored-by: Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 20 Jan, 2026 2 commits
-
-
linhaifeng authored
Signed-off-by: linhaifeng <1371675203@qq.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
- 19 Jan, 2026 2 commits
-
-
Yanan Cao authored
Signed-off-by: Yanan Cao <gmagogsfm@gmail.com>
-
Matt authored
Signed-off-by: Matthew Wong <Matthew.Wong2@amd.com>
-
- 18 Jan, 2026 2 commits
-
-
bnellnm authored
Signed-off-by: Bill Nell <bnell@redhat.com>
-
Robert Shaw authored
Signed-off-by: Robert Shaw <rshaw@neuralmagic.com>
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
-
- 16 Jan, 2026 3 commits
-
-
Hashem Hashemi authored
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
-
rasmith authored
Signed-off-by: Randall Smith <ransmith@amd.com>
-
TomerBN-Nvidia authored
Signed-off-by: Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Tomer Natan <tbarnatan@computelab-frontend-8.nvidia.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Tomer Natan <tbarnatan@ipp1-1429.ipp1a1.colossus.nvidia.com>
-
- 15 Jan, 2026 2 commits
-
-
rasmith authored
Signed-off-by: Randall Smith <ransmith@amd.com>
Co-authored-by: Randall Smith <ransmith@amd.com>
-
rasmith authored
[CI][AMD][Quantization][BugFix] Fix fp8 max in quant_utils.py and update test_fp8_quant.::test_static_fp8_quant_group_2d to use correct fp8 dtype and adjust atol/rtol (#32201)
Signed-off-by: Randall Smith <ransmith@amd.com>
-
- 14 Jan, 2026 1 commit
-
-
Hongxia Yang authored
Signed-off-by: Hongxia Yang <hongxia.yang@amd.com>
-
- 13 Jan, 2026 2 commits
-
-
Roberto L. Castro authored
Signed-off-by: LopezCastroRoberto <roberto.lopez.castro@udc.es>
Signed-off-by: Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com>
Signed-off-by: LopezCastroRoberto <rocastro@redhat.com>
-
Rabi Mishra authored
Signed-off-by: rabi <ramishra@redhat.com>
-
- 12 Jan, 2026 1 commit
-
-
danielafrimi authored
Signed-off-by: dafrimi <dafrimi@nvidia.com>
Signed-off-by: <>
Co-authored-by: root <root@gpu-267.slurm-workers-slurm.slurm.svc.cluster.local>
Co-authored-by: root <root@gpu-537.slurm-workers-slurm.slurm.svc.cluster.local>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: root <root@pool0-01777.cm.cluster>
-
- 11 Jan, 2026 1 commit
-
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
- 10 Jan, 2026 2 commits
-
-
shyeh25 authored
Signed-off-by: shyeh25 <206795756+shyeh25@users.noreply.github.com>
-
PatrykSaffer authored
Signed-off-by: Patryk Saffer <patryk.saffer99@gmail.com>
Signed-off-by: PatrykSaffer <patryk.saffer@mistral.ai>
Co-authored-by: Patryk Saffer <patryk.saffer99@gmail.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 09 Jan, 2026 4 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Runkai Tao authored
Signed-off-by: Runkai Tao <rt572@physics.rutgers.edu>
Co-authored-by: Jee Jee Li <pandaleefree@gmail.com>
-
vllmellm authored
[Bugfix][ROCm]Fix Qwen3-Next-80B-A3B-Thinking inference and optimize non-standard block size (544) support under rocm_atten (#31380)
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
- 08 Jan, 2026 2 commits
-
-
Lucas Wilkinson authored
[Misc] Fix `Current vLLM config is not set.` warnings, assert to avoid issues in the future (#31747)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Michael Goin authored
-