- 11 Mar, 2026 1 commit
-
-
laibao authored
-
- 09 Mar, 2026 1 commit
-
-
yangql authored
-
- 03 Mar, 2026 1 commit
-
-
zhuwenwen authored
-
- 06 Feb, 2026 1 commit
-
-
zhuwenwen authored
set fp8_e4m3 only supported on nmz and support q&kvcache fp8 set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 04 Feb, 2026 2 commits
- 02 Feb, 2026 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> (cherry picked from commit 0a3c71e7)
-
- 30 Jan, 2026 1 commit
-
-
zhuwenwen authored
add prepare_so_files to prepare so
-
- 29 Jan, 2026 1 commit
-
-
zhuwenwen authored
not supported FlashMLASchedMeta
-
- 28 Jan, 2026 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:
NickLucche <nlucches@redhat.com> (cherry picked from commit 1f3a2c29)
-
zhuwenwen authored
-
- 27 Jan, 2026 1 commit
-
-
Strahinja Stamenkovic authored
Signed-off-by:sstamenk <strahinja.stamenkovic@amd.com>
-
- 24 Jan, 2026 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Luka Govedič <luka.govedic@gmail.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Varun Sundar Rabindranath <varunsundar08@gmail.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Luka Govedič <luka.govedic@gmail.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Luka Govedič <lgovedic@redhat.com>
-
- 23 Jan, 2026 6 commits
-
-
Markus / Mark authored
Signed-off-by:
marksverdhei <marksverdhei@hotmail.com> Signed-off-by:
Markus / Mark <46672778+marksverdhei@users.noreply.github.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Harry Huang authored
Signed-off-by:
huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
tianshu-Michael-yu authored
Signed-off-by:Tianshu Yu <tianshuyu.formal@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 22 Jan, 2026 3 commits
-
-
Eldar Kurtić authored
Signed-off-by:
Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com> Signed-off-by:
eldarkurtic <8884008+eldarkurtic@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 21 Jan, 2026 6 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Robert Shaw authored
-
zhuwenwen authored
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 19 Jan, 2026 2 commits
-
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
Vadim Gimpelson authored
[BUGFIX] Fix degenerate strides in TRTLLM query tensors for FlashInfer backend. Fixes issue #32353 (#32417) Signed-off-by:Vadim Gimpelson <vadim.gimpelson@gmail.com>
-
- 18 Jan, 2026 1 commit
-
-
Li Xie authored
Signed-off-by:xieli <xieli@stepfun.com>
-
- 17 Jan, 2026 1 commit
-
-
Guofang.Tang authored
Signed-off-by:
Guofang Tang <tinggofun@gmail.com> Co-authored-by:
Guofang Tang <tinggofun@gmail.com>
-
- 16 Jan, 2026 3 commits
-
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
zhuwenwen authored
fix _forward_encoder_attention remove medusa set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-
vllmellm authored
[Bugfix][ROCm][performance] Resolve the performance regression issue of the Qwen3-Next-80B-A3B-Thinking under rocm_atten (#32336) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> (cherry picked from commit e27078ea)
-
- 15 Jan, 2026 2 commits
-
-
Matthias Gehre authored
Signed-off-by:Matthias Gehre <matthias.gehre@amd.com>
-
Pleaplusone authored
[ROCm][Perf] Enable shuffle kv cache layout and assembly paged attention kernel for `AiterFlashAttentionBackend` (#29887) Signed-off-by:ganyi <ygan@amd.com>
-
- 14 Jan, 2026 1 commit
-
-
vllmellm authored
[Bugfix][ROCm][performance] Resolve the performance regression issue of the Qwen3-Next-80B-A3B-Thinking under rocm_atten (#32336) Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 13 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-