- 10 Feb, 2026 1 commit
-
-
lixh authored
-
- 08 Feb, 2026 1 commit
-
-
王敏 authored
-
- 06 Feb, 2026 4 commits
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 04 Feb, 2026 2 commits
-
-
zhuwenwen authored
-
Michael Goin authored
Signed-off-by:Robert Shaw <rshaw@neuralmagic.com>
-
- 03 Feb, 2026 1 commit
-
-
Zhewen Li authored
Signed-off-by:
zhewenli <zhewen@inferact.ai> Co-authored-by:
zhewenli <zhewen@inferact.ai>
-
- 02 Feb, 2026 2 commits
-
-
René Honig authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com> (cherry picked from commit 07978117)
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> (cherry picked from commit 31aedfe7)
-
- 28 Jan, 2026 2 commits
- 27 Jan, 2026 1 commit
-
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Signed-off-by:
Amir Klein <203507526+amirkl94@users.noreply.github.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
amirkl94 <203507526+amirkl94@users.noreply.github.com>
-
- 26 Jan, 2026 2 commits
-
-
Pengchao Wang authored
Signed-off-by:Pengchao Wang <wpc@fb.com>
-
danielafrimi authored
Signed-off-by:
dafrimi <dafrimi@nvidia.com> Co-authored-by:
root <root@gpu-51.slurm-workers-slurm.slurm.svc.cluster.local>
-
- 25 Jan, 2026 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:LopezCastroRoberto <rocastro@redhat.com>
-
- 23 Jan, 2026 5 commits
-
-
joninco authored
Signed-off-by:jon <joninco@bullpoint.org>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Luka Govedič authored
[torch.compile] Compile `CustomOp.forward_native` for `SiluAndMul` and `QuantFP8` to avoid raw torch ops inside opaque custom ops (#32806) Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Signed-off-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 22 Jan, 2026 4 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
Xin Yang authored
Signed-off-by:Xin Yang <xyangx@amazon.com>
-
Eldar Kurtić authored
Signed-off-by:
Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com> Signed-off-by:
eldarkurtic <8884008+eldarkurtic@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 21 Jan, 2026 5 commits
-
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
Robert Shaw authored
-
- 20 Jan, 2026 3 commits
-
-
Vasiliy Kuznetsov authored
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 19 Jan, 2026 1 commit
-
-
jiahanc authored
Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Signed-off-by:
Robert Shaw <robshaw@redhat.com> Co-authored-by:
Robert Shaw <robshaw@redhat.com>
-
- 18 Jan, 2026 2 commits
-
-
bnellnm authored
Signed-off-by:Bill Nell <bnell@redhat.com>
-
tjp_zju authored
Signed-off-by:
tom-zju <tanjianpingzju1990@gmail.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 16 Jan, 2026 2 commits
-
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
zhuwenwen authored
fix _forward_encoder_attention remove medusa set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-