- 03 Mar, 2026 1 commit
-
-
zhuwenwen authored
-
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 28 Jan, 2026 1 commit
-
-
zhuwenwen authored
-
- 27 Jan, 2026 1 commit
-
-
Paco Xu authored
Signed-off-by:Paco Xu <paco.xu@daocloud.io>
-
- 24 Jan, 2026 1 commit
-
-
monajafi-amd authored
Signed-off-by:mohammad najafi <mohammad.najafi@amd.com>
-
- 23 Jan, 2026 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Markus / Mark authored
Signed-off-by:
marksverdhei <marksverdhei@hotmail.com> Signed-off-by:
Markus / Mark <46672778+marksverdhei@users.noreply.github.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
Claude Opus 4.5 <noreply@anthropic.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 22 Jan, 2026 4 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Matt authored
Signed-off-by:Matthew Wong <Matthew.Wong2@amd.com>
-
Alex Sun authored
Signed-off-by:Alex Sun <alex.s@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 21 Jan, 2026 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 19 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
[Attention][MLA] Make FLASHINFER_MLA the default MLA backend on Blackwell, and TRTLLM the default prefill (#32615) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 17 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 16 Jan, 2026 3 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
区分pcie和hglink custom allreduce的使用 vllm:export VLLM_CUSTOM_CACHE=1 dtk:export HIP_KERNEL_EVENT_SYSTENFENCE=1 set VLLM_USE_FUSED_RMS_ROPE=1 add SUPPORT_MOE_MARLIN_W16A16 to use moe marlin on bw support fa kvcache fp8 (todo: add VLLM_USE_QUERY_QUANT to not use q quant) update moe_align_block_size
-
zhuwenwen authored
fix _forward_encoder_attention remove medusa set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-
- 15 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
[Attention][MLA] Make `FLASHINFER_MLA` the default MLA backend on Blackwell, and TRTLLM the default prefill (#32339) Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 14 Jan, 2026 2 commits
-
-
Shanshan Shen authored
Signed-off-by:shen-shanshan <467638484@qq.com>
-
Hongxia Yang authored
Signed-off-by:Hongxia Yang <hongxia.yang@amd.com>
-
- 11 Jan, 2026 1 commit
-
-
Matt authored
Signed-off-by:Matthew Wong <Matthew.Wong2@amd.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 3 commits
-
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
sihao_li authored
Signed-off-by:sihao.li <sihao.li@intel.com>
-
weiyu authored
Signed-off-by:
Wei-Yu Lin <weiyulin@google.com> Signed-off-by:
weiyu <62784299+weiyu0824@users.noreply.github.com>
-
- 06 Jan, 2026 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 05 Jan, 2026 3 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
zzzzwwjj authored
Signed-off-by:
zzzzwwjj <1183291235@qq.com> Signed-off-by:
zzzzwwjj <34335947+zzzzwwjj@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
wangxiyuan authored
Signed-off-by:wangxiyuan <wangxiyuan1007@gmail.com>
-
- 31 Dec, 2025 1 commit
-
-
SameerAsal authored
Signed-off-by:
SameerAsal <SameerAsal@users.noreply.github.com> Co-authored-by:
SameerAsal <SameerAsal@users.noreply.github.com>
-
- 30 Dec, 2025 2 commits
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
Pleaplusone authored
[ROCm][Bugfix] Fix accuracy issue on fmoe when `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` enabled (#31523) Signed-off-by:ganyi <ygan@amd.com>
-
- 25 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 24 Dec, 2025 1 commit
-
-
sihao_li authored
Signed-off-by:
sihao.li <sihao.li@intel.com> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 23 Dec, 2025 1 commit
-
-
Yan Ma authored
[XPU] decrease IGC_ForceOCLSIMDWidth for speculative decoding triton-xpu kernel compilation (#30538) Signed-off-by:Yan Ma <yan.ma@intel.com>
-
- 22 Dec, 2025 1 commit
-
-
Kevin McKay authored
Signed-off-by:
c0de128 <kevin.mckay@outlook.com> Co-authored-by:
Claude Sonnet 4.5 <noreply@anthropic.com>
-
- 19 Dec, 2025 2 commits
-
-
zhuwenwen authored
-
Andreas Karatzas authored
Signed-off-by:Andreas Karatzas <akaratza@amd.com>
-
- 18 Dec, 2025 1 commit
-
-
Fanli Lin authored
Signed-off-by:Fanli Lin <fanli.lin@intel.com>
-
- 16 Dec, 2025 1 commit
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-