- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 4 commits
-
-
zhuwenwen authored
-
ℍ𝕠𝕝𝕝𝕠𝕨 𝕄𝕒𝕟 authored
Signed-off-by:Hollow Man <hollowman@opensuse.org>
-
zhuwenwen authored
-
zhuwenwen authored
-
- 06 Jan, 2026 2 commits
- 24 Dec, 2025 2 commits
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 19 Dec, 2025 2 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
zhuwenwen authored
-
- 18 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 17 Dec, 2025 2 commits
-
-
baoqian426 authored
Signed-off-by:
baoqian <1354987947@qq.com> Signed-off-by:
baoqian426 <1354987947@qq.com>
-
zhuwenwen authored
修复CompressedTensorsLinearMethod中的w4a16的冲突问题 feat(moe): add Marlin W16A16 fused MoE behind VLLM_USE_MARLIN_W16A16_MOE replace the fp8_mqa_logits and fp8_paged_mqa_logits interfaces in deepgemm with mqa_logits and paged_mqa_logits from lightop
-
- 12 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 11 Dec, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 08 Dec, 2025 1 commit
-
-
Daniel Cámpora authored
Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 02 Dec, 2025 3 commits
-
-
Julien Denize authored
Signed-off-by:
juliendenize <julien.denize@mistral.ai> (cherry picked from commit 5e5646e2)
-
Julien Denize authored
Signed-off-by:juliendenize <julien.denize@mistral.ai>
-
Julien Denize authored
Signed-off-by:
Julien Denize <julien.denize@mistral.ai> Signed-off-by:
Julien Denize <40604584+juliendenize@users.noreply.github.com> Signed-off-by:
Mickael Seznec <mickael@mistral.ai> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
Mickael Seznec <mickael@mistral.ai>
-
- 30 Nov, 2025 1 commit
-
-
王敏 authored
-
- 26 Nov, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 24 Nov, 2025 1 commit
-
-
杰兮 authored
Signed-off-by:
zhyajie <yajizhan@amd.com> Co-authored-by:
zhyajie <yajizhan@amd.com>
-
- 20 Nov, 2025 2 commits
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Pleaplusone authored
[ROCm][BugFix] Fix shared expert loading error when disable `VLLM_ROCM_USE_AITER_FUSION_SHARED_EXPERTS` (#28633) Signed-off-by:ganyi <ygan@amd.com>
-
- 19 Nov, 2025 3 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Yongye Zhu authored
Signed-off-by:Yongye Zhu <zyy1102000@gmail.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 15 Nov, 2025 1 commit
-
-
Eldar Kurtić authored
Signed-off-by:Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com>
-
- 13 Nov, 2025 2 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
zhuwenwen authored
-
- 10 Nov, 2025 1 commit
-
-
vllmellm authored
[RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 08 Nov, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Co-authored-by:
Kunshang Ji <kunshang.ji@intel.com>
-
- 06 Nov, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by:Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 05 Nov, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <markovilya197@gmail.com> Signed-off-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 31 Oct, 2025 1 commit
-
-
zhuwenwen authored
-
- 21 Oct, 2025 4 commits
-
-
Lain authored
Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Daniel Campora <961215+dcampora@users.noreply.github.com> Signed-off-by:
Lain <siyuanf@nvidia.com> Co-authored-by:
Daniel Campora <961215+dcampora@users.noreply.github.com>
-
Alexander Matveev authored
[Performance] Dual stream execution of "shared_experts" and "selected_experts" inside FusedMoE (#26440) Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-
Daniel Cámpora authored
Signed-off-by:Daniel Campora <961215+dcampora@users.noreply.github.com>
-
Chen Wu authored
Signed-off-by:
wuchen <cntryroa@gmail.com> Signed-off-by:
banjuede <lmklhc@163.com> Signed-off-by:
Chen Wu <cntryroa@gmail.com> Signed-off-by:
Danielle Robinson <dmmaddix@amazon.com> Signed-off-by:
Jee Jee Li <pandaleefree@gmail.com> Signed-off-by:
bk-201 <joy25810@foxmail.com> Co-authored-by:
wuchen <wuchen@zetyun.com> Co-authored-by:
Nathan Van Gheem <vangheem@gmail.com> Co-authored-by:
banjuede <lmklhc@163.com> Co-authored-by:
Danielle Robinson <dmmaddix@amazon.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
bk-201 <joy25810@foxmail.com>
-