- 22 Apr, 2026 1 commit
-
-
wangmin6 authored
-
- 17 Apr, 2026 1 commit
-
-
王敏 authored
-
- 01 Apr, 2026 1 commit
-
-
王敏 authored
-
- 26 Mar, 2026 1 commit
-
-
wanghl6 authored
-
- 21 Mar, 2026 3 commits
- 18 Mar, 2026 1 commit
-
-
yangql authored
-
- 17 Mar, 2026 1 commit
-
-
王敏 authored
-
- 11 Mar, 2026 1 commit
-
-
laibao authored
-
- 09 Mar, 2026 1 commit
-
-
yangql authored
-
- 03 Mar, 2026 1 commit
-
-
zhuwenwen authored
-
- 05 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 04 Feb, 2026 1 commit
-
-
zhuwenwen authored
-
- 29 Jan, 2026 1 commit
-
-
zhuwenwen authored
not supported FlashMLASchedMeta
-
- 23 Jan, 2026 3 commits
-
-
Markus / Mark authored
Signed-off-by: marksverdhei <marksverdhei@hotmail.com>
Signed-off-by: Markus / Mark <46672778+marksverdhei@users.noreply.github.com>
Signed-off-by: mgoin <mgoin64@gmail.com>
Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Co-authored-by: mgoin <mgoin64@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 22 Jan, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 21 Jan, 2026 2 commits
-
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 17 Jan, 2026 1 commit
-
-
Guofang.Tang authored
Signed-off-by: Guofang Tang <tinggofun@gmail.com>
Co-authored-by: Guofang Tang <tinggofun@gmail.com>
-
- 16 Jan, 2026 1 commit
-
-
zhuwenwen authored
fix _forward_encoder_attention; remove medusa; set VLLM_PCIE_USE_CUSTOM_ALLREDUCE=1
-
- 13 Jan, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Mickaël Seznec authored
Signed-off-by: Mickael Seznec <mickael@mistral.ai>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
- 12 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 4 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
Jack Yang authored
Signed-off-by: Zhuohao Yang <zy242@cornell.edu>
Co-authored-by: Zhuohao Yang <zy242@cornell.edu>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 06 Jan, 2026 3 commits
-
-
Lucas Wilkinson authored
[Attention][1/n] Remove usage of deprecated `seq_lens_cpu` and `num_computed_tokens_cpu` CommonAttentionMetadata properties (#31773)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
zhuwenwen authored
-
zhuwenwen authored
update weights_not_loaded and flash_mla_with_kvcache; update paged_mqa_logits
-
- 02 Jan, 2026 1 commit
-
-
Kevin McKay authored
Signed-off-by: c0de128 <kevin.mckay@outlook.com>
-
- 31 Dec, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
-
- 23 Dec, 2025 2 commits
-
-
zhuwenwen authored
-
Pavani Majety authored
Signed-off-by: Pavani Majety <pmajety@nvidia.com>
-
- 22 Dec, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-