- 03 Feb, 2026 1 commit
-
-
王敏 authored
-
- 12 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 2 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Jack Yang authored
Signed-off-by:
Zhuohao Yang <zy242@cornell.edu> Co-authored-by:
Zhuohao Yang <zy242@cornell.edu> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 06 Jan, 2026 1 commit
-
-
Lucas Wilkinson authored
[Attention][1/n] Remove usage of deprecated `seq_lens_cpu` and `num_computed_tokens_cpu` CommonAttentionMetadata properties (#31773) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 23 Dec, 2025 1 commit
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
- 22 Dec, 2025 3 commits
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
- 18 Dec, 2025 1 commit
-
-
zhuwenwen authored
-
- 13 Dec, 2025 1 commit
-
-
Roberto L. Castro authored
Signed-off-by:
LopezCastroRoberto <robertol.c510@gmail.com> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Co-authored-by:
youkaichao <youkaichao@gmail.com>
-
- 11 Dec, 2025 1 commit
-
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 08 Dec, 2025 1 commit
-
-
Lain authored
Signed-off-by:Siyuan Fu <siyuanf@nvidia.com>
-
- 05 Dec, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
- 04 Dec, 2025 3 commits
- 02 Dec, 2025 1 commit
-
-
王敏 authored
-
- 30 Nov, 2025 1 commit
-
-
Huamin Li authored
Signed-off-by:Huamin Li <3ericli@gmail.com>
-
- 28 Nov, 2025 1 commit
-
-
Augusto Yao authored
Signed-off-by:augusto.yjh <augusto.yjh@antgroup.com>
-
- 26 Nov, 2025 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 25 Nov, 2025 1 commit
-
-
Pleaplusone authored
[Perf][Deepseek] optimize gather_and_maybe_dequant_cache kernel's perf for extremely long sequence (#28029) Signed-off-by:ganyi <ygan@amd.com>
-
- 20 Nov, 2025 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 19 Nov, 2025 2 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 13 Nov, 2025 3 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
zhuwenwen authored
add VLLM_USE_PD_SPLIT to split prefill and decode replace triton_ of rms and act_and_mul
-
王敏 authored
-
- 11 Nov, 2025 3 commits
-
-
Max Hu authored
Signed-off-by:
Max Hu <hyoung2991@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Jie Luo authored
Signed-off-by:Livinfly <luojie3m@gmail.com>
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 10 Nov, 2025 2 commits
-
-
vllmellm authored
[RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 08 Nov, 2025 1 commit
-
-
zhangsicheng5 authored
Signed-off-by:
zhangsicheng5 <zhangsicheng5@huawei.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
Qiu <qiuchunshuo@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
- 05 Nov, 2025 2 commits
-
-
Qiu authored
Signed-off-by:Qiu <qiuchunshuo@huawei.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 26 Oct, 2025 1 commit
-
-
Yeshwanth N authored
Signed-off-by:
Yeshwanth Surya <yeshsurya@gmail.com> Signed-off-by:
Yeshwanth N <yeshsurya@gmail.com> Signed-off-by:
yeshsurya <yeshsurya@gmail.com>
-
- 24 Oct, 2025 1 commit
-
-
Ming Yang authored
Signed-off-by:Ming Yang <minos.future@gmail.com>
-
- 20 Oct, 2025 1 commit
-
-
zhuwenwen authored
-