- 10 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Benjamin Chislett <chislett.ben@gmail.com>
-
- 09 Dec, 2025 2 commits
-
-
Jaya Yuan authored
Signed-off-by: FENP <yuanyongjie.yyj@antgroup.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
Co-authored-by: Benjamin Chislett <chislett.ben@gmail.com>
-
- 08 Dec, 2025 1 commit
-
-
Lain authored
Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
-
- 07 Dec, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 05 Dec, 2025 2 commits
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
Jingchun Gao authored
Signed-off-by: Jingchun Gao <gaojingchun1@huawei.com>
Signed-off-by: Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
Co-authored-by: Jingchun Gao <gaojingchun1@huawei.com>
-
- 04 Dec, 2025 1 commit
-
-
Andreas Karatzas authored
Signed-off-by: Andreas Karatzas <akaratza@amd.com>
-
- 02 Dec, 2025 1 commit
-
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 01 Dec, 2025 1 commit
-
-
Isotr0py authored
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
Co-authored-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
-
- 30 Nov, 2025 2 commits
-
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Huamin Li authored
Signed-off-by: Huamin Li <3ericli@gmail.com>
-
- 29 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 28 Nov, 2025 1 commit
-
-
Augusto Yao authored
Signed-off-by: augusto.yjh <augusto.yjh@antgroup.com>
-
- 27 Nov, 2025 4 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Andrii Skliar authored
Signed-off-by: Andrii Skliar <askliar@askliar-mlt.client.nvidia.com>
Co-authored-by: Andrii Skliar <askliar@askliar-mlt.client.nvidia.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
[Attention][Async] Eliminate `seq_lens_cpu` in FlashAttention metadata building with DCP > 1 (#29449)
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 26 Nov, 2025 3 commits
-
-
Lucas Wilkinson authored
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
- 25 Nov, 2025 4 commits
-
-
Nicolò Lucchesi authored
[Core] Generalize Encoder-Decoder `seq_lens` computation to avoid Whisper hardcoded logic (#29268)
Signed-off-by: NickLucche <nlucches@redhat.com>
Co-authored-by: Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
Jiangyun Zhu authored
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
-
gbyu-amd authored
Signed-off-by: guanbao <gyu@amd.com>
Signed-off-by: Guanbao Yu <gyu@amd.com>
Signed-off-by: gbyu-amd <Guanbao.Yu@amd.com>
Co-authored-by: guanbao <gyu@amd.com>
-
Pleaplusone authored
[Perf][Deepseek] optimize gather_and_maybe_dequant_cache kernel's perf for extremely long sequence (#28029)
Signed-off-by: ganyi <ygan@amd.com>
-
- 24 Nov, 2025 2 commits
-
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.io>
-
tongqiu authored
Signed-off-by: apinge <Tong.Qiu2@amd.com>
-
- 22 Nov, 2025 2 commits
-
-
Fadi Arafeh authored
Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
- 21 Nov, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
who who who authored
Signed-off-by: fsx950223 <fsx950223@outlook.com>
-
- 20 Nov, 2025 3 commits
-
-
Or Ozeri authored
Signed-off-by: Or Ozeri <oro@il.ibm.com>
-
Pleaplusone authored
Signed-off-by: ganyi <ygan@amd.com>
-
Qiang Zhang authored
Signed-off-by: chiangzhang <chiangzhang@tencent.com>
-
- 19 Nov, 2025 5 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
(cherry picked from commit 48fc8b1e)
-
Qiu authored
Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>
Signed-off-by: FENP <yuanyongjie.yyj@antgroup.com>
Signed-off-by: LookAround <lixushi@huawei.com>
Signed-off-by: Jingchun Gao <gaojingchun1@huawei.com>
Signed-off-by: zhenwenqi2024 <zhenwenqi_2022@qq.com>
Co-authored-by: FENP <yuanyongjie.yyj@antgroup.com>
Co-authored-by: LookAround <lixushi@huawei.com>
Co-authored-by: Jingchun Gao <gaojingchun1@huawei.com>
Co-authored-by: zhenwenqi2024 <zhenwenqi_2022@qq.com>
Co-authored-by: Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Aleksandr Malyshev authored
Signed-off-by: Aleksandr Malyshev <maleksan@amd.com>
Co-authored-by: Aleksandr Malyshev <maleksan@amd.com>
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Matthew Bonanni authored
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
-
- 17 Nov, 2025 1 commit
-
-
Xiake Sun authored
Signed-off-by: Xiake Sun <xiake.sun@amd.com>
-