- 29 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 28 Nov, 2025 1 commit
-
-
Augusto Yao authored
Signed-off-by:augusto.yjh <augusto.yjh@antgroup.com>
-
- 27 Nov, 2025 4 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Andrii Skliar authored
Signed-off-by:
Andrii Skliar <askliar@askliar-mlt.client.nvidia.com> Co-authored-by:
Andrii Skliar <askliar@askliar-mlt.client.nvidia.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
[Attention][Async] Eliminate `seq_lens_cpu` in FlashAttention metadata building with DCP > 1 (#29449) Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 26 Nov, 2025 3 commits
-
-
Lucas Wilkinson authored
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 25 Nov, 2025 4 commits
-
-
Nicolò Lucchesi authored
[Core] Generalize Encoder-Decoder `seq_lens` computation to avoid Whisper hardcoded logic (#29268) Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
gbyu-amd authored
Signed-off-by:
guanbao <gyu@amd.com> Signed-off-by:
Guanbao Yu <gyu@amd.com> Signed-off-by:
gbyu-amd <Guanbao.Yu@amd.com> Co-authored-by:
guanbao <gyu@amd.com>
-
Pleaplusone authored
[Perf][Deepseek] optimize gather_and_maybe_dequant_cache kernel's perf for extremely long sequence (#28029) Signed-off-by:ganyi <ygan@amd.com>
-
- 24 Nov, 2025 2 commits
-
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
tongqiu authored
Signed-off-by:apinge <Tong.Qiu2@amd.com>
-
- 22 Nov, 2025 2 commits
-
-
Fadi Arafeh authored
Signed-off-by:Fadi Arafeh <fadi.arafeh@arm.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 21 Nov, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
who who who authored
Signed-off-by:fsx950223 <fsx950223@outlook.com>
-
- 20 Nov, 2025 3 commits
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Qiang Zhang authored
Signed-off-by:chiangzhang <chiangzhang@tencent.com>
-
- 19 Nov, 2025 4 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Aleksandr Malyshev authored
Signed-off-by:
Aleksandr Malyshev <maleksan@amd.com> Co-authored-by:
Aleksandr Malyshev <maleksan@amd.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 17 Nov, 2025 1 commit
-
-
Xiake Sun authored
Signed-off-by:Xiake Sun <xiake.sun@amd.com>
-
- 14 Nov, 2025 3 commits
-
-
Lucas Wilkinson authored
-
Yong Hoon Shin authored
Signed-off-by:
Yong Hoon Shin <yhshin@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
Jingchun Gao authored
Signed-off-by:
gaojc <1055866782@qq.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Co-authored-by:
gaojingchun (A) <g00955623@china.huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
- 13 Nov, 2025 6 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Huamin Li authored
Signed-off-by:Huamin Li <3ericli@gmail.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Nov, 2025 4 commits
-
-
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479) Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Andreas Karatzas authored
Signed-off-by:
Andreas Karatzas <akaratza@amd.com> Signed-off-by:
Andreas Karatzas <Andreas.Karatzas@amd.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-