- 28 Nov, 2025 3 commits
-
-
wang.yuqi authored
Signed-off-by:
wang.yuqi <yuqi.wang@daocloud.io> Signed-off-by:
wang.yuqi <noooop@126.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
EanWang211123 authored
Signed-off-by:
Tsai, Louie <louie.tsai@intel.com> Signed-off-by:
EanWang211123 <wangyiheng@sangfor.com.cn> Co-authored-by:
Louie Tsai <louie.tsai@intel.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
maang-h authored
Signed-off-by:maang <maang_h@163.com>
-
- 27 Nov, 2025 13 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Andrii Skliar authored
Signed-off-by:
Andrii Skliar <askliar@askliar-mlt.client.nvidia.com> Co-authored-by:
Andrii Skliar <askliar@askliar-mlt.client.nvidia.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Didier Durand authored
Signed-off-by:Didier Durand <durand.didier@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Morrison Turnansky authored
Signed-off-by:
morrison-turnansky <mturnans@redhat.com> Signed-off-by:
adabeyta <aabeyta@redhat.com> Signed-off-by:
Morrison Turnansky <mturnans@redhat.com> Co-authored-by:
adabeyta <aabeyta@redhat.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Matthew Bonanni authored
[Attention][Async] Eliminate `seq_lens_cpu` in FlashAttention metadata building with DCP > 1 (#29449) Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 26 Nov, 2025 5 commits
-
-
Johnny Yang authored
Signed-off-by:Johnny Yang <johnnyyang@google.com>
-
Lucas Wilkinson authored
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
Xieyang Xu authored
-
- 25 Nov, 2025 16 commits
-
-
Andrey Khalyavin authored
Signed-off-by:
Andrey Khalyavin <halyavin@yandex-team.ru> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Co-authored-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Yifan Qiao authored
Signed-off-by:
Yifan Qiao <yifanqiao@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
Eldar Kurtić authored
Signed-off-by:Eldar Kurtic <8884008+eldarkurtic@users.noreply.github.com>
-
Roger Wang authored
Signed-off-by:Roger Wang <hey@rogerw.io>
-
Nicolò Lucchesi authored
[Core] Generalize Encoder-Decoder `seq_lens` computation to avoid Whisper hardcoded logic (#29268) Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
Avishek Goswami authored
Signed-off-by:GOavi101 <1704178@kiit.ac.in>
-
wang.yuqi authored
Signed-off-by:wang.yuqi <yuqi.wang@daocloud.io>
-
Rémi Delacourt authored
Signed-off-by:
Rémi Delacourt <remi@mistral.ai> Signed-off-by:
Rémi Delacourt <54138269+Flechman@users.noreply.github.com> Signed-off-by:
remi <remi@mistral.ai>
-
Jiangyun Zhu authored
Signed-off-by:zjy0516 <riverclouds.zhu@qq.com>
-
Isotr0py authored
Signed-off-by:
manayang <jackmanayang@gmail.com> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
Roger Wang <hey@rogerw.io> Co-authored-by:
sergeywang <sergeywang@tencent.com> Co-authored-by:
manayang <jackmanayang@gmail.com> Co-authored-by:
manayang <manayang@tencent.com> Co-authored-by:
Roger Wang <hey@rogerw.io>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
gbyu-amd authored
Signed-off-by:
guanbao <gyu@amd.com> Signed-off-by:
Guanbao Yu <gyu@amd.com> Signed-off-by:
gbyu-amd <Guanbao.Yu@amd.com> Co-authored-by:
guanbao <gyu@amd.com>
-
Pleaplusone authored
[Perf][Deepseek] optimize gather_and_maybe_dequant_cache kernel's perf for extremely long sequence (#28029) Signed-off-by:ganyi <ygan@amd.com>
-
Hanjie Qiu authored
Signed-off-by:
hjjq <hanjieq@nvidia.com> Signed-off-by:
Benjamin Chislett <bchislett@nvidia.com> Co-authored-by:
Benjamin Chislett <bchislett@nvidia.com>
-
- 24 Nov, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-