- 24 Apr, 2026 1 commit
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nickhill123@gmail.com>
-
- 30 Mar, 2026 1 commit
-
-
SandishKumarHN authored
Signed-off-by:
SandishKumarHN <sandish@fb.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Matthew Bonanni <mbonanni@redhat.com>
-
- 27 Mar, 2026 1 commit
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
- 26 Mar, 2026 1 commit
-
-
Chuan (Richard) Li authored
Signed-off-by:Li <chuali@amd.com>
-
- 20 Mar, 2026 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 10 Mar, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
- 20 Feb, 2026 1 commit
-
-
Huamin Li authored
Signed-off-by:
Huamin Li <3ericli@gmail.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 13 Feb, 2026 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk@inferact.ai>
-
- 05 Feb, 2026 1 commit
-
-
Benjamin Chislett authored
Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
- 23 Jan, 2026 2 commits
-
-
Harry Huang authored
Signed-off-by:
huanghaoyan.hhy <huanghaoyan.hhy@alibaba-inc.com> Signed-off-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Chen Zhang <zhangch99@outlook.com>
-
tianshu-Michael-yu authored
Signed-off-by:Tianshu Yu <tianshuyu.formal@gmail.com>
-
- 19 Jan, 2026 1 commit
-
-
Tomas Ruiz authored
Signed-off-by:Tomas Ruiz <tomas.ruiz.te@gmail.com>
-
- 13 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 12 Jan, 2026 2 commits
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Asaf Joseph Gardin authored
Signed-off-by:Josephasafg <ajgard7@gmail.com>
-
- 10 Jan, 2026 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 09 Jan, 2026 1 commit
-
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 07 Jan, 2026 1 commit
-
-
Jack Yang authored
Signed-off-by:
Zhuohao Yang <zy242@cornell.edu> Co-authored-by:
Zhuohao Yang <zy242@cornell.edu> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
- 06 Jan, 2026 1 commit
-
-
Lucas Wilkinson authored
[Attention][1/n] Remove usage of deprecated `seq_lens_cpu` and `num_computed_tokens_cpu` CommonAttentionMetadata properties (#31773) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 23 Dec, 2025 1 commit
-
-
Patrick von Platen authored
Signed-off-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
- 16 Dec, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Stanislaw Wozniak <stw@zurich.ibm.com>
-
jiangkuaixue123 authored
Signed-off-by:
jiangkuaixue123 <jiangxiaozhou111@163.com> Co-authored-by:
root <root@hk01dgx028.cm.cluster>
-
- 12 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 10 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624) Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
- 09 Dec, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Benjamin Chislett <chislett.ben@gmail.com>
-
- 29 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 27 Nov, 2025 3 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
[Attention][Async] Eliminate `seq_lens_cpu` in FlashAttention metadata building with DCP > 1 (#29449) Signed-off-by:Matthew Bonanni <mbonanni@redhat.com>
-
- 26 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
-
- 25 Nov, 2025 1 commit
-
-
Nicolò Lucchesi authored
[Core] Generalize Encoder-Decoder `seq_lens` computation to avoid Whisper hardcoded logic (#29268) Signed-off-by:
NickLucche <nlucches@redhat.com> Co-authored-by:
Ekagra Ranjan <3116519+ekagra-ranjan@users.noreply.github.com>
-
- 19 Nov, 2025 2 commits
-
-
Qiu authored
Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
FENP <yuanyongjie.yyj@antgroup.com> Signed-off-by:
LookAround <lixushi@huawei.com> Signed-off-by:
Jingchun Gao <gaojingchun1@huawei.com> Signed-off-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
FENP <yuanyongjie.yyj@antgroup.com> Co-authored-by:
LookAround <lixushi@huawei.com> Co-authored-by:
Jingchun Gao <gaojingchun1@huawei.com> Co-authored-by:
zhenwenqi2024 <zhenwenqi_2022@qq.com> Co-authored-by:
Jingchun Gao <63247409+gjc0824@users.noreply.github.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 14 Nov, 2025 1 commit
-
-
Yong Hoon Shin authored
Signed-off-by:
Yong Hoon Shin <yhshin@meta.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk>
-
- 12 Nov, 2025 2 commits
-
-
Benjamin Chislett authored
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479) Signed-off-by:Benjamin Chislett <bchislett@nvidia.com>
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 08 Nov, 2025 1 commit
-
-
zhangsicheng5 authored
Signed-off-by:
zhangsicheng5 <zhangsicheng5@huawei.com> Signed-off-by:
QiuChunshuo <qiuchunshuo@huawei.com> Signed-off-by:
Qiu <qiuchunshuo@huawei.com> Co-authored-by:
QiuChunshuo <qiuchunshuo@huawei.com>
-
- 05 Nov, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 04 Nov, 2025 1 commit
-
-
Pleaplusone authored
Signed-off-by:ganyi <ygan@amd.com>
-
- 30 Oct, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-