- 22 Aug, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni001@gmail.com>
-
- 21 Aug, 2025 3 commits
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Paul Pak authored
Signed-off-by:Paul Pak <paulpak58@gmail.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Signed-off-by:
asafg <39553475+Josephasafg@users.noreply.github.com> Co-authored-by:
asafg <asafg@ai21.com>
-
- 20 Aug, 2025 5 commits
-
-
Matthew Bonanni authored
Signed-off-by:
Matthew Bonanni <mbonanni001@gmail.com> Signed-off-by:
Matthew Bonanni <mbonanni@redhat.com>
-
Matthew Bonanni authored
Signed-off-by:Matthew Bonanni <mbonanni001@gmail.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
who who who authored
Signed-off-by:fsx950223 <fsx950223@outlook.com>
-
Zebing Lin authored
Signed-off-by:linzebing <linzebing1995@gmail.com>
-
- 19 Aug, 2025 6 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
elvischenv authored
Signed-off-by:
elvischenv <219235043+elvischenv@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
Chengji Yao authored
Signed-off-by:
Chengji Yao <chengjiyao@gmail.com> Signed-off-by:
Chengji Yao <chengjiyao@google.com> Co-authored-by:
Chengji Yao <chengjiyao@gmail.com>
-
- 16 Aug, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 15 Aug, 2025 5 commits
-
-
eigen authored
-
fhl2000 authored
[Core] Allow full cudagraph with separate attention routines and orthogonal to compilation, add support for FA2 and FlashInfer (#20059) Signed-off-by:
fhl <2410591650@qq.com> Signed-off-by:
fhl2000 <63384265+fhl2000@users.noreply.github.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Lucas Wilkinson <LucasWilkinson@users.noreply.github.com>
-
amirai21 authored
Signed-off-by:
amirk <amirk@ai21.com> Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com> Co-authored-by:
Asaf Joseph Gardin <39553475+Josephasafg@users.noreply.github.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 13 Aug, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Aug, 2025 2 commits
-
-
Xiaozhu Meng authored
Signed-off-by:
Xiaozhu <mxz297@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
wang.yuqi authored
[Bugfix] Fix ModernBert load & Enable sliding window attention for bidirectional attention. (#22637) Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Max de Bayser <mbayser@br.ibm.com>
-
- 10 Aug, 2025 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 09 Aug, 2025 1 commit
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 08 Aug, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
- 07 Aug, 2025 4 commits
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Lain authored
Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Lain <fusiyuan2000@hotmail.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
- 06 Aug, 2025 3 commits
-
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
- 05 Aug, 2025 1 commit
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-