- 07 Aug, 2025 4 commits
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Lain authored
Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Lain <fusiyuan2000@hotmail.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
- 06 Aug, 2025 3 commits
-
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
- 05 Aug, 2025 3 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 04 Aug, 2025 2 commits
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
ZiTian.Zhao authored
Signed-off-by:zitian.zhao <zitian.zhao@tencentmusic.com>
-
- 02 Aug, 2025 3 commits
-
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Sage Moore authored
Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
fhl2000 authored
-
- 01 Aug, 2025 2 commits
-
-
Michael Goin authored
-
Mickaël Seznec authored
Signed-off-by:Mickael Seznec <mickael@mistral.ai>
-
- 31 Jul, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 30 Jul, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 29 Jul, 2025 1 commit
-
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
- 28 Jul, 2025 1 commit
-
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
- 26 Jul, 2025 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by:
Max de Bayser <maxdebayser@gmail.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Russell Bryant <rbryant@redhat.com>
-
- 25 Jul, 2025 1 commit
-
-
who who who authored
Signed-off-by:
fsx950223 <fsx950223@outlook.com> Signed-off-by:
amd-ruitang3 <Rui.Tang2@amd.com> Co-authored-by:
amd-ruitang3 <Rui.Tang2@amd.com>
-
- 24 Jul, 2025 2 commits
-
-
weiliang authored
Signed-off-by:Weiliang Liu <weiliangl@nvidia.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 21 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 20 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
- 19 Jul, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Lucia Fang authored
Signed-off-by:
Lucia Fang <fanglu@fb.com> Signed-off-by:
Lu Fang <fanglu@meta.com> Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
Lu Fang <fanglu@meta.com>
-
- 18 Jul, 2025 2 commits
-
-
Lucas Wilkinson authored
-
elvischenv authored
[Bugfix] Fix the tensor non-contiguous issue for Flashinfer TRT-LLM backend attention kernel (#21133)
-
- 17 Jul, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
QiliangCui authored
Signed-off-by:Qiliang Cui <derrhein@gmail.com>
-
- 16 Jul, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Peter Pan authored
Signed-off-by:Peter Pan <Peter.Pan@daocloud.io>
-
Elfie Guo authored
Signed-off-by:
Elfie Guo <elfieg@nvidia.com> Co-authored-by:
Elfie Guo <eflieg@nvidia.com>
-
- 15 Jul, 2025 2 commits
-
-
Yifei Teng authored
Signed-off-by:Yifei Teng <tengyifei88@gmail.com>
-
Alexander Matveev authored
Signed-off-by:Alexander Matveev <amatveev@redhat.com>
-