- 19 Aug, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 18 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 15 Aug, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 13 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 11 Aug, 2025 1 commit
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
- 10 Aug, 2025 1 commit
-
-
Chengji Yao authored
[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394)
Signed-off-by: Chengji Yao <chengjiyao@gmail.com>
Co-authored-by: Chengji Yao <chengjiyao@gmail.com>
-
- 09 Aug, 2025 1 commit
-
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
Signed-off-by: Andy Xie <andy.xning@gmail.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Signed-off-by: Andrew Sansom <andrew@protopia.ai>
Signed-off-by: Zhiyu Cheng <zhiyuc@nvidia.com>
Signed-off-by: Shu Wang <shuw@nvidia.com>
Signed-off-by: Po-Han Huang <pohanh@nvidia.com>
Signed-off-by: Shu Wang. <shuw@nvidia.com>
Signed-off-by: XIn Li <xinli@nvidia.com>
Signed-off-by: Junhao Li <junhao@ubicloud.com>
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Signed-off-by: zitian.zhao <zitian.zhao@tencentmusic.com>
Signed-off-by: zitian zhao <zitian.zhao@tencentmusic.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: iAmir97 <Amir.balwel@embeddedllm.com>
Signed-off-by: iAmir97 <71513472+iAmir97@users.noreply.github.com>
Signed-off-by: Linkun <github@lkchen.net>
Co-authored-by: Ning Xie <andy.xning@gmail.com>
Co-authored-by: TJian <tunjian.tan@embeddedllm.com>
Co-authored-by: Andrew Sansom <andrew@protopia.ai>
Co-authored-by: Zhiyu <zhiyuc@nvidia.com>
Co-authored-by: Shu Wang <shuw@nvidia.com>
Co-authored-by: XIn Li <xinli@nvidia.com>
Co-authored-by: Junhao Li <streaver91@gmail.com>
Co-authored-by: Chauncey <chaunceyjiang@gmail.com>
Co-authored-by: Yuxuan Zhang <2448370773@qq.com>
Co-authored-by: ZiTian Zhao <zitian.zhao@tencentmusic.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Po-Han Huang (NVIDIA) <53919306+nvpohanh@users.noreply.github.com>
Co-authored-by: iAmir97 <71513472+iAmir97@users.noreply.github.com>
Co-authored-by: iAmir97 <Amir.balwel@embeddedllm.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Hong Hanh <hanh.usth@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: lkchen <github@lkchen.net>
-
- 07 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
- 05 Aug, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 03 Aug, 2025 1 commit
-
-
David Ben-David authored
Signed-off-by: David Ben-David <davidb@pliops.com>
Co-authored-by: David Ben-David <davidb@pliops.com>
-
- 25 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 24 Jul, 2025 1 commit
-
-
Juncheng Gu authored
Signed-off-by: Juncheng Gu <juncgu@gmail.com>
Signed-off-by: Richard Liu <ricliu@google.com>
Co-authored-by: Richard Liu <39319471+richardsliu@users.noreply.github.com>
Co-authored-by: Richard Liu <ricliu@google.com>
-
- 23 Jul, 2025 2 commits
-
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 21 Jul, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 20 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
- 18 Jul, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
- 16 Jul, 2025 2 commits
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
Peter Pan authored
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
-
- 15 Jul, 2025 1 commit
-
-
Yifei Teng authored
Signed-off-by: Yifei Teng <tengyifei88@gmail.com>
-
- 14 Jul, 2025 2 commits
-
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 10 Jul, 2025 1 commit
-
-
Chenyaaang authored
Signed-off-by: Chenyaaang <chenyangli@google.com>
-
- 02 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
- 30 Jun, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 26 Jun, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
- 25 Jun, 2025 1 commit
-
-
Chenyaaang authored
Signed-off-by: Chenyaaang <chenyangli@google.com>
-
- 19 Jun, 2025 1 commit
-
-
Maximilien de Bayser authored
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
Signed-off-by: Max de Bayser <maxdebayser@gmail.com>
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
Co-authored-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 18 Jun, 2025 1 commit
-
-
afeldman-nm authored
Signed-off-by: Andrew Feldman <afeldman@redhat.com>
-
- 13 Jun, 2025 1 commit
-
-
汪志鹏 authored
Signed-off-by: 汪志鹏 <wangzhipeng628@gmail.com>
-
- 06 Jun, 2025 3 commits
-
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 04 Jun, 2025 1 commit
-
-
Lukas Geiger authored
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
-
- 03 Jun, 2025 4 commits
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
Siyuan Liu authored
Signed-off-by: Siyuan Liu <lsiyuan@google.com>
Co-authored-by: Hossein Sarshar <hossein.sarshar@gmail.com>
Co-authored-by: Chengji Yao <chengjiyao@google.com>
-