- 11 Aug, 2025 4 commits
-
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
GuanLuo authored
Signed-off-by: GuanLuo <gluo@nvidia.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
- 10 Aug, 2025 2 commits
-
-
Chengji Yao authored
[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394)
Signed-off-by: Chengji Yao <chengjiyao@gmail.com>
Co-authored-by: Chengji Yao <chengjiyao@gmail.com>
-
Le Chen authored
Signed-off-by: lechen <lecself@163.com>
Signed-off-by: LeChen <lecself@163.com>
-
- 09 Aug, 2025 1 commit
-
-
Kyuyeun Kim authored
Signed-off-by: Kyuyeun Kim <kyuyeunk@google.com>
-
- 08 Aug, 2025 2 commits
-
-
Chauncey authored
Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
-
TJian authored
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 07 Aug, 2025 5 commits
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Michael Goin authored
Signed-off-by: Michael Goin <mgoin64@gmail.com>
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Asaf Joseph Gardin authored
Signed-off-by: asafg <asafg@ai21.com>
Co-authored-by: asafg <asafg@ai21.com>
-
- 05 Aug, 2025 5 commits
-
-
Giancarlo Delfin authored
Signed-off-by: Giancarlo Delfin <gdelfin@meta.com>
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Giancarlo Delfin authored
Signed-off-by: Giancarlo Delfin <gdelfin@meta.com>
-
Woosuk Kwon authored
-
PiteXChen authored
Signed-off-by: CLFutureX <775523362@qq.com>
-
- 04 Aug, 2025 4 commits
-
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Giancarlo Delfin authored
Signed-off-by: Giancarlo Delfin <gdelfin@meta.com>
-
Abirdcfly authored
Signed-off-by: Abirdcfly <fp544037857@gmail.com>
-
- 03 Aug, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
David Ben-David authored
Signed-off-by: David Ben-David <davidb@pliops.com>
Co-authored-by: David Ben-David <davidb@pliops.com>
-
- 02 Aug, 2025 3 commits
-
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Sage Moore authored
Signed-off-by: Sage Moore <sage@neuralmagic.com>
-
- 01 Aug, 2025 2 commits
-
-
rongfu.leng authored
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 31 Jul, 2025 4 commits
-
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
zhiweiz authored
Signed-off-by: morgendave <morgendave@gmail.com>
Co-authored-by: Roger Wang <hey@rogerw.me>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
- 30 Jul, 2025 6 commits
-
-
Zebing Lin authored
Signed-off-by: linzebing <linzebing1995@gmail.com>
-
Nick Hill authored
Signed-off-by: Nick Hill <nhill@redhat.com>
-
Chenguang Zheng authored
Signed-off-by: fake0fan <645327136@qq.com>
Signed-off-by: herotai214 <herotai214@gmail.com>
Co-authored-by: herotai214 <herotai214@gmail.com>
-
633WHU authored
Signed-off-by: chiliu <chiliu@paypal.com>
Co-authored-by: chiliu <chiliu@paypal.com>
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Ruixiang Tan authored
Signed-off-by: tanruixiang <tanruixiang0104@gmail.com>
Signed-off-by: Ruixiang Tan <819464715@qq.com>
Signed-off-by: GitHub <noreply@github.com>
-