"vscode:/vscode.git/clone" did not exist on "2f5f98bb752f9efdb20b85752bd7137f3f787be4"
- 14 Aug, 2025 5 commits
-
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
nvjullin authored
Signed-off-by:
Julien Lin <jullin@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
iAmir97 authored
Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Signed-off-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:
Nick Hill <nhill@redhat.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
- 13 Aug, 2025 5 commits
-
-
Jialin Ouyang authored
Signed-off-by:Jialin Ouyang <Jialin.Ouyang@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 12 Aug, 2025 3 commits
-
-
Xiaozhu Meng authored
Signed-off-by:
Xiaozhu <mxz297@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
-
Rahul Tuli authored
Signed-off-by:Rahul Tuli <rtuli@redhat.com>
-
wang.yuqi authored
[Bugfix] Fix ModernBert load & Enable sliding window attention for bidirectional attention. (#22637) Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Max de Bayser <mbayser@br.ibm.com> Co-authored-by:
Max de Bayser <mbayser@br.ibm.com>
-
- 11 Aug, 2025 3 commits
-
-
wang.yuqi authored
Signed-off-by:wang.yuqi <noooop@126.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
- 10 Aug, 2025 2 commits
-
-
Chengji Yao authored
[TPU] kv cache update kernel doesn't need to be padded slices to multiple of num_slices_per_block (#22394) Signed-off-by:
Chengji Yao <chengjiyao@gmail.com> Co-authored-by:
Chengji Yao <chengjiyao@gmail.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
- 09 Aug, 2025 5 commits
-
-
Or Ozeri authored
Signed-off-by:Or Ozeri <oro@il.ibm.com>
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Roger Wang authored
Signed-off-by:
Roger Wang <hey@rogerw.me> Signed-off-by:
Andy Xie <andy.xning@gmail.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Signed-off-by:
Andrew Sansom <andrew@protopia.ai> Signed-off-by:
Zhiyu Cheng <zhiyuc@nvidia.com> Signed-off-by:
Shu Wang <shuw@nvidia.com> Signed-off-by:
Po-Han Huang <pohanh@nvidia.com> Signed-off-by:
Shu Wang. <shuw@nvidia.com> Signed-off-by:
XIn Li <xinli@nvidia.com> Signed-off-by:
Junhao Li <junhao@ubicloud.com> Signed-off-by:
chaunceyjiang <chaunceyjiang@gmail.com> Signed-off-by:
zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by:
zitian.zhao <zitian.zhao@tencentmusic.com> Signed-off-by:
zitian zhao <zitian.zhao@tencentmusic.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
iAmir97 <Amir.balwel@embeddedllm.com> Signed-off-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Signed-off-by:
Linkun <github@lkchen.net> Co-authored-by:
Ning Xie <andy.xning@gmail.com> Co-authored-by:
TJian <tunjian.tan@embeddedllm.com> Co-authored-by:
Andrew Sansom <andrew@protopia.ai> Co-authored-by:
Zhiyu <zhiyuc@nvidia.com> Co-authored-by:
Shu Wang <shuw@nvidia.com> Co-authored-by:
XIn Li <xinli@nvidia.com> Co-authored-by:
Junhao Li <streaver91@gmail.com> Co-authored-by:
Chauncey <chaunceyjiang@gmail.com> Co-authored-by:
Yuxuan Zhang <2448370773@qq.com> Co-authored-by:
ZiTian Zhao <zitian.zhao@tencentmusic.com> Co-authored-by:
Cyrus Leung <tlleungac@connect.ust.hk> Co-authored-by:
Po-Han Huang (NVIDIA) <53919306+nvpohanh@users.noreply.github.com> Co-authored-by:
iAmir97 <71513472+iAmir97@users.noreply.github.com> Co-authored-by:
iAmir97 <Amir.balwel@embeddedllm.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Co-authored-by:
Hong Hanh <hanh.usth@gmail.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
lkchen <github@lkchen.net>
-
Pradyun92 authored
Signed-off-by:
Pradyun Ramadorai <pradyunr@amazon.com> Signed-off-by:
Pradyun92 <142861237+Pradyun92@users.noreply.github.com> Co-authored-by:
Pradyun Ramadorai <pradyunr@amazon.com> Co-authored-by:
Nicolò Lucchesi <nicolo.lucchesi@gmail.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
- 08 Aug, 2025 6 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by:
Varun Sundar Rabindranath <vsundarr@redhat.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
yewentao256 <zhyanwentao@126.com> Co-authored-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Po-Han Huang (NVIDIA) authored
Signed-off-by:Po-Han Huang <pohanh@nvidia.com>
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 07 Aug, 2025 8 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Syed Muhammad Bin Asif authored
Signed-off-by:Syed Muhammad Bin Asif <syedmba7@connect.hku.hk>
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Lain authored
Signed-off-by:
Siyuan Fu <siyuanf@nvidia.com> Signed-off-by:
Lain <fusiyuan2000@hotmail.com> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
Asaf Joseph Gardin authored
Signed-off-by:
asafg <asafg@ai21.com> Co-authored-by:
asafg <asafg@ai21.com>
-
- 06 Aug, 2025 3 commits
-
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Yongye Zhu <zyy1102000@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
Yongye Zhu authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-