- 07 Aug, 2025 21 commits
-
-
wang.yuqi authored
Signed-off-by: wang.yuqi <noooop@126.com>
-
WeiQing Chen authored
Signed-off-by: ycyaw66 <497410282@qq.com>
Signed-off-by: David Chen <530634352@qq.com>
Co-authored-by: ycyaw66 <497410282@qq.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Lionel Villard authored
Signed-off-by: Lionel Villard <villard@us.ibm.com>
-
ZiTian.Zhao authored
Signed-off-by: zitian.zhao <zitian.zhao@tencentmusic.com>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Syed Muhammad Bin Asif authored
Signed-off-by: Syed Muhammad Bin Asif <syedmba7@connect.hku.hk>
-
qscqesze authored
Signed-off-by: QscQ <qscqesze@gmail.com>
Signed-off-by: qingjun <qingjun@minimaxi.com>
-
Michael Goin authored
Signed-off-by: Michael Goin <mgoin64@gmail.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Tao He authored
Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588)
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
-
Maximilien de Bayser authored
Signed-off-by: Max de Bayser <mbayser@br.ibm.com>
-
tc-mb authored
Co-authored-by: imning3 <hbning@pku.edu.cn>
-
Michael Goin authored
Signed-off-by: mgoin <mgoin64@gmail.com>
-
Lain authored
Signed-off-by: Siyuan Fu <siyuanf@nvidia.com>
Signed-off-by: Lain <fusiyuan2000@hotmail.com>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Yongye Zhu authored
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
Asaf Joseph Gardin authored
Signed-off-by: asafg <asafg@ai21.com>
Co-authored-by: asafg <asafg@ai21.com>
-
- 06 Aug, 2025 19 commits
-
-
Yongye Zhu authored
Signed-off-by: simon-mo <xmo@berkeley.edu>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
Yongye Zhu authored
Signed-off-by: simon-mo <xmo@berkeley.edu>
Signed-off-by: Yongye Zhu <zyy1102000@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix triton compile error in `kernel_unified_attention_2/3d` caused by attention sinks (#22368)
Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
-
Zhang Jason authored
Signed-off-by: Zhang Jason <ning.zhang2@amd.com>
-
Lucas Wilkinson authored
Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
-
Roger Wang authored
Signed-off-by: Roger Wang <hey@rogerw.me>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Yongye Zhu authored
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: isotr0py <2037008807@qq.com>
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: LiuXiaoxuanPKU <lilyliupku@gmail.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
Co-authored-by: Minseok Lee <47620120+minseokl@users.noreply.github.com>
Co-authored-by: Yongye Zhu <zyy1102000@gmail.com>
-