"vllm/model_executor/models/phimoe.py" did not exist on "8678a69ab51956031e3bb70bdf1a781a8652e67d"
- 07 Aug, 2025 1 commit
-
-
Lucas Wilkinson authored
[Attention] Support multiple attention metadata builders per kv_cache_spec + proper local attention no hybrid kv cache fix (#21588) Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 06 Aug, 2025 3 commits
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Lucas Wilkinson authored
[BugFix] Fix triton compile error in `kernel_unified_attention_2/3d` caused by attention sinks (#22368) Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by:
LiuXiaoxuanPKU <lilyliupku@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Chen Zhang <zhangch99@outlook.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com> Co-authored-by:
Minseok Lee <47620120+minseokl@users.noreply.github.com> Co-authored-by:
Yongye Zhu <zyy1102000@gmail.com>
-
- 05 Aug, 2025 2 commits
-
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
Michael Goin authored
Signed-off-by:mgoin <michael@neuralmagic.com>
-
- 04 Aug, 2025 1 commit
-
-
Giancarlo Delfin authored
Signed-off-by:Giancarlo Delfin <gdelfin@meta.com>
-
- 01 Aug, 2025 1 commit
-
-
Michael Goin authored
-
- 29 Jul, 2025 1 commit
-
-
elvischenv authored
Signed-off-by:elvischenv <219235043+elvischenv@users.noreply.github.com>
-
- 24 Jul, 2025 1 commit
-
-
weiliang authored
Signed-off-by:Weiliang Liu <weiliangl@nvidia.com>
-
- 23 Jul, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Tao He authored
Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com>
-
- 21 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Michael Goin <mgoin64@gmail.com>
-
- 19 Jul, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Lucia Fang authored
Signed-off-by:
Lucia Fang <fanglu@fb.com> Signed-off-by:
Lu Fang <fanglu@meta.com> Signed-off-by:
Lu Fang <fanglu@fb.com> Co-authored-by:
Lu Fang <fanglu@meta.com>
-
- 18 Jul, 2025 2 commits
-
-
hax0r31337 authored
Signed-off-by:hax0r31337 <liulihaocaiqwq@gmail.com>
-
Richard Zou authored
Signed-off-by:rzou <zou3519@gmail.com>
-
- 17 Jul, 2025 1 commit
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
- 16 Jul, 2025 1 commit
-
-
Peter Pan authored
Signed-off-by:Peter Pan <Peter.Pan@daocloud.io>
-
- 15 Jul, 2025 1 commit
-
-
Li Wang authored
Signed-off-by:wangli <wangli858794774@gmail.com>
-
- 14 Jul, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
- 13 Jul, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 12 Jul, 2025 1 commit
-
-
Congcong Chen authored
Signed-off-by:Congcong Chen <congcongchen@microsoft.com>
-
- 11 Jul, 2025 3 commits
-
-
Pavani Majety authored
Signed-off-by:
Pavani Majety <pmajety@nvidia.com> Signed-off-by:
mgoin <mgoin64@gmail.com> Co-authored-by:
shuw <shuw@nvidia.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Luka Govedič authored
Signed-off-by:
Luka Govedic <lgovedic@redhat.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
Alexander Matveev authored
-
- 08 Jul, 2025 1 commit
-
-
Li, Jiang authored
Signed-off-by:jiang1.li <jiang1.li@intel.com>
-
- 07 Jul, 2025 1 commit
-
-
jvlunteren authored
Signed-off-by:Jan van Lunteren <jvl@zurich.ibm.com>
-
- 06 Jul, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by:
Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by:
jiang1.li <jiang1.li@intel.com> Co-authored-by:
Li, Jiang <jiang1.li@intel.com>
-
- 02 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
- 27 Jun, 2025 1 commit
-
-
Chendi.Xue authored
Signed-off-by:Chendi.Xue <chendi.xue@intel.com>
-
- 26 Jun, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
TJian authored
[Bugfix][V1][ROCm] Fix AITER Flash Attention Backend (Fix API Break and Local Attention Logic: affecting Llama4) (#19904) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 25 Jun, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 20 Jun, 2025 2 commits
-
-
Ning Xie authored
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-
- 18 Jun, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Ning Xie authored
Signed-off-by:Andy Xie <andy.xning@gmail.com>
-