- 06 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 05 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 24 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 17 Jul, 2025 3 commits
- 10 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 09 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 06 Jul, 2025 3 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: jiang1.li <jiang1.li@intel.com>
Co-authored-by: Li, Jiang <jiang1.li@intel.com>
- 03 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 02 Jul, 2025 1 commit
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
- 27 Jun, 2025 2 commits
-
-
zhuwenwen authored
-
Chendi.Xue authored
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
-
- 26 Jun, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by: Chengji Yao <chengjiyao@google.com>
-
Kunshang Ji authored
Signed-off-by: Kunshang Ji <kunshang.ji@intel.com>
-
TJian authored
[Bugfix][V1][ROCm] Fix AITER Flash Attention Backend (Fix API Break and Local Attention Logic: affecting Llama4) (#19904)
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
-
- 25 Jun, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by: yewentao256 <zhyanwentao@126.com>
-
- 23 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 21 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 20 Jun, 2025 2 commits
-
-
Ning Xie authored
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
- 18 Jun, 2025 3 commits
-
-
Woosuk Kwon authored
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
zhuwenwen authored
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
- 17 Jun, 2025 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
jvlunteren authored
Signed-off-by: Jan van Lunteren <jvl@zurich.ibm.com>
-
- 15 Jun, 2025 1 commit
-
-
22quinn authored
Signed-off-by: 22quinn <33176974+22quinn@users.noreply.github.com>
-
- 13 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 12 Jun, 2025 2 commits
-
-
Luka Govedič authored
Signed-off-by: Luka Govedič <lgovedic@redhat.com>
Co-authored-by: Sage Moore <sage@neuralmagic.com>
-
Ning Xie authored
Signed-off-by: Andy Xie <andy.xning@gmail.com>
-
- 11 Jun, 2025 2 commits
-
-
rasmith authored
[AMD] [Quantization] Add override flag for attention dtype instead of using kv_cache_dtype trigger (#17331)
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 06 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 04 Jun, 2025 1 commit
-
-
Li, Jiang authored
-
- 03 Jun, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
-
Simon Mo authored
Signed-off-by: simon-mo <simon.mo@hey.com>
-
- 30 May, 2025 2 commits
-
-
vllmellm authored
Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
-
zhuwenwen authored
-