- 03 Jul, 2025 1 commit
-
-
Nicolò Lucchesi authored
[Misc] Fix `Unable to detect current VLLM config. Defaulting to NHD kv cache layout` warning (#20400) Signed-off-by:NickLucche <nlucches@redhat.com>
-
- 02 Jul, 2025 3 commits
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Liangliang Ma authored
Signed-off-by:Ma, Liangliang <liangliang.ma@intel.com>
-
- 01 Jul, 2025 2 commits
-
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
TY-AMD authored
Signed-off-by:Tianyuan Wu <Tianyuan.Wu@amd.com>
-
- 27 Jun, 2025 1 commit
-
-
Chendi.Xue authored
Signed-off-by:Chendi.Xue <chendi.xue@intel.com>
-
- 26 Jun, 2025 3 commits
-
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Kunshang Ji authored
Signed-off-by:Kunshang Ji <kunshang.ji@intel.com>
-
TJian authored
[Bugfix][V1][ROCm] Fix AITER Flash Attention Backend (Fix API Break and Local Attention Logic: affecting Llama4) (#19904) Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
- 25 Jun, 2025 3 commits
-
-
Chenyaaang authored
Signed-off-by:Chenyaaang <chenyangli@google.com>
-
Chengji Yao authored
Signed-off-by:Chengji Yao <chengjiyao@google.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 20 Jun, 2025 1 commit
-
-
qli88 authored
Signed-off-by:Qiang Li <qiang.li2@amd.com>
-
- 19 Jun, 2025 1 commit
-
-
zsolt-borbely-htec authored
Signed-off-by:Zsolt Borbely <zsolt.borbely@htecgroup.com>
-
- 18 Jun, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Zzz9990 authored
Signed-off-by:
fsx950223 <fsx950223@outlook.com> Signed-off-by:
charlifu <charlifu@amd.com> Co-authored-by:
fsx950223 <fsx950223@outlook.com> Co-authored-by:
charlifu <charlifu@amd.com>
-
- 17 Jun, 2025 3 commits
-
-
Charlie Fu authored
Signed-off-by:charlifu <charlifu@amd.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 16 Jun, 2025 2 commits
-
-
Isotr0py authored
Signed-off-by:Isotr0py <2037008807@qq.com>
-
Chengji Yao authored
Signed-off-by:
Chengji Yao <chengjiyao@google.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 15 Jun, 2025 1 commit
-
-
22quinn authored
Signed-off-by:22quinn <33176974+22quinn@users.noreply.github.com>
-
- 13 Jun, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by:luka <luka@neuralmagic.com>
-
- 12 Jun, 2025 1 commit
-
-
Luka Govedič authored
Signed-off-by:
Luka Govedič <lgovedic@redhat.com> Co-authored-by:
Sage Moore <sage@neuralmagic.com>
-
- 10 Jun, 2025 1 commit
-
-
Rachel Guo authored
[BugFix][FlashInfer] Fix attention backend interface mismatch with unexpected keyword `use_irope` (#19134) Signed-off-by:Yunqiu Guo <guorachel@meta.com>
-
- 09 Jun, 2025 1 commit
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
- 07 Jun, 2025 1 commit
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 05 Jun, 2025 1 commit
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 04 Jun, 2025 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:nicklucche <nlucches@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Kaixi Hou authored
-
Li, Jiang authored
-
- 03 Jun, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 30 May, 2025 1 commit
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 29 May, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 28 May, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
- 23 May, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 21 May, 2025 1 commit
-
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-