- 10 Jun, 2025 1 commit
-
-
Rachel Guo authored
[BugFix][FlashInfer] Fix attention backend interface mismatch with unexpected keyword `use_irope` (#19134) Signed-off-by:Yunqiu Guo <guorachel@meta.com>
-
- 09 Jun, 2025 1 commit
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
- 07 Jun, 2025 1 commit
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 05 Jun, 2025 1 commit
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 04 Jun, 2025 4 commits
-
-
Nicolò Lucchesi authored
Signed-off-by:nicklucche <nlucches@redhat.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Kaixi Hou authored
-
Li, Jiang authored
-
- 03 Jun, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Simon Mo authored
Signed-off-by:simon-mo <simon.mo@hey.com>
-
- 30 May, 2025 1 commit
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
- 29 May, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 28 May, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
- 23 May, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 21 May, 2025 2 commits
-
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
Mark McLoughlin authored
Signed-off-by:Mark McLoughlin <markmc@redhat.com>
-
- 16 May, 2025 1 commit
-
-
kliuae authored
Signed-off-by:kf <kuanfu.liu@embeddedllm.com>
-
- 15 May, 2025 2 commits
-
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 14 May, 2025 2 commits
-
-
bnellnm authored
-
Michael Goin authored
-
- 11 May, 2025 2 commits
-
-
TJian authored
Signed-off-by:tjtanaa <tunjian.tan@embeddedllm.com>
-
Gregory Shtrasberg authored
Signed-off-by:Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 10 May, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 09 May, 2025 3 commits
-
-
vllmellm authored
Signed-off-by:vllmellm <vllm.ellm@embeddedllm.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Co-authored-by:
qli88 <qiang.li2@amd.com> Co-authored-by:
Hongxia Yang <62075498+hongxiayang@users.noreply.github.com>
-
- 08 May, 2025 2 commits
-
-
Jevin Jiang authored
-
Chanh Nguyen authored
Signed-off-by:
Chanh Nguyen <cnguyen@linkedin.com> Co-authored-by:
Chanh Nguyen <cnguyen@linkedin.com>
-
- 06 May, 2025 3 commits
-
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Jevin Jiang authored
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
- 04 May, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 02 May, 2025 2 commits
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 29 Apr, 2025 2 commits
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-
Zhengyuan Su (苏政渊) authored
Signed-off-by:
苏政渊 <suzhengyuan@moonshot.cn> Co-authored-by:
苏政渊 <suzhengyuan@moonshot.cn>
-
- 28 Apr, 2025 2 commits
-
-
Lucas Wilkinson authored
[BugFix] Fix cascade attention - RuntimeError: scheduler_metadata must have shape (metadata_size) (#17283) Signed-off-by:Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
Jee Jee Li <pandaleefree@gmail.com> Co-authored-by:
Aaron Pham <contact@aarnphm.xyz>
-
- 27 Apr, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by:Chen Zhang <zhangch99@outlook.com>
-