- 26 Mar, 2025 17 commits
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Alex Brooks authored
Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> Co-authored-by:
Joe Runde <joe@joerun.de>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
cyyever authored
Signed-off-by:cyy <cyyever@outlook.com>
-
Harry Mellor authored
Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
-
Reid authored
Signed-off-by:
reidliu41 <reid201711@gmail.com> Co-authored-by:
reidliu41 <reid201711@gmail.com>
-
wwl2755 authored
Signed-off-by:wwl2755 <wangwenlong2755@gmail.com>
-
vllmellm authored
Signed-off-by:
vllmellm <vllm.ellm@embeddedllm.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
-
Bryan Lu authored
Signed-off-by:
Bryan Lu <yuzhelu@amazon.com> Co-authored-by:
Cyrus Leung <cyrus.tl.leung@gmail.com>
-
daniel-salib authored
Signed-off-by:Daniel Salib <danielsalib@meta.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Lucas Wilkinson authored
[BugFix] Fix nightly MLA failure (FA2 + MLA chunked prefill, i.e. V1, producing bad results) (#15492) Signed-off-by:LucasWilkinson <lwilkinson@neuralmagic.com>
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 25 Mar, 2025 23 commits
-
-
Varun Sundar Rabindranath authored
Signed-off-by:
Varun Sundar Rabindranath <varun@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
Chenyaaang authored
Signed-off-by:
Chenyaaang <llccyy1212@gmail.com> Signed-off-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com> Co-authored-by:
rshaw@neuralmagic.com <robertgshaw2@gmail.com>
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
Nick Hill authored
Signed-off-by:Nick Hill <nhill@redhat.com>
-
Nicolò Lucchesi authored
Signed-off-by:NickLucche <nlucches@redhat.com>
-
Maximilien de Bayser authored
Signed-off-by:Max de Bayser <mbayser@br.ibm.com>
-
Joe Runde authored
Signed-off-by:Joe Runde <Joseph.Runde@ibm.com>
-
Antonio Gómez authored
Co-authored-by:ServerAI <ai@exc-mad-ai.com>
-
yarongmu-google authored
Signed-off-by:Yarong Mu <ymu@google.com>
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
Szymon Ożóg authored
Signed-off-by:SzymonOzog <szymon.ozog@gmail.com>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Cyrus Leung authored
Signed-off-by:DarkLight1337 <tlleungac@connect.ust.hk>
-
Md. Shafi Hussain authored
Signed-off-by:Md. Shafi Hussain <Md.Shafi.Hussain@ibm.com>
-
Thien Tran authored
Signed-off-by:Thien Tran <gau.nernst@yahoo.com.sg>
-
Siyuan Liu authored
Signed-off-by:Siyuan Liu <lsiyuan@google.com>
-
Lu Fang authored
Fix CUDA kernel index data type in vllm/csrc/quantization/gptq_marlin/awq_marlin_repack.cu +10 (#15160) Signed-off-by:
Lu Fang <lufang@fb.com> Co-authored-by:
Richard Barnes <rbarnes@meta.com>
-
Woosuk Kwon authored
Signed-off-by:Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Russell Bryant authored
Signed-off-by:
Russell Bryant <rbryant@redhat.com> Co-authored-by:
Loc Huynh <jc1da.3011@gmail.com> Co-authored-by:
Michal Moskal <michal@moskal.me>
-
Chauncey authored
Signed-off-by:chaunceyjiang <chaunceyjiang@gmail.com>
-
Tyler Michael Smith authored
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-