- 01 Feb, 2025 2 commits
-
-
Simon Mo authored
From @mgoin in https://github.com/vllm-project/vllm/pull/12638. I cannot push to that branch, so this is a new PR to unblock the release.
---------
Signed-off-by: mgoin <michael@neuralmagic.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
Lucas Wilkinson authored
This PR implements DeepSeek-V3 support by performing matrix absorption on the fp8 weights.
---------
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com>
-
- 31 Jan, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Zhuohan Li <zhuohan123@gmail.com>
Co-authored-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
- 29 Jan, 2025 1 commit
-
-
Yanyi Liu authored
Signed-off-by: liuyanyi <wolfsonliu@163.com>
-
- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 27 Jan, 2025 2 commits
-
-
Nicolò Lucchesi authored
[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and `prompt_logprobs` with ChunkedPrefill (#10132)
Signed-off-by: NickLucche <nlucches@redhat.com>
Signed-off-by: wallashss <wallashss@ibm.com>
Co-authored-by: wallashss <wallashss@ibm.com>
-
Bowen Wang authored
Signed-off-by: Bowen Wang <abmfy@icloud.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
- 26 Jan, 2025 1 commit
-
-
Matthew Hendrey authored
Signed-off-by: Matthew Hendrey <matthew.hendrey@gmail.com>
Signed-off-by: Shangming Cai <caishangming@linux.alibaba.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Yuan Tang <terrytangyuan@gmail.com>
Signed-off-by: Isotr0py <2037008807@qq.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Co-authored-by: shangmingc <caishangming@linux.alibaba.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Yuan Tang <terrytangyuan@gmail.com>
Co-authored-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>
Co-authored-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: wangxiyuan <wangxiyuan1007@gmail.com>
-
- 24 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
- 23 Jan, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: Micah Williamson <micah.williamson@amd.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 22 Jan, 2025 2 commits
-
-
Konrad Zawora authored
Signed-off-by: Konrad Zawora <kzawora@habana.ai>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 21 Jan, 2025 2 commits
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Jinzhen Lin authored
[Kernel] optimize moe_align_block_size for cuda graph and large num_experts (e.g. DeepSeek-V3) (#12222)
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
- 19 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 17 Jan, 2025 1 commit
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 16 Jan, 2025 3 commits
-
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
-
- 15 Jan, 2025 4 commits
-
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
-
Joe Runde authored
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
-
kewang-xlnx authored
Signed-off-by: kewang-xlnx <kewang@xilinx.com>
Signed-off-by: kewang2 <kewang2@amd.com>
Co-authored-by: kewang2 <kewang2@amd.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 10 Jan, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Chen Zhang authored
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
-
- 09 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
Co-authored-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
-
- 08 Jan, 2025 4 commits
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Wallas Henrique authored
Signed-off-by: Wallas Santos <wallashss@ibm.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 06 Jan, 2025 2 commits
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
Cody Yu authored
-
- 03 Jan, 2025 1 commit
-
-
Aurick Qiao authored
Co-authored-by: Aurick Qiao <aurick.qiao@snowflake.com>
-
- 30 Dec, 2024 2 commits
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 29 Dec, 2024 1 commit
-
-
Kuntai Du authored
Signed-off-by: KuntaiDu <kuntai@uchicago.edu>
-
- 28 Dec, 2024 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 27 Dec, 2024 1 commit
-
-
Simon Mo authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
Co-authored-by: robertgshaw2-neuralmagic <rshaw@neuralmagic.com>
-
- 26 Dec, 2024 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: HandH1998 <1335248067@qq.com>
-