- 31 Jan, 2025 3 commits
-
-
Tyler Michael Smith authored
Integrates the block-quantized kernels introduced in https://github.com/vllm-project/vllm/pull/11868 for use in linear layers.
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Robert Shaw authored
SUMMARY:
* previous PR for pulling in block configs also changed defaults for FP8 (https://github.com/vllm-project/vllm/pull/11589/files)
* this broke L4 MoE since there was not enough SHM for the default configuration
* this reverts the non-block example to the default
Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
-
Robert Shaw authored
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
- 30 Jan, 2025 1 commit
-
-
Robert Shaw authored
Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
- 29 Jan, 2025 1 commit
-
-
Jinzhen Lin authored
-
- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 26 Jan, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
- 25 Jan, 2025 1 commit
-
-
Divakar Verma authored
Signed-off-by: Divakar Verma <divakar.verma@amd.com>
-
- 23 Jan, 2025 3 commits
-
-
Dipika Sikka authored
[BugFix] Fix parameter names and `process_after_weight_loading` for W4A16 MoE Group Act Order (#11528)
Signed-off-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: Micah Williamson <micah.williamson@amd.com>
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
- 21 Jan, 2025 2 commits
-
-
Nicolò Lucchesi authored
Signed-off-by: NickLucche <nlucches@redhat.com>
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 19 Jan, 2025 3 commits
-
-
Martin Gleize authored
Signed-off-by: Martin Gleize <mgleize@meta.com>
Co-authored-by: mgleize user <mgleize@a100-st-p4de24xlarge-4.fair-a100.hpcaas>
-
Roger Wang authored
Signed-off-by: Roger Wang <ywang@roblox.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Co-authored-by: imkero <kerorek@outlook.com>
Co-authored-by: DarkLight1337 <tlleungac@connect.ust.hk>
-
yancong authored
-
- 17 Jan, 2025 2 commits
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 16 Jan, 2025 3 commits
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
Elfie Guo authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
- 15 Jan, 2025 3 commits
-
-
kewang-xlnx authored
Signed-off-by: kewang-xlnx <kewang@xilinx.com>
Signed-off-by: kewang2 <kewang2@amd.com>
Co-authored-by: kewang2 <kewang2@amd.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
Rahul Tuli authored
Signed-off-by: Rahul Tuli <rahul@neuralmagic.com>
-
Jee Jee Li authored
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
-
- 13 Jan, 2025 2 commits
-
-
Steve Luo authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
Isotr0py authored
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
-
- 12 Jan, 2025 1 commit
-
-
Avshalom Manevich authored
-
- 11 Jan, 2025 1 commit
-
-
shaochangxu authored
Signed-off-by: shaochangxu.scx <shaochangxu.scx@antgroup.com>
Co-authored-by: shaochangxu.scx <shaochangxu.scx@antgroup.com>
-
- 10 Jan, 2025 3 commits
-
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
wangxiyuan authored
Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>
Signed-off-by: youkaichao <youkaichao@gmail.com>
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
cennn authored
Co-authored-by: youkaichao <youkaichao@gmail.com>
-
- 09 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
Co-authored-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
-
- 08 Jan, 2025 4 commits
-
-
rasmith authored
Signed-off-by: Randall Smith <Randall.Smith@amd.com>
-
Robert Shaw authored
Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
-
Yan Ma authored
Signed-off-by: yan ma <yan.ma@intel.com>
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 07 Jan, 2025 1 commit
-
-
youkaichao authored
Signed-off-by: youkaichao <youkaichao@gmail.com>
-
- 06 Jan, 2025 1 commit
-
-
Lucas Tucker authored
Signed-off-by: lucast2021 <lucast2021@headroyce.org>
Co-authored-by: lucast2021 <lucast2021@headroyce.org>
-
- 04 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Isotr0py <2037008807@qq.com>
Co-authored-by: Isotr0py <2037008807@qq.com>
-
- 03 Jan, 2025 1 commit
-
-
Lu Fang authored
Signed-off-by: Lu Fang <lufang@fb.com>
-