- 31 Jan, 2025 2 commits
-
-
Tyler Michael Smith authored
Integrates the block-quantized kernels introduced in https://github.com/vllm-project/vllm/pull/11868 for use in linear layers.
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
-
Robert Shaw authored
Co-authored-by: simon-mo <xmo@berkeley.edu>
-
- 23 Jan, 2025 1 commit
-
-
Dipika Sikka authored
[BugFix] Fix parameter names and `process_after_weight_loading` for W4A16 MoE Group Act Order (#11528)
Signed-off-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 17 Jan, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
-
- 16 Jan, 2025 1 commit
-
-
Elfie Guo authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
-
- 09 Jan, 2025 1 commit
-
-
Cyrus Leung authored
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
Co-authored-by: Maxime Fournioux <55544262+mfournioux@users.noreply.github.com>
-
- 30 Dec, 2024 1 commit
-
-
Li, Jiang authored
Signed-off-by: jiang1.li <jiang1.li@intel.com>
-
- 27 Dec, 2024 2 commits
-
-
Robert Shaw authored
-
Simon Mo authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
Co-authored-by: robertgshaw2-neuralmagic <rshaw@neuralmagic.com>
-
- 26 Dec, 2024 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
Signed-off-by: simon-mo <simon.mo@hey.com>
Signed-off-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: simon-mo <simon.mo@hey.com>
Co-authored-by: simon-mo <xmo@berkeley.edu>
Co-authored-by: HandH1998 <1335248067@qq.com>
-
- 06 Nov, 2024 1 commit
-
-
Michael Goin authored
Signed-off-by: mgoin <michael@neuralmagic.com>
-
- 28 Oct, 2024 1 commit
-
-
wangshuai09 authored
Signed-off-by: wangshuai09 <391746016@qq.com>
-
- 18 Sep, 2024 1 commit
-
-
Cyrus Leung authored
-
- 30 Aug, 2024 1 commit
-
-
Wenxiang authored
Co-authored-by: Your Name <you@example.com>
Co-authored-by: Zeqi Lin <zelin@microsoft.com>
Co-authored-by: Zeqi Lin <Zeqi.Lin@microsoft.com>
-
- 27 Aug, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
-
- 22 Aug, 2024 2 commits
-
-
Dipika Sikka authored
-
Michael Goin authored
-
- 21 Aug, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by: ElizaWszola <eliza@neuralmagic.com>
-
- 16 Aug, 2024 2 commits
-
-
Mor Zusman authored
-
Charlie Fu authored
-
- 13 Aug, 2024 1 commit
-
-
Dipika Sikka authored
-
- 07 Aug, 2024 1 commit
-
-
Michael Goin authored
-
- 29 Jul, 2024 1 commit
-
-
Thomas Parnell authored
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
-
- 25 Jul, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: mgoin <michael@neuralmagic.com>
-
- 23 Jul, 2024 2 commits
-
-
Michael Goin authored
-
Michael Goin authored
-
- 20 Jul, 2024 1 commit
-
-
Robert Shaw authored
-
- 19 Jul, 2024 1 commit
-
-
Robert Shaw authored
-
- 16 Jul, 2024 1 commit
-
-
Michael Goin authored
-
- 14 Jul, 2024 1 commit
-
-
Robert Shaw authored
-
- 11 Jul, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic.com>
-
- 07 Jul, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
-
- 03 Jul, 2024 2 commits
-
-
Michael Goin authored
-
youkaichao authored
-
- 02 Jul, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
Co-authored-by: Michael Goin <michael@neuralmagic.com>
-
- 01 Jul, 2024 1 commit
-
-
youkaichao authored
-
- 30 Jun, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
-
- 28 Jun, 2024 1 commit
-
-
Robert Shaw authored
Co-authored-by: Robert Shaw <rshaw@neuralmagic>
-
- 20 Jun, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 14 Jun, 2024 1 commit
-
-
Tyler Michael Smith authored
-