- 18 Jan, 2026 1 commit
-
-
Wentao Ye authored
Signed-off-by:yewentao256 <zhyanwentao@126.com>
-
- 09 Jan, 2026 1 commit
-
-
Wentao Ye authored
[Perf] Optimize cutlass moe problem size calculation, 5.3% E2E Throughput improvement, 2.2% TTFT improvement (#31830) Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 09 Dec, 2025 1 commit
-
-
czhu-cohere authored
Signed-off-by:czhu-cohere <conway.zhu@cohere.com>
-
- 25 Nov, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:
mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com>
-
- 08 Oct, 2025 1 commit
-
-
Wentao Ye authored
Signed-off-by:
nicole-lihui <nicole.li@daocloud.io> Signed-off-by:
yewentao256 <zhyanwentao@126.com> Signed-off-by:
courage17340 <courage17340@163.com> Signed-off-by:
DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by:
Jacob Kahn <jacobkahn1@gmail.com> Signed-off-by:
Tyler Michael Smith <tlrmchlsmth@gmail.com> Signed-off-by:
Fadi Arafeh <fadi.arafeh@arm.com> Signed-off-by:
Roger Wang <hey@rogerw.io> Signed-off-by:
Agata Dobrzyniewicz <adobrzyniewicz@habana.ai> Signed-off-by:
Isotr0py <mozf@mail2.sysu.edu.cn> Signed-off-by:
zxw <1020938856@qq.com> Signed-off-by:
Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by:
wang.yuqi <noooop@126.com> Signed-off-by:
Cyrus Leung <cyrus.tl.leung@gmail.com> Signed-off-by:
Kunshang Ji <kunshang.ji@intel.com> Signed-off-by:
chenlang <chen.lang5@zte.com.cn> Signed-off-by:
youkaichao <youkaichao@gmail.com> Signed-off-by:
Jonas Kuebler <kuebj@amazon.com> Signed-off-by: jiang1.li <jiang1...
-
- 03 Oct, 2025 1 commit
-
-
Jun Jiang authored
Signed-off-by:
Jun Jiang <jasl9187@hotmail.com> Co-authored-by:
Luka Govedič <ProExpertProg@users.noreply.github.com>
-
- 01 Oct, 2025 1 commit
-
-
Johnny authored
Signed-off-by:
Johnny <johnnynuca14@gmail.com> Signed-off-by:
johnnynunez <johnnynuca14@gmail.com> Signed-off-by:
Johnny <johnnync13@gmail.com> Signed-off-by:
Salvatore Cena <cena@cenas.it> Co-authored-by:
Aidyn-A <31858918+Aidyn-A@users.noreply.github.com> Co-authored-by:
Salvatore Cena <cena@cenas.it>
-
- 20 Aug, 2025 1 commit
-
-
shixianc authored
Signed-off-by:Shixian Cui <shixian@amazon.com>
-
- 22 Jul, 2025 1 commit
-
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Signed-off-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Co-authored-by:
jiahanc <173873397+jiahanc@users.noreply.github.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 02 Jul, 2025 1 commit
-
-
Joonchen Liau authored
Signed-off-by:
kaln27 <liaojuncheng123@foxmail.com> Co-authored-by:
mgoin <mgoin64@gmail.com>
-
- 27 Jun, 2025 1 commit
-
-
Michael Goin authored
Signed-off-by:mgoin <mgoin64@gmail.com>
-
- 07 Jun, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Jun, 2025 1 commit
-
-
Chiyue Wei authored
Signed-off-by:
Chiyue Wei <chiyuew@nvidia.com> Co-authored-by:
Chiyue Wei <chiyuew@nvidia.com>
-
- 22 May, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Tyler Michael Smith <tysmith@redhat.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 13 May, 2025 1 commit
-
-
Driss Guessous authored
Signed-off-by:drisspg <drisspguessous@gmail.com>
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 08 May, 2025 1 commit
-
-
Shu Wang authored
Signed-off-by:Shu Wang <shuw@nvidia.com>
-
- 27 Mar, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <eliza@neuralmagic.com> Signed-off-by:
ElizaWszola <ewszola@redhat.com> Co-authored-by:
Lucas Wilkinson <wilkinson.lucas@gmail.com>
-
- 08 Mar, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 04 Mar, 2025 1 commit
-
-
kushanam authored
-
- 31 Jan, 2025 2 commits
-
-
Tyler Michael Smith authored
Integrates the block-quantized kernels introduced in https://github.com/vllm-project/vllm/pull/11868 for use in linear layers. Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
Lucas Wilkinson authored
-
- 05 Jan, 2025 1 commit
-
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
- 18 Dec, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Faraz Shahsavan <faraz.shahsavan@gmail.com> Co-authored-by:
ilmarkov <markovilya197@gmail.com> Co-authored-by:
Rahul Tuli <rahul@neuralmagic.com> Co-authored-by:
rshaw@neuralmagic.com <rshaw@neuralmagic.com>
-
- 22 Oct, 2024 1 commit
-
-
Lucas Wilkinson authored
-
- 04 Oct, 2024 1 commit
-
-
Lucas Wilkinson authored
-
- 06 Aug, 2024 1 commit
-
-
Luka Govedič authored
Co-authored-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 31 Jul, 2024 1 commit
-
-
Varun Sundar Rabindranath authored
Co-authored-by:Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 14 Jul, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 26 Jun, 2024 1 commit
-
-
Luka Govedič authored
Co-authored-by:
Chih-Chieh-Yang <7364402+cyang49@users.noreply.github.com> Co-authored-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com>
-
- 20 Jun, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 13 Jun, 2024 1 commit
-
-
Tyler Michael Smith authored
Co-authored-by:
Michael Goin <michael@neuralmagic.com> Co-authored-by:
youkaichao <youkaichao@gmail.com> Co-authored-by:
zifeitong <zifei.tong@parasail.io> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-neuralmagic@users.noreply.github.com>
-
- 09 Jun, 2024 1 commit
-
-
bnellnm authored
-
- 01 Jun, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 22 May, 2024 1 commit
-
-
Michael Goin authored
-
- 16 May, 2024 1 commit
-
-
Tyler Michael Smith authored
-