- 11 Aug, 2025 1 commit
-
-
zhuwenwen authored
-
- 16 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 06 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 04 Jul, 2025 1 commit
-
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Co-authored-by:
Duncan Moss <dmoss@nvidia.com>
-
- 27 Jun, 2025 2 commits
-
-
zhuwenwen authored
-
li haoyang authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Signed-off-by:
Haoyang Li <Haoyang.Li@amd.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
- 21 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 Jun, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Jun, 2025 1 commit
-
-
Chiyue Wei authored
Signed-off-by:
Chiyue Wei <chiyuew@nvidia.com> Co-authored-by:
Chiyue Wei <chiyuew@nvidia.com>
-
- 04 Jun, 2025 1 commit
-
-
Vadim Gimpelson authored
-
- 27 May, 2025 1 commit
-
-
almersawi authored
Signed-off-by:
Islam Almersawi <islam.almersawi@openinnovation.ai> Co-authored-by:
Islam Almersawi <islam.almersawi@openinnovation.ai>
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 11 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 07 May, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Szymon Ożóg authored
Signed-off-by:
SzymonOzog <szymon.ozog@aleph-alpha.com> Signed-off-by:
SzymonOzog <szymon.ozog@gmail.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 05 May, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 01 May, 2025 1 commit
-
-
Sage Moore authored
[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867) Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 29 Apr, 2025 1 commit
-
-
TY-AMD authored
Signed-off-by:Tianyuan Wu <Tianyuan.Wu@amd.com>
-
- 27 Apr, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by:kaixih <kaixih@nvidia.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 02 Apr, 2025 1 commit
-
-
LukasBluebaum authored
Signed-off-by:lukas.bluebaum <lukas.bluebaum@aleph-alpha.com>
-
- 01 Apr, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
- 31 Mar, 2025 2 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
zhuwenwen authored
-
- 27 Mar, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <eliza@neuralmagic.com> Signed-off-by:
ElizaWszola <ewszola@redhat.com> Co-authored-by:
Lucas Wilkinson <wilkinson.lucas@gmail.com>
-
- 14 Mar, 2025 1 commit
-
-
DefTruth authored
-
- 12 Mar, 2025 2 commits
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Szymon Ożóg authored
Signed-off-by:SzymonOzog <szymon.ozog@aleph-alpha.com>
-
- 07 Mar, 2025 1 commit
-
-
Jinzhen Lin authored
Signed-off-by:Jinzhen Lin <linjinzhen@hotmail.com>
-
- 06 Mar, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 01 Mar, 2025 1 commit
-
-
YajieWang authored
-
- 26 Feb, 2025 1 commit
-
-
zhangshao authored
-
- 22 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 21 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Co-authored-by:
Patrick Horn <patrick.horn@gmail.com> Co-authored-by:
simon-mo <xmo@berkeley.edu> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 15 Feb, 2025 1 commit
-
-
Sage Moore authored
-
- 14 Feb, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 13 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 05 Feb, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:
simon-mo <xmo@berkeley.edu> Signed-off-by:
Lucas Wilkinson <lcwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkins@redhat.com> Signed-off-by:
Lucas Wilkinson <lwilkinson@neuralmagic.com> Co-authored-by:
simon-mo <xmo@berkeley.edu>
-
- 31 Jan, 2025 1 commit
-
-
Tyler Michael Smith authored
Integrates the block-quantized kernels introduced in https://github.com/vllm-project/vllm/pull/11868 for use in linear layers. Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-