"vllm/engine/arg_utils.py" did not exist on "1a956e136beae057746af6257ffa8da601730f10"
- 23 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 10 Sep, 2025 1 commit
-
-
zhuwenwen authored
-
- 16 Jul, 2025 1 commit
-
-
zhuwenwen authored
-
- 06 Jul, 2025 1 commit
-
-
Lucas Wilkinson authored
Signed-off-by:Lucas Wilkinson <lwilkins@redhat.com>
-
- 04 Jul, 2025 1 commit
-
-
Duncan Moss authored
Signed-off-by:
Duncan Moss <djm.moss@gmail.com> Co-authored-by:
Duncan Moss <dmoss@nvidia.com>
-
- 27 Jun, 2025 1 commit
-
-
li haoyang authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Signed-off-by:
Haoyang Li <Haoyang.Li@amd.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
- 21 Jun, 2025 1 commit
-
-
zhuwenwen authored
-
- 07 Jun, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <ewszola@redhat.com> Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 05 Jun, 2025 1 commit
-
-
Chiyue Wei authored
Signed-off-by:
Chiyue Wei <chiyuew@nvidia.com> Co-authored-by:
Chiyue Wei <chiyuew@nvidia.com>
-
- 04 Jun, 2025 1 commit
-
-
Vadim Gimpelson authored
-
- 13 May, 2025 1 commit
-
-
Tao He authored
Implements dual-chunk-flash-attn backend for dual chunk attention with sparse attention support (#11844)
-
- 09 May, 2025 1 commit
-
-
Pavani Majety authored
-
- 07 May, 2025 2 commits
-
-
Yong Hoon Shin authored
Signed-off-by:Yong Hoon Shin <yhshin@meta.com>
-
Szymon Ożóg authored
Signed-off-by:
SzymonOzog <szymon.ozog@aleph-alpha.com> Signed-off-by:
SzymonOzog <szymon.ozog@gmail.com> Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Isotr0py <2037008807@qq.com>
-
- 01 May, 2025 1 commit
-
-
Sage Moore authored
[torch.compile] Add torch inductor pass for fusing silu_and_mul with subsequent scaled_fp8_quant operations (#10867) Signed-off-by:Sage Moore <sage@neuralmagic.com>
-
- 27 Apr, 2025 1 commit
-
-
Kaixi Hou authored
Signed-off-by:kaixih <kaixih@nvidia.com>
-
- 11 Apr, 2025 1 commit
-
-
DefTruth authored
Signed-off-by:DefTruth <qiustudent_r@163.com>
-
- 02 Apr, 2025 1 commit
-
-
LukasBluebaum authored
Signed-off-by:lukas.bluebaum <lukas.bluebaum@aleph-alpha.com>
-
- 01 Apr, 2025 1 commit
-
-
Ilya Markov authored
Signed-off-by:
ilmarkov <imarkov@redhat.com> Co-authored-by:
ilmarkov <imarkov@redhat.com>
-
- 31 Mar, 2025 2 commits
-
-
youkaichao authored
Signed-off-by:youkaichao <youkaichao@gmail.com>
-
zhuwenwen authored
-
- 27 Mar, 2025 1 commit
-
-
ElizaWszola authored
Signed-off-by:
ElizaWszola <eliza@neuralmagic.com> Signed-off-by:
ElizaWszola <ewszola@redhat.com> Co-authored-by:
Lucas Wilkinson <wilkinson.lucas@gmail.com>
-
- 12 Mar, 2025 2 commits
-
-
Pavani Majety authored
Signed-off-by:Pavani Majety <pmajety@nvidia.com>
-
Szymon Ożóg authored
Signed-off-by:SzymonOzog <szymon.ozog@aleph-alpha.com>
-
- 26 Feb, 2025 1 commit
-
-
zhangshao authored
-
- 22 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 15 Feb, 2025 1 commit
-
-
Sage Moore authored
-
- 14 Feb, 2025 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 13 Feb, 2025 1 commit
-
-
Kaixi Hou authored
-
- 31 Jan, 2025 1 commit
-
-
Tyler Michael Smith authored
Integrates the block-quantized kernels introduced in https://github.com/vllm-project/vllm/pull/11868 for use in linear layers. Signed-off-by:
Tyler Michael Smith <tyler@neuralmagic.com>
-
- 23 Jan, 2025 1 commit
-
-
Gregory Shtrasberg authored
Signed-off-by:
Gregory Shtrasberg <Gregory.Shtrasberg@amd.com> Co-authored-by:
Micah Williamson <micah.williamson@amd.com>
-
- 15 Jan, 2025 1 commit
-
-
Jee Jee Li authored
Signed-off-by:Jee Jee Li <pandaleefree@gmail.com>
-
- 05 Jan, 2025 1 commit
-
-
Lu Fang authored
Signed-off-by:Lu Fang <lufang@fb.com>
-
- 19 Dec, 2024 1 commit
-
-
Tyler Michael Smith authored
Signed-off-by:Tyler Michael Smith <tyler@neuralmagic.com>
-
- 18 Dec, 2024 1 commit
-
-
Dipika Sikka authored
Co-authored-by:
Faraz Shahsavan <faraz.shahsavan@gmail.com> Co-authored-by:
ilmarkov <markovilya197@gmail.com> Co-authored-by:
Rahul Tuli <rahul@neuralmagic.com> Co-authored-by:
rshaw@neuralmagic.com <rshaw@neuralmagic.com>
-
- 13 Dec, 2024 1 commit
-
-
Luka Govedič authored
Signed-off-by:
luka <luka@neuralmagic.com> Co-authored-by:
Varun Sundar Rabindranath <varun@neuralmagic.com>
-
- 11 Dec, 2024 1 commit
-
-
王敏 authored
-
- 23 Nov, 2024 1 commit
-
-
kliuae authored
-
- 08 Nov, 2024 1 commit
-
-
Luka Govedič authored
Signed-off-by:
luka <luka@neuralmagic.com> Co-authored-by:
youkaichao <youkaichao@126.com>
-
- 07 Nov, 2024 1 commit
-
-
Hanzhi Zhou authored
Signed-off-by:Hanzhi Zhou <hanzhi713@gmail.com>
-