- 25 Mar, 2025 1 commit
-
-
Lu Fang authored
Fix CUDA kernel index data type in vllm/csrc/quantization/gptq_marlin/awq_marlin_repack.cu +10 (#15160) Signed-off-by:
Lu Fang <lufang@fb.com> Co-authored-by:
Richard Barnes <rbarnes@meta.com>
-
- 28 Jan, 2025 1 commit
-
-
Harry Mellor authored
Signed-off-by:Harry Mellor <19981378+hmellor@users.noreply.github.com>
-
- 04 Oct, 2024 1 commit
-
-
Lucas Wilkinson authored
-
- 31 Jul, 2024 1 commit
-
-
HandH1998 authored
-
- 30 Jul, 2024 1 commit
-
-
Tyler Michael Smith authored
-
- 21 Jul, 2024 1 commit
-
-
Alexander Matveev authored
-
- 09 Jun, 2024 1 commit
-
-
bnellnm authored
-
- 22 May, 2024 1 commit
-
-
Michael Goin authored
-
- 16 May, 2024 1 commit
-
-
Alexander Matveev authored
Co-authored-by:Robert Shaw <rshaw@neuralmagic.com>
-