[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (#26092)