Merge pull request #1721 from Mhmd-Hisham/quantization-packing-bug-fix
[CUDA] Fixing quantization uint8 packing bug for NF4 and FP4
Showing
Please register or sign in to comment
[CUDA] Fixing quantization uint8 packing bug for NF4 and FP4