Merge pull request #1721 from Mhmd-Hisham/quantization-packing-bug-fix
[CUDA] Fixing quantization uint8 packing bug for NF4 and FP4