"TensorFlow2x/vscode:/vscode.git/clone" did not exist on "441c8f408cf451f41e784171335c0e59d5972d8e"
Use symmetric quantization in the `quantize` subcommand (#2120)
Packing of asymmetric quantization is broken, all (q)zeros values of `0` get reset to `1`, resulting in a loss of accuracy. So instead use symmetric quantization. To be able to distinguish models with symmetric and asymmetric quantization, a new config tensor `gptq_sym` is added. If this tensor is not present, we assume `sym=False`.
Showing
Please register or sign in to comment