• Daniël de Kok's avatar
    Support AWQ quantization with bias (#2117) · 14980df2
    Daniël de Kok authored
    When the AWQ quantizer was used with a layer that uses a bias,
    the bias tensor was not correctly passed/used. Instead, the
    value `true`/`1.0` was added to the linear transformation.
    
    Correctly pass through the bias when it is not `None`.
    
    Fixes #2106.
    14980df2
linear.py 8.1 KB