• Daniël de Kok's avatar
    GPTQ CI improvements (#2151) · 67ef0649
    Daniël de Kok authored
    * Add more representative Llama GPTQ test
    
    The Llama GPTQ test is updated to use a model with the commonly-used
    quantizer config format and activation sorting. The old test is
    kept around (but renamed) since it tests the format produced by
    `text-generation-server quantize`.
    
    * Add support for manually triggering a release build
    67ef0649
ci_build.yaml 1.03 KB