    Add support for quantization_config (#2842) · 758c5ed8
    Jerry Zhang authored
    * Add support for quantization_config
    
    Summary:
    Previously, quantization_config was ignored, so torchao-quantized models were not supported.
    This PR adds that support.
    
    Test Plan:
    lm_eval --model hf --model_args pretrained=jerryzh168/gemma3-int4wo --tasks hellaswag --device cuda:0 --batch_size 8
    
    * quantization_config is optional
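
A minimal sketch of what "optional" could mean here, assuming the loader forwards quantization_config only when one is present. The helper name `build_model_kwargs`, the `revision` argument, and the example config keys are hypothetical and are not taken from huggingface.py.

```python
from typing import Any, Dict, Optional


def build_model_kwargs(
    revision: str = "main",
    quantization_config: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Hypothetical helper: assemble keyword arguments for a model loader."""
    kwargs: Dict[str, Any] = {"revision": revision}
    # Forward quantization_config only when the checkpoint provides one,
    # so unquantized models load exactly as before.
    if quantization_config is not None:
        kwargs["quantization_config"] = quantization_config
    return kwargs


# Unquantized checkpoint: the key is simply absent.
plain = build_model_kwargs()
# Quantized checkpoint: the config is passed through unchanged.
quantized = build_model_kwargs(quantization_config={"quant_method": "torchao"})
```

Omitting the key, rather than passing `None`, is one way to keep the default loading path unchanged for checkpoints without a quantization_config.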
huggingface.py 60.8 KB