• Hailey Schoelkopf's avatar
    Refactor `hf` modeling code (#1096) · e0eda4d3
    Hailey Schoelkopf authored
    * modularize HFLM code
    
    * pass through extra kwargs to AutoModel.from_pretrained call
    
    * remove explicit model_kwargs
    
    * rename gptq -> autogptq
    
    * fix tokenizer pad token errors
    
    * ensure model always respects device_map and autogptq's selected devices
    
    * add a _get_config helper fn
    e0eda4d3
huggingface.py 47.4 KB