Refactor `hf` modeling code (#1096)
* modularize HFLM code
* pass through extra kwargs to AutoModel.from_pretrained call
* remove explicit model_kwargs
* rename gptq -> autogptq
* fix tokenizer pad token errors
* ensure model always respects device_map and autogptq's selected devices
* add a _get_config helper fn
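
For readers unfamiliar with the pad token item above, here is a minimal sketch of one common way such errors are avoided: falling back to the end-of-sequence token when a tokenizer defines no pad token. This is an illustration under stated assumptions, not the code from this commit. The helper name `ensure_pad_token` is hypothetical, and a `SimpleNamespace` stands in for a real tokenizer so the snippet runs without downloading a model.

```python
from types import SimpleNamespace


def ensure_pad_token(tokenizer):
    """Hypothetical helper: make sure `tokenizer.pad_token` is set.

    Assumption: the tokenizer exposes `pad_token` and `eos_token`
    attributes, as Hugging Face tokenizers do.
    """
    if tokenizer.pad_token is None:
        if getattr(tokenizer, "eos_token", None) is None:
            # Nothing sensible to fall back to, so fail loudly.
            raise ValueError("tokenizer has neither a pad token nor an EOS token")
        # Reuse the EOS token for padding so batched encoding does not fail.
        tokenizer.pad_token = tokenizer.eos_token
    return tokenizer


# Stand-in object used only for illustration; a real tokenizer would be
# loaded with AutoTokenizer.from_pretrained(...).
tok = ensure_pad_token(SimpleNamespace(pad_token=None, eos_token="</s>"))
print(tok.pad_token)
```

Reusing the EOS token is a frequent choice for causal language models because padded positions are normally masked out anyway. Whether the refactor takes exactly this approach is not stated in the commit message.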