Fix gguf loading via Transformers (#2596)
* hf: support loading gguf files
* code review
* code review
* code clean up
* note about use_fast compat with gguf
---------
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>