"src/diffusers/hooks/context_parallel.py" did not exist on "043ab2520f6a19fce78e6e060a68dbc947edb9f9"
Move quantization to new backend (#10363)
* Move quantization logic to GGML via new backend This moves the model aware logic to Go code and calls GGMLs quantization code for model creation. * Remove "add model quantizations" This is no longer needed now that quantization is implemented in Go+GGML code directly.
Showing
Please register or sign in to comment