"...composable_kernel_rocm.git" did not exist on "89e1ebd4d5b1bd21fe4ad58fba37cc9f5e17f4a6"
Move quantization to new backend (#10363)
* Move quantization logic to GGML via new backend

  This moves the model-aware logic to Go code and calls GGML's quantization code for model creation.

* Remove "add model quantizations"

  This is no longer needed now that quantization is implemented in Go+GGML code directly.