Merge pull request #3682 from ollama/mxyng/quantize-all-the-things
quantize any fp16/fp32 model
Showing
llm/filetype.go
0 → 100644
server/model.go
0 → 100644
Please register or sign in to comment
quantize any fp16/fp32 model