quantize any fp16/fp32 model
- FROM /path/to/{safetensors,pytorch}
- FROM /path/to/fp{16,32}.bin
- FROM model:fp{16,32}
Showing
llm/filetype.go
0 → 100644
server/model.go
0 → 100644
types/ordered/map.go
0 → 100644
Please register or sign in to comment