fs/ggml/ggml.go · 19e6796eac59f691c07b1ff9d7fc13b47a53d3c4 · OpenDAS / ollama

Jesse Gross authored Oct 03, 2025

With the new version of GGML in #12245, KV cache quantization
no longer causes a fallback to CPU.

19e6796e

ggml.go 24.2 KB