ml/backend.go · d773b7d67161cba40a342e74d66b7363dfdd38d2 · OpenDAS / ollama

backend: API to support full precision matmul · d773b7d6

Jesse Gross authored Feb 13, 2025

Most tensor backends try to optimize performance by using a lower
precision for matmuls. However, some operations (such as kq) on
some models are sensitive to this and require full precision.

d773b7d6

backend.go 4.31 KB

Replace backend.go