• Jesse Gross's avatar
    backend: API to support full precision matmul · d773b7d6
    Jesse Gross authored
    Most tensor backends try to optimize performance by using a lower
    precision for matmuls. However, some operations (such as kq) on
    some models are sensitive to this and require full precision.
    d773b7d6
backend.go 4.31 KB