• Michael Yang's avatar
    model: load non-repeated tensors into multiple backends · bfce55db
    Michael Yang authored
    some tensors are expected to be used in repeating layers but are not
    themselves repeated. this change copies these tensors into the same
    backends as their repeating counterparts to minimize copying tensors
    between backends
    bfce55db
ggml.go 20.2 KB