• Jesse Gross's avatar
    ggml: Support closing backends · 756c78cf
    Jesse Gross authored
    In order to iteratively find the best memory allocation, we need to
    be able to free backend memory so we can try again.
    756c78cf
cache.go 7.59 KB