    Switch back to subprocessing for llama.cpp · 58d95cc9
    Daniel Hiltgen authored
    This should resolve a number of memory-leak and stability defects by allowing
    us to isolate llama.cpp in a separate process, shut it down when idle, and
    gracefully restart it if it has problems.  This also serves as a first step
    toward running multiple copies to support multiple models concurrently.
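    The supervision pattern the commit describes — run the inference engine as a
    child process and relaunch it in a fresh process if it fails — can be
    sketched as below. This is a hypothetical illustration, not the actual
    Ollama code; the `supervise` helper and its parameters are invented for the
    example, and the idle-shutdown and multi-model plumbing are omitted.

    ```go
    package main

    import (
    	"fmt"
    	"os/exec"
    )

    // supervise runs the given command, restarting it in a fresh process each
    // time it exits with an error, up to maxRestarts attempts. A clean exit
    // (for example, an idle shutdown) stops the loop. Hypothetical sketch only.
    func supervise(name string, args []string, maxRestarts int) int {
    	restarts := 0
    	for restarts < maxRestarts {
    		cmd := exec.Command(name, args...)
    		if err := cmd.Start(); err != nil {
    			return restarts // could not launch at all
    		}
    		if err := cmd.Wait(); err == nil {
    			return restarts // clean exit: nothing to restart
    		}
    		restarts++ // crashed: leaked memory dies with the process
    	}
    	return restarts
    }

    func main() {
    	// "true" exits cleanly, simulating an idle shutdown.
    	fmt.Println("clean exit restarts:", supervise("true", nil, 3))
    	// "false" exits non-zero, simulating a crashing subprocess.
    	fmt.Println("crash restarts:", supervise("false", nil, 3))
    }
    ```

    Because the engine lives in its own process, any memory it leaks is
    reclaimed by the OS on each restart, which is what makes this approach
    attractive for the defects the commit mentions.
    
    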