llm/server.go · b13fbad0fe663e1493e8ed4b04ed553f861abba9 · OpenDAS / ollama

llm: Separate llamaServer and ollamaServer code paths · b13fbad0

Jesse Gross authored Nov 06, 2025

Originally, llamaServer represented old memory estimates, which
could be used with either the old or new engine. ollamaServer was
used only for the new estimates and new engine. Since these
implementations did not map directly to engine, there was engine-
specific code in common code paths.

Now that new estimates are always used for the new engine, there is
a direct mapping between server type and engine. This separates out
most of the engine-specific code into the correct implementation
to make things easier to understand.

b13fbad0

server.go 52.8 KB

Replace server.go