    llm: Separate llamaServer and ollamaServer code paths · b13fbad0
    Jesse Gross authored
    Originally, llamaServer represented old memory estimates, which
    could be used with either the old or new engine. ollamaServer was
    used only for the new estimates and the new engine. Since these
    implementations did not map directly to an engine, there was
    engine-specific code in common code paths.
    
    Now that new estimates are always used for the new engine, there is
    a direct mapping between server type and engine. This moves most
    of the engine-specific code into the corresponding implementation
    to make the code easier to understand.
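
The separation described above can be illustrated with a minimal Go sketch. Only the type names llamaServer and ollamaServer come from the commit message. The server interface, the common struct, the Engine and EstimateMemory methods, and the newServer constructor are hypothetical names invented for this illustration and are not taken from server.go. The idea is that the engine is chosen once at construction, so shared code calls interface methods instead of branching on the engine.

```go
package main

import "fmt"

// server is a hypothetical interface standing in for whatever the common
// code paths call. The method names are assumptions for illustration only.
type server interface {
	Engine() string
	EstimateMemory(layers int) string
}

// common holds state shared by both implementations (hypothetical field).
type common struct {
	model string
}

// llamaServer maps to the old engine with the old memory estimates.
type llamaServer struct{ common }

// ollamaServer maps to the new engine with the new memory estimates.
type ollamaServer struct{ common }

func (s *llamaServer) Engine() string  { return "old" }
func (s *ollamaServer) Engine() string { return "new" }

// Each type carries its own engine-specific logic, so no shared function
// needs an "if new engine" branch.
func (s *llamaServer) EstimateMemory(layers int) string {
	return fmt.Sprintf("%s: old estimate for %d layers", s.model, layers)
}

func (s *ollamaServer) EstimateMemory(layers int) string {
	return fmt.Sprintf("%s: new estimate for %d layers", s.model, layers)
}

// newServer picks the implementation once; callers then use the interface.
func newServer(model string, useNewEngine bool) server {
	if useNewEngine {
		return &ollamaServer{common{model: model}}
	}
	return &llamaServer{common{model: model}}
}

func main() {
	for _, useNewEngine := range []bool{false, true} {
		s := newServer("example-model", useNewEngine)
		fmt.Println(s.Engine(), "->", s.EstimateMemory(32))
	}
}
```

With a one-to-one mapping between server type and engine, the only engine check left in this sketch is the one inside newServer.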
server.go 52.8 KB