1. 20 Nov, 2024 1 commit
    • Daniel Hiltgen's avatar
      Improve crash reporting (#7728) · 909a88c5
      Daniel Hiltgen authored
      Many model crashes are masked behind "An existing connection was forcibly closed by the remote host"
      This captures that common error message and wires in any detected errors from the log.
      
      This also adds the deepseek context shift error to the known errors we capture.
      909a88c5
  2. 05 Aug, 2024 1 commit
  3. 04 Jul, 2024 1 commit
  4. 01 Jul, 2024 1 commit
  5. 01 Apr, 2024 1 commit
    • Daniel Hiltgen's avatar
      Switch back to subprocessing for llama.cpp · 58d95cc9
      Daniel Hiltgen authored
      This should resolve a number of memory leak and stability defects by allowing
      us to isolate llama.cpp in a separate process and shutdown when idle, and
      gracefully restart if it has problems.  This also serves as a first step to be
      able to run multiple copies to support multiple models concurrently.
      58d95cc9