  1. 10 May, 2024 2 commits
  2. 09 May, 2024 5 commits
  3. 08 May, 2024 1 commit
  4. 07 May, 2024 1 commit
  5. 06 May, 2024 3 commits
  6. 05 May, 2024 1 commit
    • Centralize server config handling · f56aa200
      Daniel Hiltgen authored
      This moves all the env var reading into one central module
      and logs the loaded config once at startup, which should
      help when troubleshooting user server logs.
  7. 01 May, 2024 4 commits
  8. 29 Apr, 2024 1 commit
  9. 26 Apr, 2024 1 commit
  10. 25 Apr, 2024 1 commit
  11. 23 Apr, 2024 4 commits
  12. 17 Apr, 2024 3 commits
  13. 16 Apr, 2024 2 commits
  14. 15 Apr, 2024 1 commit
  15. 10 Apr, 2024 2 commits
  16. 09 Apr, 2024 1 commit
  17. 06 Apr, 2024 1 commit
  18. 03 Apr, 2024 1 commit
  19. 02 Apr, 2024 2 commits
  20. 01 Apr, 2024 1 commit
    • Switch back to subprocessing for llama.cpp · 58d95cc9
      Daniel Hiltgen authored
      This should resolve a number of memory leak and stability defects by allowing
      us to isolate llama.cpp in a separate process, shut it down when idle, and
      gracefully restart it if it has problems. This also serves as a first step toward
      running multiple copies to support multiple models concurrently.
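
The "Centralize server config handling" commit (f56aa200) listed above describes reading every environment variable in one module and logging the loaded configuration once at startup. The following is a minimal Go sketch of that general pattern, not the commit's actual code: the `Config` fields, the `EXAMPLE_*` variable names, and the default values are all hypothetical and chosen only for illustration.

```go
package main

import (
	"fmt"
	"log"
	"os"
	"strconv"
	"sync"
)

// Config holds every setting read from the environment.
// The fields and their defaults are hypothetical.
type Config struct {
	Host     string
	Debug    bool
	MaxQueue int
}

// lookupFunc has the same shape as os.LookupEnv, so callers
// can inject a fake environment instead of the real one.
type lookupFunc func(key string) (string, bool)

// Load reads all settings in one place and falls back to
// defaults when a variable is unset or cannot be parsed.
func Load(lookup lookupFunc) Config {
	cfg := Config{Host: "127.0.0.1:8080", Debug: false, MaxQueue: 16}
	if v, ok := lookup("EXAMPLE_HOST"); ok && v != "" {
		cfg.Host = v
	}
	if v, ok := lookup("EXAMPLE_DEBUG"); ok {
		if b, err := strconv.ParseBool(v); err == nil {
			cfg.Debug = b
		}
	}
	if v, ok := lookup("EXAMPLE_MAX_QUEUE"); ok {
		if n, err := strconv.Atoi(v); err == nil && n > 0 {
			cfg.MaxQueue = n
		}
	}
	return cfg
}

// String renders the config as a single log-friendly line.
func (c Config) String() string {
	return fmt.Sprintf("host=%s debug=%t max_queue=%d", c.Host, c.Debug, c.MaxQueue)
}

var (
	once   sync.Once
	loaded Config
)

// Get loads the config from the process environment on first
// use and logs it exactly once; later calls reuse the result.
func Get() Config {
	once.Do(func() {
		loaded = Load(os.LookupEnv)
		log.Printf("server config: %s", loaded)
	})
	return loaded
}

func main() {
	cfg := Get()
	_ = Get() // second call does not log again
	fmt.Println(cfg.Host)
}
```

Passing the lookup function into `Load` keeps the parsing logic testable without touching the real environment, while `Get` gives the rest of the program a single entry point, so the startup log line shows exactly the values the server is using.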