1. 19 Feb, 2024 1 commit
    • Daniel Hiltgen's avatar
      Fix cuda leaks · fc39a6cd
      Daniel Hiltgen authored
      This should resolve the problem where we don't fully unload from the GPU
      when we go idle.
      fc39a6cd
  2. 15 Feb, 2024 1 commit
  3. 14 Feb, 2024 4 commits
  4. 12 Feb, 2024 3 commits
  5. 09 Feb, 2024 1 commit
    • Daniel Hiltgen's avatar
      Shutdown faster · 66807615
      Daniel Hiltgen authored
      Make sure that when a shutdown signal comes, we shutdown quickly instead
      of waiting for a potentially long exchange to wrap up.
      66807615
  6. 08 Feb, 2024 1 commit
  7. 06 Feb, 2024 1 commit
  8. 02 Feb, 2024 1 commit
  9. 01 Feb, 2024 2 commits
  10. 31 Jan, 2024 1 commit
  11. 29 Jan, 2024 1 commit
  12. 25 Jan, 2024 3 commits
  13. 24 Jan, 2024 1 commit
  14. 23 Jan, 2024 2 commits
  15. 22 Jan, 2024 4 commits
  16. 21 Jan, 2024 3 commits
  17. 20 Jan, 2024 3 commits
  18. 19 Jan, 2024 4 commits
  19. 18 Jan, 2024 1 commit
  20. 17 Jan, 2024 1 commit
  21. 16 Jan, 2024 1 commit
    • Daniel Hiltgen's avatar
      Bump llama.cpp to b1842 and add new cuda lib dep · 795674dd
      Daniel Hiltgen authored
      Upstream llama.cpp has added a new dependency with the
      NVIDIA CUDA Driver Libraries (libcuda.so) which is part of the
      driver distribution, not the general cuda libraries, and is not
      available as an archive, so we can not statically link it.  This may
      introduce some additional compatibility challenges which we'll
      need to keep an eye on.
      795674dd