1. 03 Sep, 2024 2 commits
      Log system memory at info (#6617) · 037a4d10
      Daniel Hiltgen authored
      On systems with low system memory, we can hit allocation failures that are difficult to diagnose
      without debug logs.  This will make it easier to spot.
      037a4d10
      Fix sprintf to snprintf (#5664) · 94fff580
      FellowTraveler authored
      /Users/au/src/ollama/llm/ext_server/server.cpp:289:9: warning: 'sprintf' is deprecated: This function is provided for compatibility reasons only. Due to security concerns inherent in the design of sprintf(3), it is highly recommended that you use snprintf(3) instead.
      94fff580
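The compiler warning above concerns unbounded formatting into a fixed-size buffer. As a minimal sketch of the kind of change it asks for (the helper name `format_byte_hex`, the buffer size, and the format string are illustrative assumptions, not the actual code at server.cpp:289), a bounded call might look like this:

```cpp
#include <cstddef>
#include <cstdio>
#include <string>

// Hypothetical helper for illustration only; not taken from server.cpp.
// Formats one byte as an escaped hex sequence such as "\xff".
std::string format_byte_hex(unsigned char byte) {
    char buf[8];
    // snprintf never writes more than sizeof(buf) bytes (including the
    // terminating NUL), unlike sprintf, which trusts the caller to have
    // sized the buffer correctly.
    int n = std::snprintf(buf, sizeof(buf), "\\x%02x",
                          static_cast<unsigned int>(byte));
    // A negative result signals an encoding error; a result >= sizeof(buf)
    // means the output was truncated. Return an empty string in both cases.
    if (n < 0 || static_cast<std::size_t>(n) >= sizeof(buf)) {
        return std::string();
    }
    return std::string(buf, static_cast<std::size_t>(n));
}
```

Checking the return value matters as much as the size argument: `snprintf` reports how many characters it would have written, so a result at or above the buffer size indicates truncation rather than success.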
  2. 29 Aug, 2024 1 commit
  3. 27 Aug, 2024 1 commit
  4. 25 Aug, 2024 1 commit
  5. 23 Aug, 2024 2 commits
  6. 22 Aug, 2024 1 commit
      Fix embeddings memory corruption (#6467) · 90ca8417
      Daniel Hiltgen authored
      * Fix embeddings memory corruption
      
      The patch was causing a buffer overrun.  Once it was removed, though, parallelism
      in server.cpp led to hitting an assert due to slot/seq IDs being >= token count.  To
      work around this, only use slot 0 for embeddings.
      
      * Fix embed integration test assumption
      
      The token eval count has changed with recent llama.cpp bumps (0.3.5+)
      90ca8417
  7. 21 Aug, 2024 1 commit
  8. 20 Aug, 2024 1 commit
      Split rocm back out of bundle (#6432) · a017cf2f
      Daniel Hiltgen authored
      We're over budget for GitHub's maximum release artifact size with ROCm + 2 CUDA
      versions.  This splits ROCm back out as a discrete artifact, but keeps the layout so it can
      be extracted into the same location as the main bundle.
      a017cf2f
  9. 19 Aug, 2024 6 commits
  10. 12 Aug, 2024 1 commit
  11. 11 Aug, 2024 2 commits
  12. 07 Aug, 2024 1 commit
  13. 06 Aug, 2024 1 commit
  14. 05 Aug, 2024 4 commits
  15. 02 Aug, 2024 1 commit
  16. 31 Jul, 2024 5 commits
  17. 30 Jul, 2024 1 commit
  18. 29 Jul, 2024 1 commit
  19. 27 Jul, 2024 1 commit
  20. 26 Jul, 2024 1 commit
  21. 25 Jul, 2024 1 commit
  22. 24 Jul, 2024 1 commit
  23. 22 Jul, 2024 3 commits
      Enable windows error dialog for subprocess startup · e12fff88
      Daniel Hiltgen authored
      Make sure that if something goes wrong spawning the process, the user gets
      enough info to try to self-correct, or at least file a bug
      with details so we can fix it.  Once the process starts, we immediately
      change back to the recommended setting to prevent the blocking dialog.
      This ensures that if the model fails to load (OOM, unsupported model type,
      etc.) the process will exit quickly and we can scan the stdout/stderr
      of the subprocess for the reason to report via the API.
      e12fff88
      string · e2c3f6b3
      Michael Yang authored
      e2c3f6b3
      bool · 55cd3ddc
      Michael Yang authored
      55cd3ddc