1. 30 Aug, 2024 2 commits
  2. 29 Aug, 2024 2 commits
  3. 28 Aug, 2024 8 commits
  4. 27 Aug, 2024 12 commits
  5. 25 Aug, 2024 1 commit
  6. 23 Aug, 2024 7 commits
  7. 22 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Fix embeddings memory corruption (#6467) · 90ca8417
      Daniel Hiltgen authored
      * Fix embeddings memory corruption
      
      The patch was leading to a buffer overrun corruption.  Once removed though, parallism
      in server.cpp lead to hitting an assert due to slot/seq IDs being >= token count.  To
      work around this, only use slot 0 for embeddings.
      
      * Fix embed integration test assumption
      
      The token eval count has changed with recent llama.cpp bumps (0.3.5+)
      90ca8417
  8. 21 Aug, 2024 7 commits