1. 27 Aug, 2024 1 commit
  2. 25 Aug, 2024 1 commit
  3. 23 Aug, 2024 2 commits
  4. 22 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Fix embeddings memory corruption (#6467) · 90ca8417
      Daniel Hiltgen authored
      * Fix embeddings memory corruption
      
      The patch was leading to a buffer overrun corruption.  Once removed though, parallism
      in server.cpp lead to hitting an assert due to slot/seq IDs being >= token count.  To
      work around this, only use slot 0 for embeddings.
      
      * Fix embed integration test assumption
      
      The token eval count has changed with recent llama.cpp bumps (0.3.5+)
      90ca8417
  5. 21 Aug, 2024 1 commit
  6. 20 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Split rocm back out of bundle (#6432) · a017cf2f
      Daniel Hiltgen authored
      We're over budget for github's maximum release artifact size with rocm + 2 cuda
      versions.  This splits rocm back out as a discrete artifact, but keeps the layout so it can
      be extracted into the same location as the main bundle.
      a017cf2f
  7. 19 Aug, 2024 6 commits
  8. 12 Aug, 2024 1 commit
  9. 11 Aug, 2024 2 commits
  10. 07 Aug, 2024 1 commit
  11. 06 Aug, 2024 1 commit
  12. 05 Aug, 2024 4 commits
  13. 02 Aug, 2024 1 commit
  14. 31 Jul, 2024 5 commits
  15. 30 Jul, 2024 1 commit
  16. 29 Jul, 2024 1 commit
  17. 27 Jul, 2024 1 commit
  18. 26 Jul, 2024 1 commit
  19. 25 Jul, 2024 1 commit
  20. 24 Jul, 2024 1 commit
  21. 22 Jul, 2024 6 commits