"vscode:/vscode.git/clone" did not exist on "06301c0a0bbe554b10f2418d6d0482eaf37ba475"
  1. 22 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Fix embeddings memory corruption (#6467) · 90ca8417
      Daniel Hiltgen authored
      * Fix embeddings memory corruption
      
      The patch was leading to a buffer overrun corruption.  Once removed though, parallism
      in server.cpp lead to hitting an assert due to slot/seq IDs being >= token count.  To
      work around this, only use slot 0 for embeddings.
      
      * Fix embed integration test assumption
      
      The token eval count has changed with recent llama.cpp bumps (0.3.5+)
      90ca8417
  2. 21 Aug, 2024 8 commits
  3. 20 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Split rocm back out of bundle (#6432) · a017cf2f
      Daniel Hiltgen authored
      We're over budget for github's maximum release artifact size with rocm + 2 cuda
      versions.  This splits rocm back out as a discrete artifact, but keeps the layout so it can
      be extracted into the same location as the main bundle.
      a017cf2f
  4. 19 Aug, 2024 17 commits
  5. 18 Aug, 2024 2 commits
  6. 17 Aug, 2024 1 commit
  7. 16 Aug, 2024 1 commit
  8. 15 Aug, 2024 4 commits
  9. 14 Aug, 2024 4 commits
  10. 13 Aug, 2024 1 commit
    • Blake Mizerany's avatar
      server: reduce max connections used in download (#6347) · 8e1050f3
      Blake Mizerany authored
      The previous value of 64 was WAY too high and unnecessary. It reached
      diminishing returns and blew past it. This is a more reasonable number
      for _most_ normal cases. For users on cloud servers with excellent
      network quality, this will keep screaming for them, without hitting our
      CDN limits. For users with relatively poor network quality, this will
      keep them from saturating their network and causing other issues.
      8e1050f3