1. 03 Jan, 2026 3 commits
  2. 23 Dec, 2025 2 commits
  3. 19 Dec, 2025 1 commit
    • Jesse Gross's avatar
      llm: Avoid integer underflow on llama engine memory layout · 172b5924
      Jesse Gross authored
      On the llama engine, when we compute the memory layout, we reserve
      a buffer to allow for some flexibility for incorrect estimates.
      This is subtracted from GPU free memory and on GPUs with limited
      memory, it may underflow.
      
      Fixes #13494
      172b5924
  4. 18 Dec, 2025 4 commits
  5. 17 Dec, 2025 3 commits
  6. 16 Dec, 2025 8 commits
  7. 15 Dec, 2025 6 commits
  8. 13 Dec, 2025 2 commits
  9. 12 Dec, 2025 10 commits
  10. 11 Dec, 2025 1 commit