1. 07 Jan, 2026 1 commit
    • Devon Rifkin's avatar
      template: fix args-as-json rendering (#13636) · 626af2d8
      Devon Rifkin authored
      In #13525, I accidentally broke templates' ability to automatically
      render tool call function arguments as JSON.
      
      We do need these to be proper maps because we need templates to be able
      to call range, which can't be done on custom types.
      626af2d8
  2. 06 Jan, 2026 3 commits
  3. 03 Jan, 2026 5 commits
  4. 23 Dec, 2025 2 commits
  5. 19 Dec, 2025 1 commit
    • Jesse Gross's avatar
      llm: Avoid integer underflow on llama engine memory layout · 172b5924
      Jesse Gross authored
      On the llama engine, when we compute the memory layout, we reserve
      a buffer to allow for some flexibility for incorrect estimates.
      This is subtracted from GPU free memory and on GPUs with limited
      memory, it may underflow.
      
      Fixes #13494
      172b5924
  6. 18 Dec, 2025 4 commits
  7. 17 Dec, 2025 3 commits
  8. 16 Dec, 2025 8 commits
  9. 15 Dec, 2025 6 commits
  10. 13 Dec, 2025 2 commits
  11. 12 Dec, 2025 5 commits