1. 22 Jul, 2024 1 commit
    • Daniel Hiltgen's avatar
      Remove no longer supported max vram var · cc269ba0
      Daniel Hiltgen authored
      The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
      scenarios.  With Concurrency this was no longer wired up, and the simplistic
      value doesn't map to multi-GPU setups.  Users can still set `num_gpu`
      to limit memory usage to avoid OOM if we get our predictions wrong.
      cc269ba0
  2. 21 Jul, 2024 2 commits
  3. 20 Jul, 2024 7 commits
  4. 19 Jul, 2024 3 commits
  5. 18 Jul, 2024 5 commits
  6. 17 Jul, 2024 8 commits
  7. 16 Jul, 2024 14 commits