1. 13 Aug, 2025 1 commit
    • discovery: fix cudart driver version (#11614) · 837379a9
      Daniel Hiltgen authored
      We prefer the nvcuda library, which reports driver versions. When we
      dropped CUDA v11, we added a safety check for too-old drivers. What
      we missed was that the cudart fallback discovery logic didn't have the
      driver version wired up. This fixes cudart discovery to expose the
      driver version as well, so we no longer reject all GPUs if nvcuda
      didn't work.
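
The failure mode described in this commit message can be illustrated with a minimal Go sketch. It is not the project's actual discovery code: the `GPUInfo` type, the `filterSupported` function, and the `minDriverMajor` threshold are hypothetical names and values chosen for illustration. The sketch assumes that a discovery path which never sets the driver version leaves it at zero, so a minimum-version check rejects every GPU found through that path.

```go
package main

import "fmt"

// GPUInfo is a hypothetical, simplified record for one discovered GPU.
type GPUInfo struct {
	Name        string
	Library     string // discovery library that found the GPU: "nvcuda" or "cudart"
	DriverMajor int    // stays zero if the discovery path never sets it
	DriverMinor int
}

// minDriverMajor is an illustrative threshold, not the project's real value.
const minDriverMajor = 12

// filterSupported keeps only GPUs whose reported driver major version
// meets the minimum. A GPU with an unset (zero) version is rejected.
func filterSupported(gpus []GPUInfo) []GPUInfo {
	var supported []GPUInfo
	for _, g := range gpus {
		if g.DriverMajor >= minDriverMajor {
			supported = append(supported, g)
		}
	}
	return supported
}

func main() {
	// Fallback path without the driver version wired up: the field is zero.
	before := []GPUInfo{{Name: "GPU 0", Library: "cudart"}}
	// Fallback path that also reports the driver version.
	after := []GPUInfo{{Name: "GPU 0", Library: "cudart", DriverMajor: 12, DriverMinor: 4}}

	fmt.Println(len(filterSupported(before)), len(filterSupported(after))) // prints "0 1"
}
```

In this sketch the same GPU is dropped or kept depending only on whether the fallback path fills in the version fields, which is the behavior the message attributes to the missing wiring.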
  2. 06 May, 2025 1 commit
  3. 05 May, 2025 1 commit
  4. 17 Oct, 2024 1 commit
  5. 19 Jun, 2024 1 commit
    • Fix bad symbol load detection · 52ce350b
      Daniel Hiltgen authored
      Pointer dereferences weren't correct for a few libraries, which explains
      some crashes on older systems or on systems with miswired symlinks for
      discovery libraries.
  6. 14 Jun, 2024 2 commits
  7. 23 Apr, 2024 1 commit
    • Request and model concurrency · 34b9db5a
      Daniel Hiltgen authored
      This change adds support for multiple concurrent requests, as well as
      loading multiple models by spawning multiple runners. The default
      settings are currently set at 1 concurrent request per model and only 1
      loaded model at a time, but these can be adjusted by setting
      OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
  8. 01 Apr, 2024 2 commits
  9. 25 Mar, 2024 1 commit