1. 23 Apr, 2024 5 commits
    • Daniel Hiltgen's avatar
      Move nested payloads to installer and zip file on windows · 058f6cd2
      Daniel Hiltgen authored
      Now that the llm runner is an executable and not just a dll, more users are facing
      problems with security policy configurations on windows that prevent users
      writing to directories and then executing binaries from the same location.
      This change removes payloads from the main executable on windows and shifts them
      over to be packaged in the installer and discovered based on the executables location.
      This also adds a new zip file for people who want to "roll their own" installation model.
      058f6cd2
    • Daniel Hiltgen's avatar
      Detect and recover if runner removed · 58888a74
      Daniel Hiltgen authored
      Tmp cleaners can nuke the file out from underneath us.  This detects the missing
      runner, and re-initializes the payloads.
      58888a74
    • Daniel Hiltgen's avatar
      Request and model concurrency · 34b9db5a
      Daniel Hiltgen authored
      This change adds support for multiple concurrent requests, as well as
      loading multiple models by spawning multiple runners. The default
      settings are currently set at 1 concurrent request per model and only 1
      loaded model at a time, but these can be adjusted by setting
      OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
      34b9db5a
    • Michael Yang's avatar
      fix: mixtral graph · 435cc866
      Michael Yang authored
      435cc866
    • Daniel Hiltgen's avatar
      aa72281e
  2. 21 Apr, 2024 2 commits
  3. 18 Apr, 2024 3 commits
    • Jeremy's avatar
      Update gen_windows.ps1 · 6f18297b
      Jeremy authored
      Forgot a " on the write-host
      6f18297b
    • Jeremy's avatar
      Update gen_windows.ps1 · 15016413
      Jeremy authored
      Added OLLAMA_CUSTOM_CUDA_DEFS and OLLAMA_CUSTOM_ROCM_DEFS to customize GPU builds on Windows
      15016413
    • Jeremy's avatar
      Update gen_linux.sh · 440b7190
      Jeremy authored
      Added OLLAMA_CUSTOM_CUDA_DEFS and OLLAMA_CUSTOM_ROCM_DEFS instead of OLLAMA_CUSTOM_GPU_DEFS
      440b7190
  4. 17 Apr, 2024 6 commits
  5. 16 Apr, 2024 6 commits
  6. 15 Apr, 2024 2 commits
  7. 13 Apr, 2024 1 commit
  8. 11 Apr, 2024 1 commit
  9. 10 Apr, 2024 2 commits
  10. 09 Apr, 2024 4 commits
  11. 08 Apr, 2024 1 commit
  12. 07 Apr, 2024 1 commit
  13. 06 Apr, 2024 1 commit
  14. 04 Apr, 2024 3 commits
  15. 03 Apr, 2024 2 commits