1. 24 Apr, 2024 1 commit
    • Add back memory escape valve · 5445aaa9
      Daniel Hiltgen authored
      If our memory predictions are wrong, this can be used to
      set a lower memory limit as a workaround. The recent multi-GPU
      refactoring accidentally removed it, so this adds it back.
  2. 23 Apr, 2024 5 commits
    • Move nested payloads to installer and zip file on windows · 058f6cd2
      Daniel Hiltgen authored
      Now that the llm runner is an executable and not just a DLL, more users are hitting
      problems with security policy configurations on Windows that prevent them from
      writing to a directory and then executing binaries from that same location.
      This change removes the payloads from the main executable on Windows and instead
      packages them in the installer, where they are discovered based on the executable's location.
      This also adds a new zip file for people who want to "roll their own" installation model.
    • Detect and recover if runner removed · 58888a74
      Daniel Hiltgen authored
      Tmp cleaners can nuke the runner file out from underneath us. This detects the missing
      runner and re-initializes the payloads.
    • Request and model concurrency · 34b9db5a
      Daniel Hiltgen authored
      This change adds support for multiple concurrent requests, as well as
      loading multiple models by spawning multiple runners. The default
      settings are currently set at 1 concurrent request per model and only 1
      loaded model at a time, but these can be adjusted by setting
      OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
    • fix: mixtral graph · 435cc866
      Michael Yang authored
    • Daniel Hiltgen's avatar
      aa72281e
  3. 21 Apr, 2024 2 commits
  4. 18 Apr, 2024 3 commits
    • Update gen_windows.ps1 · 6f18297b
      Jeremy authored
      Forgot a " on the Write-Host call
    • Update gen_windows.ps1 · 15016413
      Jeremy authored
      Added OLLAMA_CUSTOM_CUDA_DEFS and OLLAMA_CUSTOM_ROCM_DEFS to customize GPU builds on Windows
    • Update gen_linux.sh · 440b7190
      Jeremy authored
      Added OLLAMA_CUSTOM_CUDA_DEFS and OLLAMA_CUSTOM_ROCM_DEFS instead of OLLAMA_CUSTOM_GPU_DEFS
  5. 17 Apr, 2024 6 commits
  6. 16 Apr, 2024 6 commits
  7. 15 Apr, 2024 2 commits
  8. 13 Apr, 2024 1 commit
  9. 11 Apr, 2024 1 commit
  10. 10 Apr, 2024 2 commits
  11. 09 Apr, 2024 4 commits
  12. 08 Apr, 2024 1 commit
  13. 07 Apr, 2024 1 commit
  14. 06 Apr, 2024 1 commit
  15. 04 Apr, 2024 3 commits
  16. 03 Apr, 2024 1 commit
    • Fix CI release glitches · e4a7e5b2
      Daniel Hiltgen authored
      The subprocess change moved the build directory.
      arm64 builds weren't setting cross-compilation flags when building on x86.