1. 13 May, 2025 1 commit
  2. 12 May, 2025 1 commit
    • Daniel Hiltgen's avatar
      Follow up to #10363 (#10647) · 9d6df908
      Daniel Hiltgen authored
      The quantization PR didn't block all unsupported file types,
      which this PR fixes.  It also updates the API docs to reflect
      the now reduced set of supported types.
      9d6df908
  3. 08 May, 2025 1 commit
  4. 07 May, 2025 1 commit
    • Daniel Hiltgen's avatar
      remove cuda v11 (#10569) · fa393554
      Daniel Hiltgen authored
      This reduces the size of our Windows installer payloads by ~256M by dropping
      support for nvidia drivers older than Feb 2023.  Hardware support is unchanged.
      
      Linux default bundle sizes are reduced by ~600M to 1G.
      fa393554
  5. 05 May, 2025 1 commit
  6. 29 Apr, 2025 1 commit
  7. 28 Apr, 2025 1 commit
  8. 22 Apr, 2025 1 commit
    • Devon Rifkin's avatar
      increase default context length to 4096 (#10364) · 424f6486
      Devon Rifkin authored
      * increase default context length to 4096
      
      We lower the default numParallel from 4 to 2 and use these "savings" to
      double the default context length from 2048 to 4096.
      
      We're memory neutral in cases when we previously would've used
      numParallel == 4, but we add the following mitigation to handle some
      cases where we would have previously fallen back to 1x2048 due to low
      VRAM: we decide between 2048 and 4096 using a runtime check, choosing
      2048 if we're on a one GPU system with total VRAM of <= 4 GB. We
      purposefully don't check the available VRAM because we don't want the
      context window size to change unexpectedly based on the available VRAM.
      
      We plan on making the default even larger, but this is a relatively
      low-risk change we can make to quickly double it.
      
      * fix tests
      
      add an explicit context length so they don't get truncated. The code
      that converts -1 from being a signal for doing a runtime check isn't
      running as part of these tests.
      
      * tweak small gpu message
      
      * clarify context length default
      
      also make it actually show up in `ollama serve --help`
      424f6486
  9. 15 Apr, 2025 2 commits
  10. 08 Apr, 2025 1 commit
  11. 01 Apr, 2025 1 commit
  12. 27 Mar, 2025 1 commit
  13. 25 Mar, 2025 1 commit
  14. 21 Mar, 2025 2 commits
  15. 13 Mar, 2025 1 commit
  16. 10 Mar, 2025 1 commit
  17. 07 Mar, 2025 1 commit
    • ‮rekcäH nitraM‮'s avatar
      Better WantedBy declaration · 25248f4b
      ‮rekcäH nitraM‮ authored
      The problem with default.target is that it always points to the target that is currently started. So if you boot into single user mode or the rescue mode still Ollama tries to start.
      
      I noticed this because either tried (and failed) to start all the time during a system update, where Ollama definitely is not wanted.
      25248f4b
  18. 05 Mar, 2025 1 commit
  19. 04 Mar, 2025 1 commit
    • Blake Mizerany's avatar
      server/.../backoff,syncs: don't break builds without synctest (#9484) · 55ab9f37
      Blake Mizerany authored
      Previously, developers without the synctest experiment enabled would see
      build failures when running tests in some server/internal/internal
      packages using the synctest package. This change makes the transition to
      use of the package less painful but guards the use of the synctest
      package with build tags.
      
      synctest is enabled in CI. If a new change will break a synctest
      package, it will break in CI, even if it does not break locally.
      
      The developer docs have been updated to help with any confusion about
      why package tests pass locally but fail in CI.
      55ab9f37
  20. 27 Feb, 2025 1 commit
    • Daniel Hiltgen's avatar
      Windows ARM build (#9120) · 688925ac
      Daniel Hiltgen authored
      * Windows ARM build
      
      Skip cmake, and note it's unused in the developer docs.
      
      * Win: only check for ninja when we need it
      
      On windows ARM, the cim lookup fails, but we don't need ninja anyway.
      688925ac
  21. 25 Feb, 2025 2 commits
  22. 22 Feb, 2025 1 commit
  23. 15 Feb, 2025 1 commit
  24. 13 Feb, 2025 1 commit
  25. 08 Feb, 2025 1 commit
  26. 07 Feb, 2025 2 commits
  27. 06 Feb, 2025 1 commit
  28. 05 Feb, 2025 1 commit
  29. 03 Feb, 2025 1 commit
  30. 02 Feb, 2025 1 commit
  31. 29 Jan, 2025 2 commits
    • Parth Sareen's avatar
    • Michael Yang's avatar
      next build (#8539) · dcfb7a10
      Michael Yang authored
      
      
      * add build to .dockerignore
      
      * test: only build one arch
      
      * add build to .gitignore
      
      * fix ccache path
      
      * filter amdgpu targets
      
      * only filter if autodetecting
      
      * Don't clobber gpu list for default runner
      
      This ensures the GPU specific environment variables are set properly
      
      * explicitly set CXX compiler for HIP
      
      * Update build_windows.ps1
      
      This isn't complete, but is close.  Dependencies are missing, and it only builds the "default" preset.
      
      * build: add ollama subdir
      
      * add .git to .dockerignore
      
      * docs: update development.md
      
      * update build_darwin.sh
      
      * remove unused scripts
      
      * llm: add cwd and build/lib/ollama to library paths
      
      * default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS
      
      * add additional cmake output vars for msvc
      
      * interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12
      
      * remove unncessary filepath.Dir, cleanup
      
      * add hardware-specific directory to path
      
      * use absolute server path
      
      * build: linux arm
      
      * cmake install targets
      
      * remove unused files
      
      * ml: visit each library path once
      
      * build: skip cpu variants on arm
      
      * build: install cpu targets
      
      * build: fix workflow
      
      * shorter names
      
      * fix rocblas install
      
      * docs: clean up development.md
      
      * consistent build dir removal in development.md
      
      * silence -Wimplicit-function-declaration build warnings in ggml-cpu
      
      * update readme
      
      * update development readme
      
      * llm: update library lookup logic now that there is one runner (#8587)
      
      * tweak development.md
      
      * update docs
      
      * add windows cuda/rocm tests
      
      ---------
      Co-authored-by: default avatarjmorganca <jmorganca@gmail.com>
      Co-authored-by: default avatarDaniel Hiltgen <daniel@ollama.com>
      dcfb7a10
  32. 23 Jan, 2025 1 commit
  33. 21 Jan, 2025 1 commit
  34. 20 Jan, 2025 1 commit
  35. 15 Jan, 2025 1 commit