1. 03 Oct, 2025 2 commits
  2. 02 Oct, 2025 1 commit
  3. 11 Sep, 2025 1 commit
  4. 10 Sep, 2025 1 commit
    • Daniel Hiltgen's avatar
      Add v12 + v13 cuda support (#12000) · 17a023f3
      Daniel Hiltgen authored
      * Add support for upcoming NVIDIA Jetsons
      
      The latest Jetsons with JetPack 7 are moving to an SBSA compatible model and
      will not require building a JetPack specific variant.
      
      * cuda: bring back dual versions
      
      This adds back dual CUDA versions for our releases,
      with v11 and v13 to cover a broad set of GPUs and
      driver versions.
      
      * win: break up native builds in build_windows.ps1
      
      * v11 build working on windows and linux
      
      * switch to cuda v12.8 not JIT
      
      * Set CUDA compression to size
      
      * enhance manual install linux docs
      17a023f3
  5. 30 Jul, 2025 1 commit
  6. 29 Jul, 2025 1 commit
  7. 17 Jul, 2025 1 commit
  8. 07 Jul, 2025 1 commit
  9. 26 Jun, 2025 1 commit
  10. 25 Jun, 2025 4 commits
  11. 24 Jun, 2025 2 commits
  12. 23 Jun, 2025 1 commit
    • Daniel Hiltgen's avatar
      Re-remove cuda v11 (#10694) · 1c6669e6
      Daniel Hiltgen authored
      * Re-remove cuda v11
      
      Revert the revert - drop v11 support requiring drivers newer than Feb 23
      
      This reverts commit c6bcdc42.
      
      * Simplify layout
      
      With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)
      
      * distinct sbsa variant for linux arm64
      
      This avoids accidentally trying to load the sbsa cuda libraries on
      a jetson system which results in crashes.
      
      * temporary prevent rocm+cuda mixed loading
      1c6669e6
  13. 13 May, 2025 1 commit
  14. 07 May, 2025 2 commits
  15. 16 Apr, 2025 1 commit
  16. 27 Feb, 2025 2 commits
  17. 25 Feb, 2025 3 commits
    • Blake Mizerany's avatar
      .github: always run tests, and other helpful fixes (#9348) · 0d694793
      Blake Mizerany authored
      During work on our new registry client, I ran into frustrations with CI
      where a misspelling in a comment caused the linter to fail, which caused
      the tests to not run, which caused the build to not be cached, which
      caused the next run to be slow, which caused me to be sad.
      
      This commit address these issues, and pulls in some helpful changes
      we've had in CI on ollama.com for some time now.
      
      They are:
      
      * Always run tests, even if the other checks fail.
      
      Tests are the most important part of CI, and should always run. Failures
      in tests can be correlated with failures in other checks, and can help
      surface the root cause of the failure sooner. This is especially
      important when the failure is platform specific, and the tests are not
      platform independent.
      
      * Check that `go generate` is clean.
      
      This prevents 'go generate' abuse regressions. This codebase used to use
      it to generate platform specific binary build artifacts. Let's make sure
      that does not happen again and this powerful tool is used correctly, and
      the generated code is checked in.
      
      Also, while adding `go generate` the check, it was revealed that the
      generated metal code was putting dates in the comments, resulting in
      non-deterministic builds. This is a bad practice, and this commit fixes
      that. Git tells us the most important date: the commit date along with
      other associated changes.
      
      * Check that `go mod tidy` is clean.
      
      A new job to check that `go mod tidy` is clean was added, to prevent
      easily preventable merge conflicts or go.mod changes being deferred to a
      future PR that is unrelated to the change that caused the go.mod to
      change.
      
      * More robust caching.
      
      We now cache the go build cache, and the go mod download cache
      independently. This is because the download cache contains zips that can
      be unpacked in parallel faster than they can be fetched and extracted by
      tar. This speeds up the build significantly.
      
      The linter is hostile enough. It does not need to also punish us with
      longer build times due to small failures like misspellings.
      0d694793
    • Daniel Hiltgen's avatar
      Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) · e91ae3d4
      Daniel Hiltgen authored
      * Bump cuda and rocm versions
      
      Update ROCm to linux:6.3 win:6.2 and CUDA v12 to 12.8.
      Yum has some silent failure modes, so largely switch to dnf.
      
      * Fix windows build script
      e91ae3d4
    • Blake Mizerany's avatar
      server/internal: copy bmizerany/ollama-go to internal package (#9294) · 348b3e09
      Blake Mizerany authored
      This commit copies (without history) the bmizerany/ollama-go repository
      with the intention of integrating it into the ollama as a replacement
      for the pushing, and pulling of models, and management of the cache they
      are pushed and pulled from.
      
      New homes for these packages will be determined as they are integrated
      and we have a better understanding of proper package boundaries.
      348b3e09
  18. 20 Feb, 2025 1 commit
    • Michael Yang's avatar
      ci: use clang for windows cpu builds · ba9ec3d0
      Michael Yang authored
      clang outputs are faster. we were previously building with clang via gcc
      wrapper in cgo but this was missed during the build updates so there was
      a drop in performance
      ba9ec3d0
  19. 18 Feb, 2025 1 commit
    • Michael Yang's avatar
      ci: set owner/group in tarball · 7b5d916a
      Michael Yang authored
      set owner and group when building the linux tarball so extracted files
      are consistent. this is the behaviour of release tarballs in version
      0.5.7 and lower
      7b5d916a
  20. 08 Feb, 2025 1 commit
    • Michael Yang's avatar
      ci: use windows-2022 to sign and bundle (#8941) · 1f766c36
      Michael Yang authored
      ollama requires vcruntime140_1.dll which isn't found on 2019. previously
      the job used the windows runner (2019) but it explicitly installs
      2022 to build the app. since the sign job doesn't actually build
      anything, it can use the windows-2022 runner instead.
      1f766c36
  21. 06 Feb, 2025 2 commits
  22. 05 Feb, 2025 3 commits
  23. 04 Feb, 2025 2 commits
  24. 03 Feb, 2025 2 commits
  25. 31 Jan, 2025 2 commits