1. 30 Jul, 2025 1 commit
  2. 29 Jul, 2025 1 commit
  3. 17 Jul, 2025 1 commit
  4. 07 Jul, 2025 1 commit
  5. 26 Jun, 2025 1 commit
  6. 25 Jun, 2025 4 commits
  7. 24 Jun, 2025 2 commits
  8. 23 Jun, 2025 1 commit
    • Re-remove cuda v11 (#10694) · 1c6669e6
      Daniel Hiltgen authored
      * Re-remove cuda v11
      
      Revert the revert: drop v11 support, which requires drivers newer than Feb 23
      
      This reverts commit c6bcdc42.
      
      * Simplify layout
      
      With only one version of the GPU libraries, we can simplify things down somewhat. (Jetsons still require special handling.)
      
      * distinct sbsa variant for linux arm64
      
      This avoids accidentally trying to load the sbsa cuda libraries on
      a Jetson system, which results in crashes.
      
      * temporarily prevent rocm+cuda mixed loading
  9. 13 May, 2025 1 commit
  10. 07 May, 2025 2 commits
  11. 16 Apr, 2025 1 commit
  12. 27 Feb, 2025 2 commits
  13. 25 Feb, 2025 3 commits
    • .github: always run tests, and other helpful fixes (#9348) · 0d694793
      Blake Mizerany authored
      During work on our new registry client, I ran into frustrations with CI
      where a misspelling in a comment caused the linter to fail, which caused
      the tests to not run, which caused the build to not be cached, which
      caused the next run to be slow, which caused me to be sad.
      
      This commit addresses these issues, and pulls in some helpful changes
      we've had in CI on ollama.com for some time now.
      
      They are:
      
      * Always run tests, even if the other checks fail.
      
      Tests are the most important part of CI, and should always run. Failures
      in tests can be correlated with failures in other checks, and can help
      surface the root cause of the failure sooner. This is especially
      important when the failure is platform specific, and the tests are not
      platform independent.
      
      * Check that `go generate` is clean.
      
      This prevents `go generate` abuse regressions. This codebase used to use
      it to generate platform specific binary build artifacts. Let's make sure
      that does not happen again and this powerful tool is used correctly, and
      the generated code is checked in.
      
      Also, while adding the `go generate` check, it was revealed that the
      generated metal code was putting dates in the comments, resulting in
      non-deterministic builds. This is a bad practice, and this commit fixes
      that. Git tells us the most important date: the commit date along with
      other associated changes.
      
      * Check that `go mod tidy` is clean.
      
      A new job to check that `go mod tidy` is clean was added, to prevent
      easily preventable merge conflicts or go.mod changes being deferred to a
      future PR that is unrelated to the change that caused the go.mod to
      change.
      
      * More robust caching.
      
      We now cache the go build cache, and the go mod download cache
      independently. This is because the download cache contains zips that can
      be unpacked in parallel faster than they can be fetched and extracted by
      tar. This speeds up the build significantly.
      
      The linter is hostile enough. It does not need to also punish us with
      longer build times due to small failures like misspellings.
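
The "is clean" checks described above (for `go generate` and `go mod tidy`) can be sketched roughly as follows. This is a minimal illustration in Python, not the workflow this commit actually adds: the helper names `dirty_paths` and `check_clean` are hypothetical, and it assumes the check works by running the command and then asking `git status --porcelain` whether anything in the working tree changed.

```python
import subprocess


def dirty_paths(porcelain: str) -> list[str]:
    """Parse `git status --porcelain` (v1) output into a list of changed paths."""
    paths = []
    for line in porcelain.splitlines():
        if not line.strip():
            continue
        # Each line is a two-character status code, one space, then the path.
        # Renames ("old -> new") are kept as-is; this sketch does not split them.
        paths.append(line[3:])
    return paths


def check_clean(command: list[str], repo_dir: str = ".") -> list[str]:
    """Run `command` in `repo_dir`, then return the paths git reports as changed.

    An empty list means the command left the tree clean.
    """
    subprocess.run(command, cwd=repo_dir, check=True)
    status = subprocess.run(
        ["git", "status", "--porcelain"],
        cwd=repo_dir,
        check=True,
        capture_output=True,
        text=True,
    )
    return dirty_paths(status.stdout)
```

A CI step could call `check_clean(["go", "mod", "tidy"])` or `check_clean(["go", "generate", "./..."])` and fail the job when the returned list is non-empty, which forces the regenerated files or go.mod changes into the PR that caused them.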
    • Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) · e91ae3d4
      Daniel Hiltgen authored
      * Bump cuda and rocm versions
      
      Update ROCm to linux:6.3 win:6.2 and CUDA v12 to 12.8.
      Yum has some silent failure modes, so largely switch to dnf.
      
      * Fix windows build script
    • server/internal: copy bmizerany/ollama-go to internal package (#9294) · 348b3e09
      Blake Mizerany authored
      This commit copies (without history) the bmizerany/ollama-go repository
      with the intention of integrating it into ollama as a replacement
      for the pushing and pulling of models, and the management of the cache
      they are pushed to and pulled from.
      
      New homes for these packages will be determined as they are integrated
      and we have a better understanding of proper package boundaries.
  14. 20 Feb, 2025 1 commit
    • ci: use clang for windows cpu builds · ba9ec3d0
      Michael Yang authored
      clang outputs are faster. we were previously building with clang via a
      gcc wrapper in cgo, but this was missed during the build updates, so
      there was a drop in performance
  15. 18 Feb, 2025 1 commit
    • ci: set owner/group in tarball · 7b5d916a
      Michael Yang authored
      set owner and group when building the linux tarball so extracted files
      are consistent. this is the behaviour of release tarballs in version
      0.5.7 and lower
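
As a rough illustration of what setting owner and group on archive entries means, here is a minimal Python sketch using the standard `tarfile` module. It is not the CI change itself: the commit does not say which tool or which owner/group values are used, so `root:root` (uid/gid 0) and the helper names `normalize_owner` and `build_tarball` are assumptions made for this example.

```python
import tarfile


def normalize_owner(info: tarfile.TarInfo) -> tarfile.TarInfo:
    """Overwrite ownership metadata so it does not depend on the build machine.

    Assumption: root:root (0:0) is the desired owner; the commit does not say.
    """
    info.uid = 0
    info.gid = 0
    info.uname = "root"
    info.gname = "root"
    return info


def build_tarball(src_dir: str, out_path: str) -> None:
    """Archive `src_dir` into a gzip tarball with normalized owner and group."""
    with tarfile.open(out_path, "w:gz") as tar:
        # `filter` is applied to every entry, including nested files and dirs.
        tar.add(src_dir, arcname=".", filter=normalize_owner)
```

Without the filter, each entry records whatever user and group owned the file on the build runner, so extracted files can differ between releases built on different machines.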
  16. 08 Feb, 2025 1 commit
    • ci: use windows-2022 to sign and bundle (#8941) · 1f766c36
      Michael Yang authored
      ollama requires vcruntime140_1.dll which isn't found on 2019. previously
      the job used the windows runner (2019) but it explicitly installs
      2022 to build the app. since the sign job doesn't actually build
      anything, it can use the windows-2022 runner instead.
  17. 06 Feb, 2025 2 commits
  18. 05 Feb, 2025 3 commits
  19. 04 Feb, 2025 2 commits
  20. 03 Feb, 2025 2 commits
  21. 31 Jan, 2025 2 commits
  22. 30 Jan, 2025 1 commit
  23. 29 Jan, 2025 1 commit
    • next build (#8539) · dcfb7a10
      Michael Yang authored
      
      
      * add build to .dockerignore
      
      * test: only build one arch
      
      * add build to .gitignore
      
      * fix ccache path
      
      * filter amdgpu targets
      
      * only filter if autodetecting
      
      * Don't clobber gpu list for default runner
      
      This ensures the GPU specific environment variables are set properly
      
      * explicitly set CXX compiler for HIP
      
      * Update build_windows.ps1
      
      This isn't complete, but is close. Dependencies are missing, and it only builds the "default" preset.
      
      * build: add ollama subdir
      
      * add .git to .dockerignore
      
      * docs: update development.md
      
      * update build_darwin.sh
      
      * remove unused scripts
      
      * llm: add cwd and build/lib/ollama to library paths
      
      * default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS
      
      * add additional cmake output vars for msvc
      
      * interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12
      
      * remove unnecessary filepath.Dir, cleanup
      
      * add hardware-specific directory to path
      
      * use absolute server path
      
      * build: linux arm
      
      * cmake install targets
      
      * remove unused files
      
      * ml: visit each library path once
      
      * build: skip cpu variants on arm
      
      * build: install cpu targets
      
      * build: fix workflow
      
      * shorter names
      
      * fix rocblas install
      
      * docs: clean up development.md
      
      * consistent build dir removal in development.md
      
      * silence -Wimplicit-function-declaration build warnings in ggml-cpu
      
      * update readme
      
      * update development readme
      
      * llm: update library lookup logic now that there is one runner (#8587)
      
      * tweak development.md
      
      * update docs
      
      * add windows cuda/rocm tests
      
      ---------
      Co-authored-by: jmorganca <jmorganca@gmail.com>
      Co-authored-by: Daniel Hiltgen <daniel@ollama.com>
  24. 11 Dec, 2024 3 commits