Commits · 6dcc5dfb9c0a033e4e8dde627d55580600418fb6 · OpenDAS / ollama

30 Jul, 2025 1 commit
- Revert "CI: switch back to x86 macos builder" (#11588) · 6dcc5dfb
  Daniel Hiltgen authored Jul 30, 2025
```
This reverts commit 9d071e6089319b37acf62bb739e3430dcb2ac0c3.
```
  6dcc5dfb
29 Jul, 2025 1 commit
- CI: switch back to x86 macos builder (#11572) · 8afa6e83
  Daniel Hiltgen authored Jul 29, 2025
  
  8afa6e83
17 Jul, 2025 1 commit
- ci: switch mac builder to arm64 (#11379) · 191d9428
  Daniel Hiltgen authored Jul 17, 2025
```
The macos-13 is x86, while macos-13-xlarge is arm64
```
  191d9428
07 Jul, 2025 1 commit
- ci: modularization (#11324) · 12d8ad0d
  Daniel Hiltgen authored Jul 07, 2025
```
switch a few constants to variables
```
  12d8ad0d
26 Jun, 2025 1 commit
- ci: multi-stage release process (#11001) · 11ffc361
  Daniel Hiltgen authored Jun 26, 2025
  
  11ffc361
25 Jun, 2025 4 commits
- ci: arm sbsa fixes (#11194) · ad118d8b
  Daniel Hiltgen authored Jun 24, 2025
  
  ad118d8b
- ci: include dependencies · f0853413
  Daniel Hiltgen authored Jun 24, 2025
  
  f0853413
- ci: pick up arm sbsa cuda libs (#11192) · 4b4a90f2
  Daniel Hiltgen authored Jun 24, 2025
  
  4b4a90f2
- ci: recombine linux amd64 binaries (#11188) · 03274a6b
  Daniel Hiltgen authored Jun 24, 2025
```
Glue the rocm and archive builds back together.
```
  03274a6b
24 Jun, 2025 2 commits

ci: rocm parallel builds on windows (#11187) · 405d2f62

Daniel Hiltgen authored Jun 24, 2025

The preset CMAKE_HIP_FLAGS isn't getting used on Windows.
This passes the parallel flag in through the C/CXX flags, along
with suppression for some log spew warnings to quiet down the build.

405d2f62

CI: switch windows to vs 2022 (#11184) · c85c0ebf
Daniel Hiltgen authored Jun 24, 2025
```
* CI: switch windows to vs 2022

* ci: fix regex match
```
c85c0ebf

23 Jun, 2025 1 commit

Re-remove cuda v11 (#10694) · 1c6669e6

Daniel Hiltgen authored Jun 23, 2025

* Re-remove cuda v11

Revert the revert: drop v11 support, requiring drivers newer than Feb 2023

This reverts commit c6bcdc42.

* Simplify layout

With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)

* distinct sbsa variant for linux arm64

This avoids accidentally trying to load the sbsa cuda libraries on
a jetson system which results in crashes.

* temporarily prevent rocm+cuda mixed loading

1c6669e6

13 May, 2025 1 commit

Revert "remove cuda v11 (#10569)" (#10692) · c6bcdc42

Daniel Hiltgen authored May 13, 2025

Bring back v11 until we can better warn users that their driver
is too old.

This reverts commit fa393554.

c6bcdc42

07 May, 2025 2 commits

CI: trigger downstream release process (#10508) · 3098c8b2
Daniel Hiltgen authored May 07, 2025

3098c8b2

remove cuda v11 (#10569) · fa393554

Daniel Hiltgen authored May 06, 2025

This reduces the size of our Windows installer payloads by ~256M by dropping
support for nvidia drivers older than Feb 2023. Hardware support is unchanged.

Linux default bundle sizes are reduced by ~600M to 1G.

fa393554

16 Apr, 2025 1 commit
- llama: update to commit 71e90e88 (#10192) · 943464cc
  Jeffrey Morgan authored Apr 16, 2025
  
  943464cc
27 Feb, 2025 2 commits
- .github/workflows: swap order of go test and golangci-lint (#9389) · 76e903cf
  Blake Mizerany authored Feb 26, 2025
```
The linter is secondary to the tests, so it should run after the tests,
exposing test failures faster.
```
  76e903cf
- ml/backend/ggml: follow on fixes after updating vendored code (#9388) · a5272130
  Jeffrey Morgan authored Feb 26, 2025
```
Fixes sync filters and lowers CUDA version to 11.3 in test.yaml
```
  a5272130
25 Feb, 2025 3 commits

.github: always run tests, and other helpful fixes (#9348) · 0d694793

Blake Mizerany authored Feb 25, 2025

During work on our new registry client, I ran into frustrations with CI
where a misspelling in a comment caused the linter to fail, which caused
the tests to not run, which caused the build to not be cached, which
caused the next run to be slow, which caused me to be sad.

This commit addresses these issues, and pulls in some helpful changes
we've had in CI on ollama.com for some time now.

They are:

* Always run tests, even if the other checks fail.

Tests are the most important part of CI, and should always run. Failures
in tests can be correlated with failures in other checks, and can help
surface the root cause of the failure sooner. This is especially
important when the failure is platform specific, and the tests are not
platform independent.

* Check that `go generate` is clean.

This prevents 'go generate' abuse regressions. This codebase used to use
it to generate platform specific binary build artifacts. Let's make sure
that does not happen again and this powerful tool is used correctly, and
the generated code is checked in.

Also, while adding the `go generate` check, it was revealed that the
generated metal code was putting dates in the comments, resulting in
non-deterministic builds. This is a bad practice, and this commit fixes
that. Git tells us the most important date: the commit date along with
other associated changes.

* Check that `go mod tidy` is clean.

A new job to check that `go mod tidy` is clean was added, to prevent
easily preventable merge conflicts or go.mod changes being deferred to a
future PR that is unrelated to the change that caused the go.mod to
change.

* More robust caching.

We now cache the go build cache, and the go mod download cache
independently. This is because the download cache contains zips that can
be unpacked in parallel faster than they can be fetched and extracted by
tar. This speeds up the build significantly.

The linter is hostile enough. It does not need to also punish us with
longer build times due to small failures like misspellings.

0d694793

Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) · e91ae3d4

Daniel Hiltgen authored Feb 25, 2025

* Bump cuda and rocm versions

Update ROCm to linux:6.3 win:6.2 and CUDA v12 to 12.8.
Yum has some silent failure modes, so largely switch to dnf.

* Fix windows build script

e91ae3d4

server/internal: copy bmizerany/ollama-go to internal package (#9294) · 348b3e09

Blake Mizerany authored Feb 24, 2025

This commit copies (without history) the bmizerany/ollama-go repository
with the intention of integrating it into ollama as a replacement
for the pushing and pulling of models, and for management of the cache
they are pushed to and pulled from.

New homes for these packages will be determined as they are integrated
and we have a better understanding of proper package boundaries.

348b3e09

20 Feb, 2025 1 commit

ci: use clang for windows cpu builds · ba9ec3d0

Michael Yang authored Feb 20, 2025

clang outputs are faster. we were previously building with clang via a gcc
wrapper in cgo, but this was missed during the build updates, so there was
a drop in performance

ba9ec3d0

18 Feb, 2025 1 commit

ci: set owner/group in tarball · 7b5d916a

Michael Yang authored Feb 14, 2025

set owner and group when building the linux tarball so extracted files
are consistent. this is the behaviour of release tarballs in version
0.5.7 and lower

7b5d916a

08 Feb, 2025 1 commit

ci: use windows-2022 to sign and bundle (#8941) · 1f766c36

Michael Yang authored Feb 08, 2025

ollama requires vcruntime140_1.dll which isn't found on 2019. previously
the job used the windows runner (2019) but it explicitly installs
2022 to build the app. since the sign job doesn't actually build
anything, it can use the windows-2022 runner instead.

1f766c36

06 Feb, 2025 2 commits

ci: fix linux archive (#8862) · 1c198977

Michael Yang authored Feb 05, 2025

the find returns intermediate directories which pulls the parent
directories. it also omits files under lib/ollama.

switch back to globbing

1c198977

chore: update gitattributes (#8860) · 5b446cc8
Michael Yang authored Feb 05, 2025
```
* chore: update gitattributes
* chore: add build info source
```
5b446cc8

05 Feb, 2025 2 commits
- ci: fix linux archive · 070ad913
  Michael Yang authored Feb 05, 2025
  
  070ad913
- ci: split docker build by platform · 63f0269f
  Michael Yang authored Feb 04, 2025
```
this improves build reliability and concurrency
```
  63f0269f
04 Feb, 2025 2 commits
- fix extra quote · 65b7ecac
  Michael Yang authored Feb 03, 2025
  
  65b7ecac
- fix linux archive · f9d2d891
  Michael Yang authored Feb 03, 2025
  
  f9d2d891
03 Feb, 2025 2 commits
- fix build · 669dc31c
  Michael Yang authored Feb 03, 2025
  
  669dc31c
- fix release workflow · e8061840
  Michael Yang authored Jan 31, 2025
  
  e8061840
31 Jan, 2025 2 commits

fix docker build-args · 475333d5

Michael Yang authored Jan 31, 2025

env context is not accessible from job.*.strategy. since it's in the
environment, just tell docker to use the environment variable[1]

[1]: https://docs.docker.com/reference/cli/docker/buildx/build/#build-arg

475333d5

build: set CFLAGS=-O3 specifically for cpu.go · 39fd8930
Michael Yang authored Jan 30, 2025

39fd8930

30 Jan, 2025 1 commit
- build: set goflags in linux release · 3f0cb36b
  Michael Yang authored Jan 30, 2025
  
  3f0cb36b
29 Jan, 2025 1 commit

next build (#8539) · dcfb7a10

Michael Yang authored Jan 29, 2025



* add build to .dockerignore

* test: only build one arch

* add build to .gitignore

* fix ccache path

* filter amdgpu targets

* only filter if autodetecting

* Don't clobber gpu list for default runner

This ensures the GPU specific environment variables are set properly

* explicitly set CXX compiler for HIP

* Update build_windows.ps1

This isn't complete, but is close.  Dependencies are missing, and it only builds the "default" preset.

* build: add ollama subdir

* add .git to .dockerignore

* docs: update development.md

* update build_darwin.sh

* remove unused scripts

* llm: add cwd and build/lib/ollama to library paths

* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS

* add additional cmake output vars for msvc

* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12

* remove unnecessary filepath.Dir, cleanup

* add hardware-specific directory to path

* use absolute server path

* build: linux arm

* cmake install targets

* remove unused files

* ml: visit each library path once

* build: skip cpu variants on arm

* build: install cpu targets

* build: fix workflow

* shorter names

* fix rocblas install

* docs: clean up development.md

* consistent build dir removal in development.md

* silence -Wimplicit-function-declaration build warnings in ggml-cpu

* update readme

* update development readme

* llm: update library lookup logic now that there is one runner (#8587)

* tweak development.md

* update docs

* add windows cuda/rocm tests

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

dcfb7a10

11 Dec, 2024 3 commits
- ci: fix artifact path prefix for missing windows payloads (#8052) · 581a4a55
  Daniel Hiltgen authored Dec 11, 2024
```
upload-artifacts strips off leading common paths so when
the ./build/ artifacts were removed, the ./dist/windows-amd64
prefix became common and was stripped, making the
later download-artifacts place them in the wrong location
```
  581a4a55
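
The prefix-stripping behaviour described in 581a4a55 can be modelled with a short Go sketch. It only mimics the behaviour the commit message describes and is not the artifact action's implementation; `commonDir` and the file names are hypothetical.

```go
package main

import (
	"fmt"
	"strings"
)

// commonDir returns the longest leading directory shared by all paths,
// compared segment by segment. File names are never part of the result.
// (Hypothetical helper, for illustration.)
func commonDir(paths []string) string {
	if len(paths) == 0 {
		return ""
	}
	prefix := strings.Split(paths[0], "/")
	prefix = prefix[:len(prefix)-1] // drop the file name
	for _, p := range paths[1:] {
		segs := strings.Split(p, "/")
		segs = segs[:len(segs)-1]
		n := 0
		for n < len(prefix) && n < len(segs) && prefix[n] == segs[n] {
			n++
		}
		prefix = prefix[:n]
	}
	return strings.Join(prefix, "/")
}

func main() {
	// While a build/ artifact is present, nothing is shared, so nothing is stripped.
	before := []string{
		"build/log.txt",
		"dist/windows-amd64/ollama.exe",
		"dist/windows-amd64/lib/a.dll",
	}
	// Once build/ is gone, the dist/windows-amd64 prefix becomes common.
	after := before[1:]

	fmt.Printf("%q\n", commonDir(before)) // ""
	fmt.Printf("%q\n", commonDir(after))  // "dist/windows-amd64"
}
```

If the uploader strips that common directory, the download step restores the files one level too high unless the expected prefix is added back.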
- ci: build dir changed (#8037) · 6a6328a5
  Daniel Hiltgen authored Dec 10, 2024
```
Remove no longer relevant build log dir
```
  6a6328a5
- llama: update vendored code to commit 40c6d79f (#7875) · 527cc978
  Jeffrey Morgan authored Dec 10, 2024
  
  527cc978
10 Dec, 2024 1 commit

build: Make target improvements (#7499) · 4879a234

Daniel Hiltgen authored Dec 10, 2024

* llama: wire up builtin runner

This adds a new entrypoint into the ollama CLI to run the cgo built runner.
On Mac arm64, this will have GPU support, but on all other platforms it will
be the lowest common denominator CPU build.  After we fully transition
to the new Go runners more tech-debt can be removed and we can stop building
the "default" runner via make and rely on the builtin always.

* build: Make target improvements

Add a few new targets and help for building locally.
This also adjusts the runner lookup to favor local builds, then
runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

This implements a simplified custom CPU flags pattern for the runners.
When built without overrides, the runner name contains the vector flag
we check for (AVX) to ensure we don't try to run on unsupported systems
and crash.  If the user builds a customized set, we omit the naming
scheme and don't check for compatibility.  This avoids checking
requirements at runtime, so that logic has been removed as well.  This
can be used to build GPU runners with no vector flags, or CPU/GPU
runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

If the user checks out the repo in a path that contains spaces, make gets
really confused so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

This removes support for v0.3.6 and older versions (before the tar bundle)
and ensures we clean up prior libraries before extracting the bundle(s).
Without this change, runners and dependent libraries could leak when we
update and lead to subtle runtime errors.

4879a234