Commits · 730ed6e9e12d0bf182d554a54dee8bbbef6a88c7 · OpenDAS / ollama

03 Oct, 2025 2 commits
- ci: fix windows build (#12485) · 730ed6e9
  Daniel Hiltgen authored Oct 02, 2025
  
  730ed6e9
- ci: fix windows build (#12484) · dc066016
  Daniel Hiltgen authored Oct 02, 2025
  
  dc066016
02 Oct, 2025 1 commit
- AMD: block running on unsupported gfx900/gfx906 (#12481) · 55ca8272
  Daniel Hiltgen authored Oct 02, 2025
  
  55ca8272
11 Sep, 2025 1 commit

CI: fix windows cuda build (#12246) · 61fb912c

Daniel Hiltgen authored Sep 11, 2025

* ci: adjust cuda component list

v13 has a different breakdown of the components required to build ollama

* review comments

61fb912c

10 Sep, 2025 1 commit

Add v12 + v13 cuda support (#12000) · 17a023f3

Daniel Hiltgen authored Sep 10, 2025

* Add support for upcoming NVIDIA Jetsons

The latest Jetsons with JetPack 7 are moving to an SBSA compatible model and
will not require building a JetPack specific variant.

* cuda: bring back dual versions

This adds back dual CUDA versions for our releases,
with v11 and v13 to cover a broad set of GPUs and
driver versions.

* win: break up native builds in build_windows.ps1

* v11 build working on windows and linux

* switch to cuda v12.8 not JIT

* Set CUDA compression to size

* enhance manual install linux docs

17a023f3

30 Jul, 2025 1 commit
- Revert "CI: switch back to x86 macos builder" (#11588) · 6dcc5dfb
  Daniel Hiltgen authored Jul 30, 2025
```
This reverts commit 9d071e6089319b37acf62bb739e3430dcb2ac0c3.
```
  6dcc5dfb
29 Jul, 2025 1 commit
- CI: switch back to x86 macos builder (#11572) · 8afa6e83
  Daniel Hiltgen authored Jul 29, 2025
  
  8afa6e83
17 Jul, 2025 1 commit
- ci: switch mac builder to arm64 (#11379) · 191d9428
  Daniel Hiltgen authored Jul 17, 2025
```
The macos-13 is x86, while macos-13-xlarge is arm64
```
  191d9428
07 Jul, 2025 1 commit
- ci: modularization (#11324) · 12d8ad0d
  Daniel Hiltgen authored Jul 07, 2025
```
switch a few constants to variables
```
  12d8ad0d
26 Jun, 2025 1 commit
- ci: multi-stage release process (#11001) · 11ffc361
  Daniel Hiltgen authored Jun 26, 2025
  
  11ffc361
25 Jun, 2025 4 commits
- ci: arm sbsa fixes (#11194) · ad118d8b
  Daniel Hiltgen authored Jun 24, 2025
  
  ad118d8b
- ci: include dependencies · f0853413
  Daniel Hiltgen authored Jun 24, 2025
  
  f0853413
- ci: pick up arm sbsa cuda libs (#11192) · 4b4a90f2
  Daniel Hiltgen authored Jun 24, 2025
  
  4b4a90f2
- ci: recombine linux amd64 binaries (#11188) · 03274a6b
  Daniel Hiltgen authored Jun 24, 2025
```
Glue the rocm and archive builds back together.
```
  03274a6b
24 Jun, 2025 2 commits

ci: rocm parallel builds on windows (#11187) · 405d2f62

Daniel Hiltgen authored Jun 24, 2025

The preset CMAKE_HIP_FLAGS isn't getting used on Windows.
This passes the parallel flag in through the C/CXX flags, along
with suppression for some log spew warnings to quiet down the build.

405d2f62

CI: switch windows to vs 2022 (#11184) · c85c0ebf
Daniel Hiltgen authored Jun 24, 2025
```
* CI: switch windows to vs 2022

* ci: fix regex match
```
c85c0ebf

23 Jun, 2025 1 commit

Re-remove cuda v11 (#10694) · 1c6669e6

Daniel Hiltgen authored Jun 23, 2025

* Re-remove cuda v11

Revert the revert - drop v11 support requiring drivers newer than Feb 23

This reverts commit c6bcdc42.

* Simplify layout

With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)

* distinct sbsa variant for linux arm64

This avoids accidentally trying to load the sbsa cuda libraries on
a jetson system which results in crashes.

* temporary prevent rocm+cuda mixed loading

1c6669e6

13 May, 2025 1 commit

Revert "remove cuda v11 (#10569)" (#10692) · c6bcdc42

Daniel Hiltgen authored May 13, 2025

Bring back v11 until we can better warn users that their driver
is too old.

This reverts commit fa393554.

c6bcdc42

07 May, 2025 2 commits

CI: trigger downstream release process (#10508) · 3098c8b2
Daniel Hiltgen authored May 07, 2025

3098c8b2

remove cuda v11 (#10569) · fa393554

Daniel Hiltgen authored May 06, 2025

This reduces the size of our Windows installer payloads by ~256M by dropping
support for nvidia drivers older than Feb 2023. Hardware support is unchanged.

Linux default bundle sizes are reduced by ~600M to 1G.

fa393554

25 Feb, 2025 1 commit

Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) · e91ae3d4

Daniel Hiltgen authored Feb 25, 2025

* Bump cuda and rocm versions

Update ROCm to linux:6.3 win:6.2 and CUDA v12 to 12.8.
Yum has some silent failure modes, so largely switch to dnf.

* Fix windows build script

e91ae3d4

20 Feb, 2025 1 commit

ci: use clang for windows cpu builds · ba9ec3d0

Michael Yang authored Feb 20, 2025

clang outputs are faster. we were previously building with clang via gcc
wrapper in cgo but this was missed during the build updates so there was
a drop in performance

ba9ec3d0

18 Feb, 2025 1 commit

ci: set owner/group in tarball · 7b5d916a

Michael Yang authored Feb 14, 2025

set owner and group when building the linux tarball so extracted files
are consistent. this is the behaviour of release tarballs in version
0.5.7 and lower

7b5d916a

08 Feb, 2025 1 commit

ci: use windows-2022 to sign and bundle (#8941) · 1f766c36

Michael Yang authored Feb 08, 2025

ollama requires vcruntime140_1.dll which isn't found on 2019. previously
the job used the windows runner (2019) but it explicitly installs
2022 to build the app. since the sign job doesn't actually build
anything, it can use the windows-2022 runner instead.

1f766c36

06 Feb, 2025 1 commit

ci: fix linux archive (#8862) · 1c198977

Michael Yang authored Feb 05, 2025

the find returns intermediate directories which pulls the parent
directories. it also omits files under lib/ollama.

switch back to globbing

1c198977

05 Feb, 2025 2 commits
- ci: fix linux archive · 070ad913
  Michael Yang authored Feb 05, 2025
  
  070ad913
- ci: split docker build by platform · 63f0269f
  Michael Yang authored Feb 04, 2025
```
this improves build reliability and concurrency
```
  63f0269f
04 Feb, 2025 2 commits
- fix extra quote · 65b7ecac
  Michael Yang authored Feb 03, 2025
  
  65b7ecac
- fix linux archive · f9d2d891
  Michael Yang authored Feb 03, 2025
  
  f9d2d891
03 Feb, 2025 2 commits
- fix build · 669dc31c
  Michael Yang authored Feb 03, 2025
  
  669dc31c
- fix release workflow · e8061840
  Michael Yang authored Jan 31, 2025
  
  e8061840
31 Jan, 2025 2 commits

fix docker build-args · 475333d5

Michael Yang authored Jan 31, 2025

env context is not accessible from job.*.strategy. since it's in the
environment, just tell docker to use the environment variable[1]

[1]: https://docs.docker.com/reference/cli/docker/buildx/build/#build-arg

475333d5

build: set CFLAGS=-O3 specifically for cpu.go · 39fd8930
Michael Yang authored Jan 30, 2025

39fd8930

30 Jan, 2025 1 commit
- build: set goflags in linux release · 3f0cb36b
  Michael Yang authored Jan 30, 2025
  
  3f0cb36b
29 Jan, 2025 1 commit

next build (#8539) · dcfb7a10

Michael Yang authored Jan 29, 2025



* add build to .dockerignore

* test: only build one arch

* add build to .gitignore

* fix ccache path

* filter amdgpu targets

* only filter if autodetecting

* Don't clobber gpu list for default runner

This ensures the GPU specific environment variables are set properly

* explicitly set CXX compiler for HIP

* Update build_windows.ps1

This isn't complete, but is close.  Dependencies are missing, and it only builds the "default" preset.

* build: add ollama subdir

* add .git to .dockerignore

* docs: update development.md

* update build_darwin.sh

* remove unused scripts

* llm: add cwd and build/lib/ollama to library paths

* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS

* add additional cmake output vars for msvc

* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12

* remove unncessary filepath.Dir, cleanup

* add hardware-specific directory to path

* use absolute server path

* build: linux arm

* cmake install targets

* remove unused files

* ml: visit each library path once

* build: skip cpu variants on arm

* build: install cpu targets

* build: fix workflow

* shorter names

* fix rocblas install

* docs: clean up development.md

* consistent build dir removal in development.md

* silence -Wimplicit-function-declaration build warnings in ggml-cpu

* update readme

* update development readme

* llm: update library lookup logic now that there is one runner (#8587)

* tweak development.md

* update docs

* add windows cuda/rocm tests

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

dcfb7a10

11 Dec, 2024 2 commits

ci: fix artifact path prefix for missing windows payloads (#8052) · 581a4a55

Daniel Hiltgen authored Dec 11, 2024

upload-artifacts strips off leading common paths so when
the ./build/ artifacts were removed, the ./dist/windows-amd64
prefix became common and was stripped, making the
later download-artifacts place them in the wrong location

581a4a55

ci: build dir changed (#8037) · 6a6328a5
Daniel Hiltgen authored Dec 10, 2024
```
Remove no longer relevant build log dir
```
6a6328a5

10 Dec, 2024 1 commit

build: Make target improvements (#7499) · 4879a234

Daniel Hiltgen authored Dec 10, 2024

* llama: wire up builtin runner

This adds a new entrypoint into the ollama CLI to run the cgo built runner.
On Mac arm64, this will have GPU support, but on all other platforms it will
be the lowest common denominator CPU build.  After we fully transition
to the new Go runners more tech-debt can be removed and we can stop building
the "default" runner via make and rely on the builtin always.

* build: Make target improvements

Add a few new targets and help for building locally.
This also adjusts the runner lookup to favor local builds, then
runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

This implements a simplified custom CPU flags pattern for the runners.
When built without overrides, the runner name contains the vector flag
we check for (AVX) to ensure we don't try to run on unsupported systems
and crash.  If the user builds a customized set, we omit the naming
scheme and don't check for compatibility.  This avoids checking
requirements at runtime, so that logic has been removed as well.  This
can be used to build GPU runners with no vector flags, or CPU/GPU
runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

If the user checks out the repo in a path that contains spaces, make gets
really confused so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

This removes support for v0.3.6 and older versions (before the tar bundle)
and ensures we clean up prior libraries before extracting the bundle(s).
Without this change, runners and dependent libraries could leak when we
update and lead to subtle runtime errors.

4879a234

04 Nov, 2024 2 commits
- CI: Switch to v13 macos runner (#7498) · 046054fa
  Daniel Hiltgen authored Nov 04, 2024
  
  046054fa
- CI: matrix strategy fix (#7496) · 95483f34
  Daniel Hiltgen authored Nov 04, 2024
```
Github actions matrix strategy can't access env settings
```
  95483f34