Commits · 20c5fd39c8b275c0c7d7e7be8ce03d48aa32c64e · OpenDAS / ollama

07 May, 2025 2 commits

CI: trigger downstream release process (#10508) · 3098c8b2
Daniel Hiltgen authored May 07, 2025

3098c8b2

Daniel Hiltgen authored May 06, 2025

This reduces the size of our Windows installer payloads by ~256M by dropping
support for nvidia drivers older than Feb 2023. Hardware support is unchanged.

Linux default bundle sizes are reduced by ~600M to 1G.

fa393554

25 Feb, 2025 1 commit

Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) · e91ae3d4

Daniel Hiltgen authored Feb 25, 2025

* Bump cuda and rocm versions

Update ROCm to linux:6.3 win:6.2 and CUDA v12 to 12.8.
Yum has some silent failure modes, so largely switch to dnf.

* Fix windows build script

e91ae3d4

20 Feb, 2025 1 commit

ci: use clang for windows cpu builds · ba9ec3d0

Michael Yang authored Feb 20, 2025

clang outputs are faster. we were previously building with clang via gcc
wrapper in cgo but this was missed during the build updates so there was
a drop in performance

ba9ec3d0

18 Feb, 2025 1 commit

ci: set owner/group in tarball · 7b5d916a

Michael Yang authored Feb 14, 2025

set owner and group when building the linux tarball so extracted files
are consistent. this is the behaviour of release tarballs in version
0.5.7 and lower

7b5d916a

08 Feb, 2025 1 commit

ci: use windows-2022 to sign and bundle (#8941) · 1f766c36

Michael Yang authored Feb 08, 2025

ollama requires vcruntime140_1.dll which isn't found on 2019. previously
the job used the windows runner (2019) but it explicitly installs
2022 to build the app. since the sign job doesn't actually build
anything, it can use the windows-2022 runner instead.

1f766c36

06 Feb, 2025 1 commit

ci: fix linux archive (#8862) · 1c198977

Michael Yang authored Feb 05, 2025

the find returns intermediate directories which pulls the parent
directories. it also omits files under lib/ollama.

switch back to globbing

1c198977

05 Feb, 2025 2 commits
- ci: fix linux archive · 070ad913
  Michael Yang authored Feb 05, 2025
  
  070ad913
- ci: split docker build by platform · 63f0269f
  Michael Yang authored Feb 04, 2025
```
this improves build reliability and concurrency
```
  63f0269f
04 Feb, 2025 2 commits
- fix extra quote · 65b7ecac
  Michael Yang authored Feb 03, 2025
  
  65b7ecac
- fix linux archive · f9d2d891
  Michael Yang authored Feb 03, 2025
  
  f9d2d891
03 Feb, 2025 2 commits
- fix build · 669dc31c
  Michael Yang authored Feb 03, 2025
  
  669dc31c
- fix release workflow · e8061840
  Michael Yang authored Jan 31, 2025
  
  e8061840
31 Jan, 2025 2 commits

fix docker build-args · 475333d5

Michael Yang authored Jan 31, 2025

env context is not accessible from job.*.strategy. since it's in the
environment, just tell docker to use the environment variable[1]

[1]: https://docs.docker.com/reference/cli/docker/buildx/build/#build-arg

475333d5

build: set CFLAGS=-O3 specifically for cpu.go · 39fd8930
Michael Yang authored Jan 30, 2025

39fd8930

30 Jan, 2025 1 commit
- build: set goflags in linux release · 3f0cb36b
  Michael Yang authored Jan 30, 2025
  
  3f0cb36b
29 Jan, 2025 1 commit

next build (#8539) · dcfb7a10

Michael Yang authored Jan 29, 2025



* add build to .dockerignore

* test: only build one arch

* add build to .gitignore

* fix ccache path

* filter amdgpu targets

* only filter if autodetecting

* Don't clobber gpu list for default runner

This ensures the GPU specific environment variables are set properly

* explicitly set CXX compiler for HIP

* Update build_windows.ps1

This isn't complete, but is close.  Dependencies are missing, and it only builds the "default" preset.

* build: add ollama subdir

* add .git to .dockerignore

* docs: update development.md

* update build_darwin.sh

* remove unused scripts

* llm: add cwd and build/lib/ollama to library paths

* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS

* add additional cmake output vars for msvc

* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12

* remove unncessary filepath.Dir, cleanup

* add hardware-specific directory to path

* use absolute server path

* build: linux arm

* cmake install targets

* remove unused files

* ml: visit each library path once

* build: skip cpu variants on arm

* build: install cpu targets

* build: fix workflow

* shorter names

* fix rocblas install

* docs: clean up development.md

* consistent build dir removal in development.md

* silence -Wimplicit-function-declaration build warnings in ggml-cpu

* update readme

* update development readme

* llm: update library lookup logic now that there is one runner (#8587)

* tweak development.md

* update docs

* add windows cuda/rocm tests

---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

dcfb7a10

11 Dec, 2024 2 commits

ci: fix artifact path prefix for missing windows payloads (#8052) · 581a4a55

Daniel Hiltgen authored Dec 11, 2024

upload-artifacts strips off leading common paths so when
the ./build/ artifacts were removed, the ./dist/windows-amd64
prefix became common and was stripped, making the
later download-artifacts place them in the wrong location

581a4a55

ci: build dir changed (#8037) · 6a6328a5
Daniel Hiltgen authored Dec 10, 2024
```
Remove no longer relevant build log dir
```
6a6328a5

10 Dec, 2024 1 commit

build: Make target improvements (#7499) · 4879a234

Daniel Hiltgen authored Dec 10, 2024

* llama: wire up builtin runner

This adds a new entrypoint into the ollama CLI to run the cgo built runner.
On Mac arm64, this will have GPU support, but on all other platforms it will
be the lowest common denominator CPU build.  After we fully transition
to the new Go runners more tech-debt can be removed and we can stop building
the "default" runner via make and rely on the builtin always.

* build: Make target improvements

Add a few new targets and help for building locally.
This also adjusts the runner lookup to favor local builds, then
runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

This implements a simplified custom CPU flags pattern for the runners.
When built without overrides, the runner name contains the vector flag
we check for (AVX) to ensure we don't try to run on unsupported systems
and crash.  If the user builds a customized set, we omit the naming
scheme and don't check for compatibility.  This avoids checking
requirements at runtime, so that logic has been removed as well.  This
can be used to build GPU runners with no vector flags, or CPU/GPU
runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

If the user checks out the repo in a path that contains spaces, make gets
really confused so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

This removes support for v0.3.6 and older versions (before the tar bundle)
and ensures we clean up prior libraries before extracting the bundle(s).
Without this change, runners and dependent libraries could leak when we
update and lead to subtle runtime errors.

4879a234

04 Nov, 2024 3 commits
- CI: Switch to v13 macos runner (#7498) · 046054fa
  Daniel Hiltgen authored Nov 04, 2024
  
  046054fa
- CI: matrix strategy fix (#7496) · 95483f34
  Daniel Hiltgen authored Nov 04, 2024
```
Github actions matrix strategy can't access env settings
```
  95483f34
- Sign windows arm64 official binaries (#7493) · 44bd9e59
  Daniel Hiltgen authored Nov 04, 2024
  
  44bd9e59
02 Nov, 2024 1 commit

CI: omit unused tools for faster release builds (#7432) · b8d5036e

Daniel Hiltgen authored Nov 02, 2024

This leverages caching, and some reduced installer scope to try
to speed up builds. It also tidies up some windows build logic
that was only relevant for the older generate/cmake builds.

b8d5036e

30 Oct, 2024 2 commits

Soften windows clang requirement (#7428) · 712e99d4

Daniel Hiltgen authored Oct 30, 2024

This will no longer error if built with regular gcc on windows.  To help
triage issues that may come in related to different compilers, the runner now
reports the compier used by cgo.

712e99d4

Remove submodule and shift to Go server - 0.4.0 (#7157) · b754f5a6

Daniel Hiltgen authored Oct 30, 2024

* Remove llama.cpp submodule and shift new build to top

* CI: install msys and clang gcc on win

Needed for deepseek to work properly on windows

b754f5a6

24 Sep, 2024 1 commit

CI: Fix win arm version defect (#6940) · e9e9bdb8

Daniel Hiltgen authored Sep 24, 2024

write-host in powershell writes directly to the console and will not be picked
up by a pipe.  Echo, or write-output will.

e9e9bdb8

21 Sep, 2024 1 commit

CI: win arm artifact dist dir (#6900) · 2a038c1d

Daniel Hiltgen authored Sep 20, 2024

The upload artifact is missing the dist prefix since all
payloads are in the same directory, so restore the prefix
on download.

2a038c1d

20 Sep, 2024 3 commits

CI: win arm adjustments (#6898) · 616c5eaf
Daniel Hiltgen authored Sep 20, 2024

616c5eaf
CI: adjust step ordering for win arm to match x64 (#6895) · f5ff917b
Daniel Hiltgen authored Sep 20, 2024

f5ff917b

Add Windows arm64 support to official builds (#5712) · d632e23f

Daniel Hiltgen authored Sep 20, 2024

* Unified arm/x86 windows installer

This adjusts the installer payloads to be architecture aware so we can cary
both amd64 and arm64 binaries in the installer, and install only the applicable
architecture at install time.

* Include arm64 in official windows build

* Harden schedule test for slow windows timers

This test seems to be a bit flaky on windows, so give it more time to converge

d632e23f

17 Sep, 2024 1 commit

CI: dist directories no longer present (#6834) · 8f9ab5e1

Daniel Hiltgen authored Sep 16, 2024

The new buildx based build no longer leaves the dist/linux-* directories
around, so we don't have to clean them up before uploading.

8f9ab5e1

16 Sep, 2024 2 commits

CI: clean up naming, fix tagging latest (#6832) · 7717bb6a

Daniel Hiltgen authored Sep 16, 2024

The rocm CI step for RCs was incorrectly tagging them as the latest rocm build.
The multiarch manifest was incorrectly tagged twice (with and without the
prefix "v"). Static windows artifacts weren't being carried between build
jobs. This also fixes the latest tagging script.

7717bb6a

CI: set platform build build_linux script to keep buildx happy (#6829) · 0ec2915e
Daniel Hiltgen authored Sep 16, 2024
```
The runners don't have emulation set up so the default multi-platform build
wont work.
```
0ec2915e

12 Sep, 2024 1 commit

Optimize container images for startup (#6547) · cd5c8f64

Daniel Hiltgen authored Sep 12, 2024

* Optimize container images for startup

This change adjusts how to handle runner payloads to support
container builds where we keep them extracted in the filesystem.
This makes it easier to optimize the cpu/cuda vs cpu/rocm images for
size, and should result in faster startup times for container images.

* Refactor payload logic and add buildx support for faster builds

* Move payloads around

* Review comments

* Converge to buildx based helper scripts

* Use docker buildx action for release

cd5c8f64

20 Aug, 2024 1 commit

Split rocm back out of bundle (#6432) · a017cf2f

Daniel Hiltgen authored Aug 20, 2024

We're over budget for github's maximum release artifact size with rocm + 2 cuda
versions. This splits rocm back out as a discrete artifact, but keeps the layout so it can
be extracted into the same location as the main bundle.

a017cf2f

19 Aug, 2024 4 commits
- CI: remove directories from dist dir before upload step (#6429) · 19e5a890
  Daniel Hiltgen authored Aug 19, 2024
  
  19e5a890
- CI: handle directories during checksum (#6427) · f91c9e37
  Daniel Hiltgen authored Aug 19, 2024
  
  f91c9e37
- Fix overlapping artifact name on CI · d8be22e4
  Daniel Hiltgen authored Aug 19, 2024
  
  d8be22e4
- Review comments · f9e31da9
  Daniel Hiltgen authored Aug 15, 2024
  
  f9e31da9