- 03 Mar, 2025 2 commits
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
Reverts ccache installation to being done manually via curl instead of via the dnf package manager, since installing with dnf has the side effect of prepending ccache's install directory to the front of the PATH.
-
- 25 Feb, 2025 2 commits
-
Daniel Hiltgen authored
* Bump cuda and rocm versions: update ROCm to linux:6.3, win:6.2 and CUDA v12 to 12.8. Yum has some silent failure modes, so largely switch to dnf.
* Fix windows build script
-
José Pekkarinen authored
centos-7 images have been deprecated upstream and replaced with almalinux-8 images, which requires some small extra work. Signed-off-by: José Pekkarinen <jose.pekkarinen@foxhound.fi>
-
- 29 Jan, 2025 1 commit
-
Michael Yang authored
* add build to .dockerignore
* test: only build one arch
* add build to .gitignore
* fix ccache path
* filter amdgpu targets
* only filter if autodetecting
* Don't clobber gpu list for default runner. This ensures the GPU specific environment variables are set properly.
* explicitly set CXX compiler for HIP
* Update build_windows.ps1. This isn't complete, but is close. Dependencies are missing, and it only builds the "default" preset.
* build: add ollama subdir
* add .git to .dockerignore
* docs: update development.md
* update build_darwin.sh
* remove unused scripts
* llm: add cwd and build/lib/ollama to library paths
* default DYLD_LIBRARY_PATH to LD_LIBRARY_PATH in runner on macOS
* add additional cmake output vars for msvc
* interim edits to make server detection logic work with dll directories like lib/ollama/cuda_v12
* remove unnecessary filepath.Dir, cleanup
* add hardware-specific directory to path
* use absolute server path
* build: linux arm
* cmake install targets
* remove unused files
* ml: visit each library path once
* build: skip cpu variants on arm
* build: install cpu targets
* build: fix workflow
* shorter names
* fix rocblas install
* docs: clean up development.md
* consistent build dir removal in development.md
* silence -Wimplicit-function-declaration build warnings in ggml-cpu
* update readme
* update development readme
* llm: update library lookup logic now that there is one runner (#8587)
* tweak development.md
* update docs
* add windows cuda/rocm tests
---------
Co-authored-by: jmorganca <jmorganca@gmail.com>
Co-authored-by: Daniel Hiltgen <daniel@ollama.com>
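For the "ml: visit each library path once" item above, a minimal Go sketch of the deduplication idea follows. The function name and paths are hypothetical illustrations, not Ollama's actual code: each candidate path is resolved to an absolute path so that differently spelled entries for the same directory are visited only once.

```go
package main

import (
	"fmt"
	"path/filepath"
)

// visitLibraryPaths calls visit at most once per distinct library path,
// resolving each entry to an absolute path before deduplicating.
func visitLibraryPaths(paths []string, visit func(string)) {
	seen := make(map[string]bool)
	for _, p := range paths {
		abs, err := filepath.Abs(p)
		if err != nil {
			continue // skip unresolvable entries
		}
		if !seen[abs] {
			seen[abs] = true
			visit(abs)
		}
	}
}

func main() {
	// "build/lib/ollama" and "./build/lib/ollama" resolve to the same
	// absolute path, so they are visited only once.
	paths := []string{"build/lib/ollama", "./build/lib/ollama", "lib/ollama/cuda_v12"}
	visitLibraryPaths(paths, func(p string) { fmt.Println(p) })
}
```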
-
- 06 Jan, 2025 1 commit
-
frob authored
* Add CUSTOM_CPU_FLAGS.
* Fix golangci-lint error.
---------
Co-authored-by: Richard Lyons <rick@frob.com.au>
-
- 14 Dec, 2024 1 commit
-
Daniel Hiltgen authored
-
- 11 Dec, 2024 1 commit
-
Daniel Hiltgen authored
Pass through the version override so the makefiles use it
-
- 10 Dec, 2024 1 commit
-
Daniel Hiltgen authored
* llama: wire up builtin runner

  This adds a new entrypoint into the ollama CLI to run the cgo built runner. On Mac arm64, this will have GPU support, but on all other platforms it will be the lowest common denominator CPU build. After we fully transition to the new Go runners more tech-debt can be removed and we can stop building the "default" runner via make and rely on the builtin always.

* build: Make target improvements

  Add a few new targets and help for building locally. This also adjusts the runner lookup to favor local builds, then runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

  This implements a simplified custom CPU flags pattern for the runners. When built without overrides, the runner name contains the vector flag we check for (AVX) to ensure we don't try to run on unsupported systems and crash. If the user builds a customized set, we omit the naming scheme and don't check for compatibility. This avoids checking requirements at runtime, so that logic has been removed as well. This can be used to build GPU runners with no vector flags, or CPU/GPU runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

  If the user checks out the repo in a path that contains spaces, make gets really confused, so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

  This removes support for v0.3.6 and older versions (before the tar bundle) and ensures we clean up prior libraries before extracting the bundle(s). Without this change, runners and dependent libraries could leak when we update and lead to subtle runtime errors.
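The "local builds, then relative to the executable, then payloads" lookup order described above can be sketched as a simple candidate walk. This is a hedged illustration under assumed directory names, not Ollama's actual lookup code:

```go
package main

import (
	"fmt"
	"os"
	"path/filepath"
)

// findRunner returns the first runner found, checking candidates in the
// order described in the commit: a local build first, then a runner
// relative to the executable, and finally the extracted payload location.
// All directory names here are hypothetical.
func findRunner(name string) (string, error) {
	exe, err := os.Executable()
	if err != nil {
		return "", err
	}
	candidates := []string{
		filepath.Join("build", "lib", "ollama", name),           // local build
		filepath.Join(filepath.Dir(exe), "lib", "ollama", name), // relative to executable
		filepath.Join(os.TempDir(), "ollama", "runners", name),  // extracted payloads
	}
	for _, c := range candidates {
		if _, err := os.Stat(c); err == nil {
			return c, nil
		}
	}
	return "", fmt.Errorf("runner %q not found", name)
}

func main() {
	if p, err := findRunner("cpu_avx"); err == nil {
		fmt.Println("using runner:", p)
	}
}
```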
-
- 15 Nov, 2024 1 commit
-
Daniel Hiltgen authored
Fix a rebase glitch from the old C++ runner build model
-
- 12 Nov, 2024 1 commit
-
Daniel Hiltgen authored
This adds support for the Jetson JetPack variants into the Go runner
-
- 30 Oct, 2024 1 commit
-
Daniel Hiltgen authored
* Remove llama.cpp submodule and shift new build to top
* CI: install msys and clang gcc on win. Needed for deepseek to work properly on windows.
-
- 27 Oct, 2024 1 commit
-
Daniel Hiltgen authored
-
- 08 Oct, 2024 1 commit
-
Jeffrey Morgan authored
* Re-introduce the llama package

  This PR brings back the llama package, making it possible to call llama.cpp and ggml APIs from Go directly via CGo. This has a few advantages:
  - C APIs can be called directly from Go without needing to use the previous "server" REST API
  - On macOS and for CPU builds on Linux and Windows, Ollama can be built without a go generate ./... step, making it easy to get up and running to hack on parts of Ollama that don't require fast inference
  - Faster build times for AVX, AVX2, CUDA and ROCm (a full build of all runners takes <5 min on a fast CPU)
  - No git submodule, making it easier to clone and build from source

  This is a big PR, but much of it is vendor code except for:
  - llama.go: CGo bindings
  - example/: a simple example of running inference
  - runner/: a subprocess server designed to replace the llm/ext_server package
  - Makefile: a Makefile that is as minimal as possible, to build the runner package for different...
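As a minimal illustration of the CGo binding pattern this commit relies on (and not the actual llama package API), a Go file can declare C code in its preamble and call it directly. The C function here is a stand-in; the real bindings include the vendored llama.cpp/ggml headers instead:

```go
package main

/*
#include <stdio.h>
#include <stdlib.h>

// Stand-in for a real llama.cpp/ggml API call.
static void greet(const char* name) {
	printf("hello from C, %s\n", name);
}
*/
import "C"

import "unsafe"

func main() {
	// Go strings must be converted to C strings, and the C allocation
	// freed explicitly.
	name := C.CString("ollama")
	defer C.free(unsafe.Pointer(name))
	C.greet(name)
}
```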
-
- 12 Sep, 2024 1 commit
-
Daniel Hiltgen authored
* Optimize container images for startup

  This change adjusts how to handle runner payloads to support container builds where we keep them extracted in the filesystem. This makes it easier to optimize the cpu/cuda vs cpu/rocm images for size, and should result in faster startup times for container images.

* Refactor payload logic and add buildx support for faster builds
* Move payloads around
* Review comments
* Converge to buildx based helper scripts
* Use docker buildx action for release
-
- 10 Sep, 2024 1 commit
-
Daniel Hiltgen authored
* Quiet down Docker's new lint warnings. Docker has recently added lint warnings to builds; this cleans up those warnings.
* Fix go lint regression
-
- 03 Sep, 2024 1 commit
-
R0CKSTAR authored
Signed-off-by: Xiaodong Ye <yeahdongcn@gmail.com>
-
- 20 Aug, 2024 1 commit
-
Daniel Hiltgen authored
We're over budget for GitHub's maximum release artifact size with ROCm + 2 CUDA versions. This splits ROCm back out as a discrete artifact, but keeps the layout so it can be extracted into the same location as the main bundle.
-
- 19 Aug, 2024 7 commits
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Based on compute capability and driver version, pick the v12 or v11 CUDA variant.
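The selection rule can be sketched as a simple check over both values. This is a hedged Go illustration; the exact thresholds below are assumptions for the example, not Ollama's real cutoffs:

```go
package main

import "fmt"

// pickCudaVariant chooses the v12 runner only when both the GPU's compute
// capability and the installed driver are new enough, otherwise falls back
// to v11. Thresholds are illustrative only.
func pickCudaVariant(computeMajor, driverMajor int) string {
	if computeMajor >= 6 && driverMajor >= 12 {
		return "cuda_v12"
	}
	return "cuda_v11"
}

func main() {
	fmt.Println(pickCudaVariant(8, 12)) // cuda_v12
	fmt.Println(pickCudaVariant(7, 11)) // cuda_v11
}
```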
-
Daniel Hiltgen authored
This adds new variants for arm64 specific to Jetson platforms
-
Daniel Hiltgen authored
This should help speed things up a little
-
Daniel Hiltgen authored
This adjusts linux to follow a similar model to windows, with a discrete archive (zip/tgz) to carry the primary executable and dependent libraries. Runners are still carried as payloads inside the main binary. Darwin retains the payload model, where the Go binary is fully self contained.
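A hedged sketch of the payload model mentioned above: runner files carried inside the main binary (e.g. via go:embed) and written out to disk at startup. The filesystem layout and names here are hypothetical; a fstest.MapFS stands in for the embedded filesystem so the example is self-contained:

```go
package main

import (
	"io/fs"
	"os"
	"path/filepath"
	"testing/fstest"
)

// extractPayloads walks an embedded-style filesystem and writes each file
// under dest, preserving the relative layout.
func extractPayloads(payloads fs.FS, dest string) error {
	return fs.WalkDir(payloads, ".", func(path string, d fs.DirEntry, err error) error {
		if err != nil || d.IsDir() {
			return err
		}
		data, err := fs.ReadFile(payloads, path)
		if err != nil {
			return err
		}
		out := filepath.Join(dest, path)
		if err := os.MkdirAll(filepath.Dir(out), 0o755); err != nil {
			return err
		}
		return os.WriteFile(out, data, 0o755)
	})
}

func main() {
	// Stand-in for a real embedded payload filesystem.
	fakePayloads := fstest.MapFS{
		"runners/cpu_avx/libggml.so": &fstest.MapFile{Data: []byte("...")},
	}
	if err := extractPayloads(fakePayloads, os.TempDir()); err != nil {
		panic(err)
	}
}
```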
-
- 22 Jul, 2024 1 commit
-
Daniel Hiltgen authored
-
- 17 Jul, 2024 1 commit
-
lreed authored
-
- 15 Jul, 2024 1 commit
-
Daniel Hiltgen authored
-
- 02 Jul, 2024 1 commit
-
Daniel Hiltgen authored
The centos 7 arm mirrors have disappeared due to the EOL 2 days ago, and the vault sed workaround that works for x86 doesn't work for arm.
-
- 14 Jun, 2024 1 commit
-
Daniel Hiltgen authored
-
- 17 Apr, 2024 2 commits
- 11 Apr, 2024 1 commit
-
Daniel Hiltgen authored
-
- 01 Apr, 2024 1 commit
-
Daniel Hiltgen authored
This should resolve a number of memory leak and stability defects by allowing us to isolate llama.cpp in a separate process, shut it down when idle, and gracefully restart it if it has problems. This also serves as a first step toward running multiple copies to support multiple models concurrently.
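The subprocess model can be sketched as a small supervisor loop in Go. This is a minimal illustration of the idea (spawn, wait, restart on abnormal exit), not Ollama's actual server code; the runner binary name and flags are hypothetical:

```go
package main

import (
	"log"
	"os/exec"
	"time"
)

// superviseRunner runs the inference server as a child process and
// restarts it if it exits abnormally. A clean exit (e.g. idle shutdown)
// ends supervision.
func superviseRunner() {
	for {
		cmd := exec.Command("./ollama-runner", "--port", "0")
		if err := cmd.Start(); err != nil {
			log.Printf("start failed: %v", err)
			return
		}
		if err := cmd.Wait(); err != nil {
			log.Printf("runner exited with error, restarting: %v", err)
			time.Sleep(time.Second) // back off before restarting
			continue
		}
		return // clean exit
	}
}

func main() { superviseRunner() }
```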
-
- 28 Mar, 2024 1 commit
-
Daniel Hiltgen authored
-
- 26 Mar, 2024 2 commits
-
Patrick Devine authored
-
Daniel Hiltgen authored
This reverts commit 5dacc1eb.
-
- 25 Mar, 2024 1 commit
-
Daniel Hiltgen authored
We had started using Rocky Linux 8, but they've updated to GCC 10.3, which breaks NVCC. 10.2 is compatible (or 10.4, but that's not available from Rocky Linux 8 repos yet).
-
- 21 Mar, 2024 1 commit
-
Bruce MacDonald authored
-
- 15 Mar, 2024 1 commit
-
Daniel Hiltgen authored
Flesh out our GitHub Actions CI so we can build official releases.
-