- 26 Mar, 2024 1 commit
Patrick Devine authored
- 10 Mar, 2024 1 commit
Jeffrey Morgan authored
- 07 Mar, 2024 1 commit
Daniel Hiltgen authored
This refines where we extract the LLM libraries by adding a new OLLAMA_HOME env var, which defaults to `~/.ollama`. The extraction logic was already idempotent, so this should speed up startups after the first run of a new release, and it now cleans up after itself. We build only a single ROCm version (the latest major) on both Windows and Linux. Given the large size of ROCm's tensor files, we split the dependency out: it is bundled into the installer on Windows and shipped as a separate download on Linux. The Linux install script now detects the presence of AMD GPUs, checks whether ROCm v6 is already installed, and downloads our dependency tar file if it is not. For Linux discovery, we use sysfs and check each GPU against what ROCm supports, so we can degrade gracefully to CPU instead of having llama.cpp+ROCm assert or crash on us. For Windows, we use Go's dynamic library loading to call the amdhip64.dll APIs and query GPU information.
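A minimal sketch of the override, assuming `ollama serve` reads OLLAMA_HOME from its environment (the path shown is hypothetical):

```sh
# Extract the LLM libraries under /opt/ollama instead of the
# default ~/.ollama (example path, not a recommendation):
OLLAMA_HOME=/opt/ollama ollama serve
```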
- 23 Jan, 2024 1 commit
Daniel Hiltgen authored
If a VERSION is not specified, this will generate a version string that represents the state of the repo. For example, `0.1.21-12-gffaf52e1-dirty` means 12 commits past the 0.1.21 tag, at commit ffaf52e1 (the `g` prefix marks a git hash), with uncommitted changes in the tree.
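The string follows `git describe` conventions; a sketch of both paths, assuming the build script reads VERSION from the environment as the message implies:

```sh
# No VERSION set: derive the version from the repo state.
git describe --tags --dirty
# => 0.1.21-12-gffaf52e1-dirty
#    (tag, commits since the tag, g + abbreviated hash, dirty tree)

# VERSION set explicitly: used as-is.
VERSION=0.1.21 ./scripts/build_linux.sh
```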
- 21 Jan, 2024 2 commits
Daniel Hiltgen authored
The Linux build now supports parallel CPU builds to speed things up. This also exposes the AMD GPU targets as an optional setting for advanced users who want to alter our default set.
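For example, something along these lines; the `AMDGPU_TARGETS` variable name is an assumption for illustration, so check the build scripts for the exact setting:

```sh
# Restrict the ROCm build to specific AMD GPU targets instead of
# the default set (variable name assumed):
AMDGPU_TARGETS="gfx1030;gfx1100" ./scripts/build_linux.sh
```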
Daniel Hiltgen authored
This renames Dockerfile.build to Dockerfile and adds new stages to support two modes of building: the build_linux.sh script uses intermediate stages to extract the artifacts into ./dist, while the default build generates a container image usable by both CUDA and ROCm cards. This required transitioning the x86 base to the ROCm image to avoid layer bloat.
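The two modes look roughly like this:

```sh
# Default build: a container image that works on both CUDA and ROCm cards.
docker build -t ollama .

# Script build: drives the intermediate stages and extracts the
# binaries into ./dist.
./scripts/build_linux.sh
```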
- 11 Jan, 2024 1 commit
Daniel Hiltgen authored
This reduces the built-in Linux variant to use no vector extensions, which enables the resulting builds to run under Rosetta on macOS in Docker. At runtime it then checks which vector extensions the CPU actually supports and loads the best CPU library available.
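The real check lives in the runtime loader; as a rough illustration of what it keys off, you can list the relevant CPU flags on a Linux host:

```sh
# Show which vector extensions this CPU advertises:
grep -o 'avx512[a-z]*\|avx2\|avx' /proc/cpuinfo | sort -u
```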
- 10 Jan, 2024 1 commit
Daniel Hiltgen authored
This can help speed up incremental builds when you're only testing one architecture, such as amd64. E.g. `BUILD_ARCH=amd64 ./scripts/build_linux.sh && scp ./dist/ollama-linux-amd64 test-system:`
- 05 Jan, 2024 1 commit
Michael Yang authored
- 03 Jan, 2024 1 commit
Jeffrey Morgan authored
- 22 Dec, 2023 3 commits
Jeffrey Morgan authored
Daniel Hiltgen authored
By default, builds now produce non-debug, non-verbose binaries. To enable verbose logs in llama.cpp and debug symbols in the native code, set `CGO_CFLAGS=-g`.
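Assuming the usual `go generate` / `go build` flow, a debug build would look like:

```sh
# Verbose llama.cpp logs plus native debug symbols:
CGO_CFLAGS=-g go generate ./...
CGO_CFLAGS=-g go build .
```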
Daniel Hiltgen authored
- 19 Dec, 2023 3 commits
Daniel Hiltgen authored
If someone checks out the ollama repo and doesn't have the CUDA library installed, this ensures they can still build a CPU-only version.
Daniel Hiltgen authored
Daniel Hiltgen authored
Run server.cpp directly inside the Go runtime via cgo, while retaining the LLM Go abstractions.
- 29 Sep, 2023 1 commit
Michael Yang authored
- 22 Sep, 2023 1 commit
Jeffrey Morgan authored
Add `Dockerfile.build` for building Linux binaries.
Co-authored-by: Michael Yang <mxyng@pm.me>