- 26 Mar, 2024 2 commits
  - Patrick Devine authored
  - Daniel Hiltgen authored
    This should hopefully only be a temporary workaround until Rocky 8 picks up GCC 10.4, which fixes the NVCC bug.
- 23 Mar, 2024 1 commit
  - Daniel Hiltgen authored
    This uplevels the integration tests to run against the server, which allows testing an existing server or a remote server.
- 15 Mar, 2024 2 commits
  - Daniel Hiltgen authored
    Flesh out our GitHub Actions CI so we can build official releases.
  - Daniel Hiltgen authored
- 11 Mar, 2024 1 commit
  - Jeffrey Morgan authored
- 10 Mar, 2024 1 commit
  - Jeffrey Morgan authored
- 07 Mar, 2024 1 commit
  - Daniel Hiltgen authored
    This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, which defaults to `~/.ollama`. The logic was already idempotent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself.

    We now build only a single ROCm version (latest major) on both Windows and Linux. Given the large size of ROCm's tensor files, we split the dependency out: it's bundled into the installer on Windows, and a separate download on Linux. The Linux install script now detects the presence of AMD GPUs, checks whether ROCm v6 is already present, and if not, downloads our dependency tar file.

    For Linux discovery, we now use sysfs and check each GPU against what ROCm supports, so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For Windows, we now use Go's Windows dynamic library loading logic to access the amdhip64.dll APIs to query GPU information.
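The sysfs-based AMD GPU discovery described above can be sketched roughly as follows. This is a hedged illustration, not the actual implementation: the `/sys/class/drm` path layout and the check against AMD's PCI vendor ID (`0x1002`) are assumptions about a typical Linux system.

```shell
# Rough sketch of sysfs GPU discovery on Linux (illustrative only).
# 0x1002 is AMD's PCI vendor ID; /sys/class/drm is the common device layout.
found=0
for dev in /sys/class/drm/card*/device/vendor; do
  [ -e "$dev" ] || continue
  if [ "$(cat "$dev")" = "0x1002" ]; then
    echo "AMD GPU candidate: ${dev%/vendor}"
    found=1
  fi
done
[ "$found" -eq 1 ] || echo "no AMD GPU found; falling back to CPU"
```

A real implementation would additionally read the device/gfx version and compare it against the set ROCm supports before deciding to use the GPU.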
- 29 Feb, 2024 1 commit
  - Daniel Hiltgen authored
    On openSUSE, ollama needs to be a member of the video group to access the GPU.
- 27 Feb, 2024 1 commit
  - Daniel Hiltgen authored
    Allow overriding the platform, image name, and "latest" tag for the standard and ROCm images.
- 22 Feb, 2024 2 commits
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
- 21 Feb, 2024 3 commits
  - Josh authored
  - Jeffrey Morgan authored
    * remove `-w -s` linker flags on windows
    * use `zip` for windows installer compression
  - Jeffrey Morgan authored
- 16 Feb, 2024 1 commit
  - Daniel Hiltgen authored
    Also fixes a few fit-and-finish items for a better developer experience.
- 15 Feb, 2024 4 commits
  - Daniel Hiltgen authored
    This will be useful for our automated test rigging, and may be useful for advanced users who want to "roll their own" system service.
  - jmorganca authored
  - Daniel Hiltgen authored
    This focuses on Windows first, but could be used for Mac and possibly Linux in the future.
  - Daniel Hiltgen authored
- 09 Feb, 2024 1 commit
  - Jeffrey Morgan authored
- 26 Jan, 2024 1 commit
  - Daniel Hiltgen authored
    This adds ROCm support back as a discrete image.
- 23 Jan, 2024 1 commit
  - Daniel Hiltgen authored
    If a VERSION is not specified, this will generate a version string that represents the state of the repo. For example, `0.1.21-12-gffaf52e1-dirty` represents 12 commits past the 0.1.21 tag, at commit ffaf52e1 (the `g` prefix marks a git hash), with a dirty working tree.
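A version string of that shape is what `git describe` produces. A minimal sketch in a throwaway repo, assuming a tag name chosen to mirror the example (the real build script may pass different flags):

```shell
# Build a throwaway repo and show the shape of git-describe output.
set -e
tmp=$(mktemp -d)
cd "$tmp"
git init -q
git -c user.name=demo -c user.email=demo@example.com commit -q --allow-empty -m "initial"
git tag 0.1.21
git -c user.name=demo -c user.email=demo@example.com commit -q --allow-empty -m "one more"
VERSION=$(git describe --tags --dirty)
echo "$VERSION"   # shape: 0.1.21-1-g<shorthash>
```

With a dirty working tree, `--dirty` appends the `-dirty` suffix seen in the example above.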
- 21 Jan, 2024 2 commits
  - Daniel Hiltgen authored
    The Linux build now supports parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advanced users who want to alter our default set.
  - Daniel Hiltgen authored
    This renames Dockerfile.build to Dockerfile, and adds some new stages to support 2 modes of building: the build_linux.sh script uses intermediate stages to extract the artifacts for ./dist, and the default build generates a container image usable by both CUDA and ROCm cards. This required transitioning the x86 base to the ROCm image to avoid layer bloat.
- 19 Jan, 2024 1 commit
  - Jeffrey Morgan authored
- 17 Jan, 2024 1 commit
  - Daniel Hiltgen authored
    This also refines the build process for the ext_server build.
- 16 Jan, 2024 1 commit
  - Michael Yang authored
    Repos for Fedora 38 and newer do not exist as of this commit:
    ```
    $ dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo
    Adding repo from: https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo
    Status code: 404 for https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo (IP: 152.195.19.142)
    Error: Configuration of repo failed
    ```
- 11 Jan, 2024 2 commits
  - Daniel Hiltgen authored
    This reduces the built-in Linux version to not use any vector extensions, which enables the resulting builds to run under Rosetta on macOS in Docker. At runtime it then checks for the actual CPU vector extensions and loads the best CPU library available.
  - Daniel Hiltgen authored
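Runtime selection of the best CPU library, as described in the first commit above, can be sketched like this. It's a hedged illustration only: the variant names and the `/proc/cpuinfo` check are assumptions, and the real loader does not work this way.

```shell
# Pick a CPU-library variant based on detected vector extensions (Linux).
# Falls back to the plain "cpu" build when /proc/cpuinfo is unavailable.
if grep -qw avx2 /proc/cpuinfo 2>/dev/null; then
  variant=cpu_avx2
elif grep -qw avx /proc/cpuinfo 2>/dev/null; then
  variant=cpu_avx
else
  variant=cpu
fi
echo "selected variant: $variant"
```

The key design point is that the detection happens on the machine actually running the binary, so a single build artifact degrades gracefully on CPUs (or emulators like Rosetta) that lack AVX.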
- 10 Jan, 2024 1 commit
  - Daniel Hiltgen authored
    This can help speed up incremental builds when you're only testing one architecture, like amd64. E.g. `BUILD_ARCH=amd64 ./scripts/build_linux.sh && scp ./dist/ollama-linux-amd64 test-system:`
- 09 Jan, 2024 1 commit
  - Jeffrey Morgan authored
- 05 Jan, 2024 1 commit
  - Michael Yang authored
- 04 Jan, 2024 1 commit
  - Daniel Hiltgen authored
    This prevents users from accidentally installing on WSL1, with instructions guiding them to upgrade their WSL instance to version 2. Once running WSL2, if you have an NVIDIA card you can follow NVIDIA's instructions to set up GPU passthrough and run models on the GPU; this is not possible on WSL1.
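One common heuristic for telling WSL1 from WSL2 is the kernel release string. A hedged sketch follows; the substrings matched are assumptions about typical WSL kernels, not necessarily what the install script checks:

```shell
# Classify the environment by kernel release string (illustrative heuristic).
kernel=$(uname -r)
case "$kernel" in
  *microsoft-standard*) env_kind="WSL2" ;;   # WSL2 kernels usually carry this suffix
  *[Mm]icrosoft*)       env_kind="WSL1" ;;   # older WSL1 kernels mention Microsoft
  *)                    env_kind="not WSL" ;;
esac
echo "$env_kind"
```

On a non-WSL Linux box this prints `not WSL`; the `case` ordering matters because the WSL2 pattern is a subset of the broader `microsoft` match.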
- 03 Jan, 2024 2 commits
  - Daniel Hiltgen authored
    For the ROCm libraries to access the driver, we need to add the ollama user to the render group.
  - Jeffrey Morgan authored
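Group membership changes like the one above typically come down to a `usermod -aG` call. A non-destructive sketch that only prints the commands it would run (the `ollama` user and the render/video group names mirror the commits in this log; actually running `usermod` requires root):

```shell
# Print (rather than execute) the usermod calls a service installer might run.
user=ollama
for grp in render video; do
  if getent group "$grp" >/dev/null 2>&1; then
    echo "usermod -aG $grp $user"
  else
    echo "group $grp not present; skipping"
  fi
done
```

The `-a` flag is important: without it, `-G` would replace the user's supplementary groups instead of appending to them.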
- 23 Dec, 2023 1 commit
  - Daniel Hiltgen authored
    This should help CI avoid running the integration test logic in a container, where it's not currently possible.
- 22 Dec, 2023 3 commits
  - Jeffrey Morgan authored
  - Daniel Hiltgen authored
    By default, builds will now produce non-debug and non-verbose binaries. To enable verbose logs in llama.cpp and debug symbols in the native code, set `CGO_CFLAGS=-g`.
  - Daniel Hiltgen authored