Commits · 485016bfbb30325c610a2e5071d282fab640ec28 · OpenDAS / ollama

26 May, 2024 1 commit
- Update install.sh · 485016bf
  Jeffrey Morgan authored May 26, 2024
  
  485016bf
01 May, 2024 1 commit
- Support Fedoras standard ROCm location · e592e8fc
  Daniel Hiltgen authored May 01, 2024
  
  e592e8fc
27 Apr, 2024 2 commits
- Use architecture specific folders in installer script · 6d3152a9
  Hernan Martinez authored Apr 26, 2024
  
  6d3152a9
- Use architecture specific folders in the build script · 204349b1
  Hernan Martinez authored Apr 26, 2024
  
  204349b1
26 Apr, 2024 2 commits
- Fix exe name for zip packaging on windows · 40bc4622
  Daniel Hiltgen authored Apr 26, 2024
```
The zip file encodes the OS and architecture, so keep the short exe name
```
  40bc4622
- Move cuda/rocm dependency gathering into generate script · 8feb97dc
  Daniel Hiltgen authored Apr 25, 2024
```
This will make it simpler for CI to accumulate artifacts from prior steps
```
  8feb97dc
23 Apr, 2024 1 commit

Move nested payloads to installer and zip file on windows · 058f6cd2

Daniel Hiltgen authored Apr 23, 2024

Now that the llm runner is an executable and not just a dll, more users are facing
problems with security policy configurations on windows that prevent users
writing to directories and then executing binaries from the same location.
This change removes payloads from the main executable on windows and shifts them
over to be packaged in the installer and discovered based on the executables location.
This also adds a new zip file for people who want to "roll their own" installation model.

058f6cd2

28 Mar, 2024 1 commit
- CI automation for tagging latest images · 539043f5
  Daniel Hiltgen authored Mar 28, 2024
  
  539043f5
26 Mar, 2024 2 commits
- change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) · 1b272d5b
  Patrick Devine authored Mar 26, 2024
  
  1b272d5b
- Use Rocky Linux Vault to get GCC 10.2 installed · b8c2be61
  Daniel Hiltgen authored Mar 25, 2024
```
This should hopefully only be a temporary workaround until Rocky 8
picks up GCC 10.4 which fixes the NVCC bug
```
  b8c2be61
23 Mar, 2024 1 commit

Revamp go based integration tests · 949b6c01

Daniel Hiltgen authored Mar 23, 2024

This uplevels the integration tests to run the server which can allow
testing an existing server, or a remote server.

949b6c01

15 Mar, 2024 2 commits
- Wire up more complete CI for releases · 540f4af4
  Daniel Hiltgen authored Mar 07, 2024
```
Flesh out our github actions CI so we can build official releaes.
```
  540f4af4
- Add ROCm support to linux install script (#2966) · 6459377a
  Daniel Hiltgen authored Mar 14, 2024
  
  6459377a
11 Mar, 2024 1 commit
- use `-trimpath` when building releases (#3069) · b5fcd9d3
  Jeffrey Morgan authored Mar 11, 2024
  
  b5fcd9d3
10 Mar, 2024 1 commit
- only copy deps for `amd64` in `build_linux.sh` · cdf65e79
  Jeffrey Morgan authored Mar 09, 2024
  
  cdf65e79
07 Mar, 2024 1 commit

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

29 Feb, 2024 1 commit
- Add ollama user to video group · 74468513
  Daniel Hiltgen authored Feb 29, 2024
```
On OpenSUSE, ollama needs to be a member of the video group
to access the GPU
```
  74468513
27 Feb, 2024 1 commit
- Refine container image build script · 98e0b7e9
  Daniel Hiltgen authored Feb 26, 2024
```
Allow overriding the platform, image name, and tag latest for
standard and rocm images.
```
  98e0b7e9
22 Feb, 2024 2 commits
- restore windows build flags and compression · 275ea015
  Jeffrey Morgan authored Feb 22, 2024
  
  275ea015
- fix `build_windows.ps1` script to run `go build` with the correct flags · 8782dd56
  Jeffrey Morgan authored Feb 22, 2024
  
  8782dd56
21 Feb, 2024 3 commits
- Update install.sh success message · f983ef7f
  Josh authored Feb 21, 2024
  
  f983ef7f
- Windows build + installer adjustments (#2656) · 1ae1c336
  Jeffrey Morgan authored Feb 21, 2024
```
* remove `-w -s` linker flags on windows

* use `zip` for windows installer compression
```
  1ae1c336
- add `dist` directory in `build_windows.ps` · 92423b06
  Jeffrey Morgan authored Feb 21, 2024
  
  92423b06
16 Feb, 2024 1 commit
- Fix duplicate menus on update and exit on signals · df6dc4fd
  Daniel Hiltgen authored Feb 16, 2024
```
Also fixes a few fit-and-finish items for better developer experience
```
  df6dc4fd
15 Feb, 2024 4 commits
- Prepare to distribute standalone windows executable · 272e53a1
  Daniel Hiltgen authored Feb 15, 2024
```
This will be useful for our automated test riggig, and may be useful for
advanced users who want to "roll their own" system service
```
  272e53a1
- set exe metadata using resource files · 7ad9844a
  jmorganca authored Feb 15, 2024
  
  7ad9844a
- Implement new Go based Desktop app · 29e90cc1
  Daniel Hiltgen authored Dec 26, 2023
```
This focuses on Windows first, but coudl be used for Mac
and possibly linux in the future.
```
  29e90cc1
- Move Mac App to a new dir · 9da9e8fb
  Daniel Hiltgen authored Feb 13, 2024
  
  9da9e8fb
09 Feb, 2024 1 commit
- Update domain name references in docs and install script (#2435) · 1c8435ff
  Jeffrey Morgan authored Feb 09, 2024
  
  1c8435ff
26 Jan, 2024 1 commit
- Add back ROCm container support · 75c44aa3
  Daniel Hiltgen authored Jan 25, 2024
```
This adds ROCm support back as a discrete image.
```
  75c44aa3
23 Jan, 2024 1 commit

Set a default version using git describe · 3005ec74

Daniel Hiltgen authored Jan 22, 2024

If a VERSION is not specified, this will generate a version string that
represents the state of the repo.  For example `0.1.21-12-gffaf52e1-dirty`
representing 12 commits away from 0.1.21 tag, on commit gffaf52e1
and the tree is dirty.

3005ec74

21 Jan, 2024 2 commits

Make CPU builds parallel and customizable AMD GPUs · df54c723

Daniel Hiltgen authored Jan 21, 2024

The linux build now support parallel CPU builds to speed things up.
This also exposes AMD GPU targets as an optional setting for advaced
users who want to alter our default set.

df54c723

Combine the 2 Dockerfiles and add ROCm · da72235e

Daniel Hiltgen authored Jan 21, 2024

This renames Dockerfile.build to Dockerfile, and adds some new stages
to support 2 modes of building - the build_linux.sh script uses
intermediate stages to extract the artifacts for ./dist, and the default
build generates a container image usable by both cuda and rocm cards.
This required transitioniing the x86 base to the rocm image to avoid
layer bloat.

da72235e

19 Jan, 2024 1 commit
- use `gzip` for runner embedding (#2067) · dc88cc39
  Jeffrey Morgan authored Jan 19, 2024
  
  dc88cc39
17 Jan, 2024 1 commit
- Add multiple CPU variants for Intel Mac · 1b249748
  Daniel Hiltgen authored Jan 12, 2024
```
This also refines the build process for the ext_server build.
```
  1b249748
16 Jan, 2024 1 commit

install: pin fedora to max 37 · d9bfb2f0

Michael Yang authored Jan 16, 2024

repos for fedora 38 and newer do not exist as of this commit

```
$ dnf config-manager --add-repo https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo
Adding repo from: https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo
Status code: 404 for https://developer.download.nvidia.com/compute/cuda/repos/fedora38/x86_64/cuda-fedora38.repo (IP: 152.195.19.142)
Error: Configuration of repo failed
```

d9bfb2f0

11 Jan, 2024 2 commits

Build multiple CPU variants and pick the best · d88c527b

Daniel Hiltgen authored Jan 07, 2024

This reduces the built-in linux version to not use any vector extensions
which enables the resulting builds to run under Rosetta on MacOS in
Docker. Then at runtime it checks for the actual CPU vector
extensions and loads the best CPU library available

d88c527b

DRY out the Dockefile.build · 052b33b8
Daniel Hiltgen authored Jan 06, 2024

052b33b8

10 Jan, 2024 1 commit

Support optional override of the target archictures · 9754ae4c

Daniel Hiltgen authored Jan 10, 2024

This can help speed up incremental builds when you're only testing one
archicture, like amd64. E.g.
BUILD_ARCH=amd64 ./scripts/build_linux.sh && scp ./dist/ollama-linux-amd64 test-system:

9754ae4c

09 Jan, 2024 1 commit
- clean up cmake `build` directory when cross compiling macOS builds · 34344d80
  Jeffrey Morgan authored Jan 09, 2024
  
  34344d80