- 19 Aug, 2024 6 commits
  - Daniel Hiltgen authored
  - Daniel Hiltgen authored
  - Daniel Hiltgen authored
    Based on compute capability and driver version, pick v12 or v11 cuda variants.
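A minimal sketch of what this kind of selection could look like; the function name, the exact cutoffs, and the variant labels here are illustrative assumptions, not the actual implementation:

```go
package main

import "fmt"

// pickCUDAVariant chooses which bundled CUDA runtime variant to load.
// The cutoffs below are hypothetical: older drivers or very old compute
// capabilities fall back to the v11 build, everything newer gets v12.
func pickCUDAVariant(computeMajor, driverMajor int) string {
	// Assumption: v12 runtimes require a CUDA 12 series driver and a
	// reasonably recent compute capability.
	if driverMajor < 12 || computeMajor < 6 {
		return "cuda_v11"
	}
	return "cuda_v12"
}

func main() {
	fmt.Println(pickCUDAVariant(8, 12)) // recent GPU on a CUDA 12 driver
	fmt.Println(pickCUDAVariant(5, 11)) // older GPU/driver pair
}
```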
  - Daniel Hiltgen authored
    This adds new variants for arm64 specific to Jetson platforms.
  - Daniel Hiltgen authored
    This should help speed things up a little.
  - Daniel Hiltgen authored
    This adjusts linux to follow a similar model to windows, with a discrete archive (zip/tgz) to carry the primary executable and dependent libraries. Runners are still carried as payloads inside the main binary. Darwin retains the payload model, where the go binary is fully self-contained.
- 22 Jul, 2024 1 commit
  - Daniel Hiltgen authored
- 17 Jul, 2024 1 commit
  - lreed authored
- 15 Jul, 2024 1 commit
  - Daniel Hiltgen authored
- 02 Jul, 2024 1 commit
  - Daniel Hiltgen authored
    The centos 7 arm mirrors have disappeared due to the EOL 2 days ago, and the vault sed workaround which works for x86 doesn't work for arm.
- 14 Jun, 2024 1 commit
  - Daniel Hiltgen authored
- 17 Apr, 2024 2 commits
- 11 Apr, 2024 1 commit
  - Daniel Hiltgen authored
- 01 Apr, 2024 1 commit
  - Daniel Hiltgen authored
    This should resolve a number of memory leak and stability defects by allowing us to isolate llama.cpp in a separate process, shut it down when idle, and gracefully restart it if it has problems. This also serves as a first step to being able to run multiple copies to support multiple models concurrently.
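The isolate-and-restart idea described above can be sketched roughly as follows; `runServer`, the restart limit, and the backoff are all hypothetical, not the actual ollama supervisor logic:

```go
package main

import (
	"fmt"
	"os/exec"
	"time"
)

// runServer keeps a runner subprocess alive, restarting it when it
// exits abnormally. A clean exit (e.g. an idle shutdown) ends the loop.
func runServer(path string, args []string, maxRestarts int) error {
	for attempt := 0; ; attempt++ {
		cmd := exec.Command(path, args...)
		err := cmd.Run()
		if err == nil {
			return nil // clean exit
		}
		if attempt >= maxRestarts {
			return fmt.Errorf("runner kept failing: %w", err)
		}
		time.Sleep(100 * time.Millisecond) // brief pause before restart
	}
}

func main() {
	// "false" always exits non-zero, exercising the restart path.
	err := runServer("false", nil, 2)
	fmt.Println(err != nil)
}
```

Because the runner is a child process, a crash inside llama.cpp takes down only the child, and its memory is fully reclaimed by the OS when it exits.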
- 28 Mar, 2024 1 commit
  - Daniel Hiltgen authored
- 26 Mar, 2024 2 commits
  - Patrick Devine authored
  - Daniel Hiltgen authored
    This reverts commit 5dacc1eb.
- 25 Mar, 2024 1 commit
  - Daniel Hiltgen authored
    We had started using rocky linux 8, but they've updated to GCC 10.3, which breaks NVCC. 10.2 is compatible (as is 10.4, but that's not available from rocky linux 8 repos yet).
- 21 Mar, 2024 1 commit
  - Bruce MacDonald authored
- 15 Mar, 2024 1 commit
  - Daniel Hiltgen authored
    Flesh out our github actions CI so we can build official releases.
- 11 Mar, 2024 1 commit
  - Jeffrey Morgan authored
- 10 Mar, 2024 1 commit
  - Daniel Hiltgen authored
- 07 Mar, 2024 2 commits
  - Daniel Hiltgen authored
    This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, which defaults to `~/.ollama`. The logic was already idempotent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself.
    We now build only a single ROCm version (latest major) on both windows and linux. Given the large size of ROCm's tensor files, we split the dependency out: it's bundled into the installer on windows, and is a separate download on linux. The linux install script is now smart: it detects the presence of AMD GPUs, looks to see if rocm v6 is already present, and if not downloads our dependency tar file.
    For linux discovery, we now use sysfs and check each GPU against what ROCm supports, so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For windows, we now use go's windows dynamic library loading logic to access the amdhip64.dll APIs to query the GPU information.
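The graceful-degradation check described above (compare each discovered GPU against what the ROCm build supports, whether the target name came from sysfs on linux or the amdhip64.dll APIs on windows) could look roughly like this; the supported gfx list and the function name are illustrative assumptions:

```go
package main

import (
	"fmt"
	"strings"
)

// supportedGfx is an illustrative subset of ROCm gfx targets; the real
// supported set depends on how the ROCm dependency was built.
var supportedGfx = map[string]bool{
	"gfx900": true, "gfx906": true, "gfx1030": true, "gfx1100": true,
}

// usableGPU reports whether a discovered GPU target is supported, so
// unsupported hardware can fall back to CPU instead of crashing inside
// llama.cpp+rocm.
func usableGPU(target string) bool {
	return supportedGfx[strings.ToLower(strings.TrimSpace(target))]
}

func main() {
	fmt.Println(usableGPU("gfx1030")) // supported in this sketch
	fmt.Println(usableGPU("gfx803"))  // unsupported: degrade to CPU
}
```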
  - Jeffrey Morgan authored
- 29 Feb, 2024 1 commit
  - Daniel Hiltgen authored
    Without this env var, podman's GPU logic doesn't map the GPU through.
- 26 Jan, 2024 2 commits
  - Daniel Hiltgen authored
    This adds ROCm support back as a discrete image.
  - Daniel Hiltgen authored
    The size increase for rocm support in the standard image is problematic. We'll revisit multiple tags for rocm support in a follow-up PR.
- 21 Jan, 2024 2 commits
  - Daniel Hiltgen authored
    The linux build now supports parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advanced users who want to alter our default set.
  - Daniel Hiltgen authored
    This renames Dockerfile.build to Dockerfile, and adds some new stages to support 2 modes of building: the build_linux.sh script uses intermediate stages to extract the artifacts for ./dist, and the default build generates a container image usable by both cuda and rocm cards. This required transitioning the x86 base to the rocm image to avoid layer bloat.
- 19 Dec, 2023 2 commits
  - Daniel Hiltgen authored
  - 65a authored
    The build tags rocm or cuda must be specified to both go generate and go build. ROCm builds should have ROCM_PATH set (and the ROCm SDK present) as well as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also used to switch VRAM detection between cuda and rocm implementations, using added "accelerator_foo.go" files which contain architecture-specific functions and variables. accelerator_none is used when no tags are set, and a helper function addRunner will ignore it if it is the chosen accelerator. Fix go generate commands; thanks @deadmeu for testing.
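Putting the message's build instructions together, a ROCm build of that era might have looked like the following; the ROCM_PATH value is an assumption (a common install location), while the CLBlast_DIR path is the one the commit message itself suggests:

```shell
export ROCM_PATH=/opt/rocm                      # ROCm SDK location (assumption)
export CLBlast_DIR=/usr/lib/cmake/CLBlast       # CLBlast cmake dir, per the message
go generate -tags rocm ./...
go build -tags rocm .
```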
- 01 Dec, 2023 1 commit
  - Michael Yang authored
    * docker: set PATH, LD_LIBRARY_PATH, and capabilities
    * example: update k8s gpu manifest
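A hedged illustration of the kind of Dockerfile change the first bullet describes; the specific paths and the capabilities variable are assumptions (standard NVIDIA container runtime conventions), not the actual Dockerfile:

```dockerfile
# Illustrative only: make the NVIDIA userspace tools and libraries visible,
# and declare which driver capabilities the container needs.
ENV PATH=/usr/local/nvidia/bin:$PATH
ENV LD_LIBRARY_PATH=/usr/local/nvidia/lib64
ENV NVIDIA_DRIVER_CAPABILITIES=compute,utility
```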
- 13 Oct, 2023 1 commit
  - Jeffrey Morgan authored
- 03 Oct, 2023 1 commit
  - Jeffrey Morgan authored
- 30 Sep, 2023 2 commits
  - Michael Yang authored
  - Jeffrey Morgan authored
- 29 Sep, 2023 1 commit
  - Michael Yang authored
- 27 Sep, 2023 1 commit
  - Jeffrey Morgan authored
- 22 Sep, 2023 1 commit
  - Michael Yang authored