Commits · 08600d5bec85b7fc74cb8a166d0365fc83360087 · OpenDAS / ollama

04 Apr, 2024 1 commit
- CI subprocess path fix · 08600d5b
  Daniel Hiltgen authored Apr 03, 2024
  
  08600d5b
03 Apr, 2024 1 commit
- Fix macOS builds on older SDKs (#3467) · cd135317
  Jeffrey Morgan authored Apr 03, 2024
  
  cd135317
02 Apr, 2024 1 commit
- Fix windows lint CI flakiness · 841adda1
  Daniel Hiltgen authored Apr 02, 2024
  
  841adda1
01 Apr, 2024 2 commits

Switch back to subprocessing for llama.cpp · 58d95cc9

Daniel Hiltgen authored Mar 14, 2024

This should resolve a number of memory leak and stability defects by allowing
us to isolate llama.cpp in a separate process and shutdown when idle, and
gracefully restart if it has problems. This also serves as a first step to be
able to run multiple copies to support multiple models concurrently.

58d95cc9

fix generate output · 1ec0df10
Michael Yang authored Apr 01, 2024

1ec0df10

28 Mar, 2024 2 commits
- Bump ROCm to 6.0.2 patch release · c91a4ebc
  Daniel Hiltgen authored Mar 27, 2024
  
  c91a4ebc
- CI windows gpu builds · b79c7e45
  Daniel Hiltgen authored Mar 28, 2024
```
If we're doing generate, test windows cuda and rocm as well
```
  b79c7e45
27 Mar, 2024 5 commits
- fix: workflows · 5255d0af
  Michael Yang authored Mar 27, 2024
  
  5255d0af
- stub stub · 8838ae78
  Michael Yang authored Mar 27, 2024
  
  8838ae78
- mangle arch · db75402a
  Michael Yang authored Mar 27, 2024
  
  db75402a
- only generate on changes to llm subdirectory · 1e85a140
  Michael Yang authored Mar 27, 2024
  
  1e85a140
- only generate cuda/rocm when changes to llm detected · 5b0c48d2
  Michael Yang authored Mar 27, 2024
  
  5b0c48d2
07 Mar, 2024 4 commits

fix ci · 2cb74e23
Michael Yang authored Mar 07, 2024

2cb74e23
no ci test on docs, examples · 72431031
Michael Yang authored Mar 07, 2024

72431031

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

update go to 1.22 in other places (#2975) · d481fb3c
Jeffrey Morgan authored Mar 07, 2024

d481fb3c

06 Feb, 2024 3 commits
- enable rocm builds · 46c847c4
  Michael Yang authored Feb 06, 2024
  
  46c847c4
- use linux runners · 92b1a21f
  Michael Yang authored Feb 06, 2024
  
  92b1a21f
- disable rocm builds · f06b99a4
  Michael Yang authored Feb 06, 2024
  
  f06b99a4
25 Jan, 2024 5 commits
- only generate gpu libs · a8c5413d
  Michael Yang authored Jan 19, 2024
  
  a8c5413d
- archive ollama binaries · 5580de45
  Michael Yang authored Dec 22, 2023
  
  5580de45
- build cuda and rocm · 946431d5
  Michael Yang authored Dec 22, 2023
  
  946431d5
- remove env setting · 06101260
  Michael Yang authored Jan 18, 2024
  
  06101260
- stub generate outputs for lint · 8e5d359a
  Michael Yang authored Jan 24, 2024
  
  8e5d359a
18 Jan, 2024 2 commits
- Go bump to v1.21 to pick up slog · ecbfc018
  Daniel Hiltgen authored Jan 18, 2024
  
  ecbfc018
- Disable arm64 for test phase · b992bf65
  Daniel Hiltgen authored Jan 17, 2024
```
The runners are x86 so we can only run binaries that match.
```
  b992bf65
17 Jan, 2024 1 commit
- Add multiple CPU variants for Intel Mac · 1b249748
  Daniel Hiltgen authored Jan 12, 2024
```
This also refines the build process for the ext_server build.
```
  1b249748
14 Jan, 2024 1 commit
- Add macos cross-compile CI coverage · b3035112
  Daniel Hiltgen authored Jan 14, 2024
  
  b3035112
12 Jan, 2024 1 commit
- update actions/setup-go · 6a5bfc2e
  purificant authored Jan 12, 2024
  
  6a5bfc2e
09 Jan, 2024 1 commit
- add lint and test on pull_request · 99725314
  Michael Yang authored Dec 15, 2023
  
  99725314