Commits · 4d4f75a8a8a349e73dfd85ec0737ad42f5171eb0 · OpenDAS / ollama

07 May, 2024 2 commits
- Revert "fix golangci workflow missing gofmt and goimports (#4190)" · 4d4f75a8
  Michael Yang authored May 07, 2024
```
This reverts commit 04f971c8.
```
  4d4f75a8
- fix golangci workflow missing gofmt and goimports (#4190) · 04f971c8
  alwqx authored May 08, 2024
  
  04f971c8
26 Apr, 2024 2 commits
- .github/workflows/test.yaml: add in-flight cancellations on new push (#3956) · 05489427
  Blake Mizerany authored Apr 26, 2024
```
Also, remove a superfluous 'go get'
```
  05489427
- use merge base for diff-tree · 6fef042f
  Michael Yang authored Apr 26, 2024
  
  6fef042f
23 Apr, 2024 2 commits

Move nested payloads to installer and zip file on windows · 058f6cd2

Daniel Hiltgen authored Apr 23, 2024

Now that the llm runner is an executable and not just a dll, more users are facing
problems with security policy configurations on windows that prevent users
writing to directories and then executing binaries from the same location.
This change removes payloads from the main executable on windows and shifts them
over to be packaged in the installer and discovered based on the executables location.
This also adds a new zip file for people who want to "roll their own" installation model.

058f6cd2

Make CI lint verbvose · 939d6a86
Daniel Hiltgen authored Apr 23, 2024

939d6a86

10 Apr, 2024 1 commit
- fix ci · 2b4ca6cf
  Michael Yang authored Apr 10, 2024
  
  2b4ca6cf
09 Apr, 2024 3 commits

Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) · 1524f323
Blake Mizerany authored Apr 09, 2024

1524f323

build.go: introduce a friendlier way to build Ollama (#3548) · fccf3eec

Blake Mizerany authored Apr 09, 2024

This commit introduces a more friendly way to build Ollama dependencies
and the binary without abusing `go generate` and removing the
unnecessary extra steps it brings with it.

This script also provides nicer feedback to the user about what is
happening during the build process.

At the end, it prints a helpful message to the user about what to do
next (e.g. run the new local Ollama).

fccf3eec

ci: use go-version-file · cb8352d6
Michael Yang authored Apr 09, 2024

cb8352d6

04 Apr, 2024 1 commit
- CI subprocess path fix · 08600d5b
  Daniel Hiltgen authored Apr 03, 2024
  
  08600d5b
03 Apr, 2024 1 commit
- Fix macOS builds on older SDKs (#3467) · cd135317
  Jeffrey Morgan authored Apr 03, 2024
  
  cd135317
02 Apr, 2024 1 commit
- Fix windows lint CI flakiness · 841adda1
  Daniel Hiltgen authored Apr 02, 2024
  
  841adda1
01 Apr, 2024 2 commits

Switch back to subprocessing for llama.cpp · 58d95cc9

Daniel Hiltgen authored Mar 14, 2024

This should resolve a number of memory leak and stability defects by allowing
us to isolate llama.cpp in a separate process and shutdown when idle, and
gracefully restart if it has problems. This also serves as a first step to be
able to run multiple copies to support multiple models concurrently.

58d95cc9

fix generate output · 1ec0df10
Michael Yang authored Apr 01, 2024

1ec0df10

28 Mar, 2024 2 commits
- Bump ROCm to 6.0.2 patch release · c91a4ebc
  Daniel Hiltgen authored Mar 27, 2024
  
  c91a4ebc
- CI windows gpu builds · b79c7e45
  Daniel Hiltgen authored Mar 28, 2024
```
If we're doing generate, test windows cuda and rocm as well
```
  b79c7e45
27 Mar, 2024 5 commits
- fix: workflows · 5255d0af
  Michael Yang authored Mar 27, 2024
  
  5255d0af
- stub stub · 8838ae78
  Michael Yang authored Mar 27, 2024
  
  8838ae78
- mangle arch · db75402a
  Michael Yang authored Mar 27, 2024
  
  db75402a
- only generate on changes to llm subdirectory · 1e85a140
  Michael Yang authored Mar 27, 2024
  
  1e85a140
- only generate cuda/rocm when changes to llm detected · 5b0c48d2
  Michael Yang authored Mar 27, 2024
  
  5b0c48d2
07 Mar, 2024 4 commits

fix ci · 2cb74e23
Michael Yang authored Mar 07, 2024

2cb74e23
no ci test on docs, examples · 72431031
Michael Yang authored Mar 07, 2024

72431031

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

update go to 1.22 in other places (#2975) · d481fb3c
Jeffrey Morgan authored Mar 07, 2024

d481fb3c

06 Feb, 2024 3 commits
- enable rocm builds · 46c847c4
  Michael Yang authored Feb 06, 2024
  
  46c847c4
- use linux runners · 92b1a21f
  Michael Yang authored Feb 06, 2024
  
  92b1a21f
- disable rocm builds · f06b99a4
  Michael Yang authored Feb 06, 2024
  
  f06b99a4
25 Jan, 2024 5 commits
- only generate gpu libs · a8c5413d
  Michael Yang authored Jan 19, 2024
  
  a8c5413d
- archive ollama binaries · 5580de45
  Michael Yang authored Dec 22, 2023
  
  5580de45
- build cuda and rocm · 946431d5
  Michael Yang authored Dec 22, 2023
  
  946431d5
- remove env setting · 06101260
  Michael Yang authored Jan 18, 2024
  
  06101260
- stub generate outputs for lint · 8e5d359a
  Michael Yang authored Jan 24, 2024
  
  8e5d359a
18 Jan, 2024 2 commits
- Go bump to v1.21 to pick up slog · ecbfc018
  Daniel Hiltgen authored Jan 18, 2024
  
  ecbfc018
- Disable arm64 for test phase · b992bf65
  Daniel Hiltgen authored Jan 17, 2024
```
The runners are x86 so we can only run binaries that match.
```
  b992bf65
17 Jan, 2024 1 commit
- Add multiple CPU variants for Intel Mac · 1b249748
  Daniel Hiltgen authored Jan 12, 2024
```
This also refines the build process for the ext_server build.
```
  1b249748
14 Jan, 2024 1 commit
- Add macos cross-compile CI coverage · b3035112
  Daniel Hiltgen authored Jan 14, 2024
  
  b3035112
12 Jan, 2024 1 commit
- update actions/setup-go · 6a5bfc2e
  purificant authored Jan 12, 2024
  
  6a5bfc2e
09 Jan, 2024 1 commit
- add lint and test on pull_request · 99725314
  Michael Yang authored Dec 15, 2023
  
  99725314