Commits · f9584deba5d7faa1ff547fca5ea4e19818394de8 · OpenDAS / ollama

08 Oct, 2024 1 commit

Daniel Hiltgen authored Oct 08, 2024

The recent change to applying patches leaves the submodule dirty based on
"new commits" being present. This ensures we clean up so the tree no longer
reports dirty after a `go generate ./...` run.

The Makefile was being a bit too aggressive in cleaning things up and would result in deleting the placeholder files which someone might accidentally commit.

f9584deb

17 Sep, 2024 1 commit

make patches git am-able · 7bd7b027

Michael Yang authored Sep 16, 2024

raw diffs can be applied using `git apply` but not with `git am`. git
patches, e.g. through `git format-patch` are both apply-able and am-able

7bd7b027

13 Sep, 2024 1 commit
- Fix incremental builds on linux (#6780) · 56b9af33
  Daniel Hiltgen authored Sep 13, 2024
```
scripts: fix incremental builds on linux or similar
```
  56b9af33
12 Sep, 2024 1 commit

Optimize container images for startup (#6547) · cd5c8f64

Daniel Hiltgen authored Sep 12, 2024

* Optimize container images for startup

This change adjusts how to handle runner payloads to support
container builds where we keep them extracted in the filesystem.
This makes it easier to optimize the cpu/cuda vs cpu/rocm images for
size, and should result in faster startup times for container images.

* Refactor payload logic and add buildx support for faster builds

* Move payloads around

* Review comments

* Converge to buildx based helper scripts

* Use docker buildx action for release

cd5c8f64

29 Aug, 2024 1 commit
- remove any unneeded build artifacts · 11018196
  Michael Yang authored Aug 29, 2024
  
  11018196
19 Aug, 2024 2 commits

Wire up ccache and pigz in the docker based build · c7bcb003
Daniel Hiltgen authored Aug 09, 2024
```
This should help speed things up a little
```
c7bcb003

Refactor linux packaging · 74d45f01

Daniel Hiltgen authored Jul 08, 2024

This adjusts linux to follow a similar model to windows with a discrete archive
(zip/tgz) to cary the primary executable, and dependent libraries. Runners are
still carried as payloads inside the main binary

Darwin retain the payload model where the go binary is fully self contained.

74d45f01

06 Jul, 2024 1 commit

llm: fix missing dylibs by restoring old build behavior on Linux and macOS (#5511) · 2cc854f8

Jeffrey Morgan authored Jul 05, 2024

* Revert "fix cmake build (#5505)"

This reverts commit 4fd5f352.

* llm: fix missing dylibs by restoring old build behavior

* crlf -> lf

2cc854f8

05 Jul, 2024 1 commit
- fix cmake build (#5505) · 4fd5f352
  Jeffrey Morgan authored Jul 05, 2024
  
  4fd5f352
25 Apr, 2024 1 commit
- Remove trailing spaces (#3889) · 5f73c087
  Roy Yang authored Apr 25, 2024
  
  5f73c087
01 Apr, 2024 1 commit

Switch back to subprocessing for llama.cpp · 58d95cc9

Daniel Hiltgen authored Mar 14, 2024

This should resolve a number of memory leak and stability defects by allowing
us to isolate llama.cpp in a separate process and shutdown when idle, and
gracefully restart if it has problems. This also serves as a first step to be
able to run multiple copies to support multiple models concurrently.

58d95cc9

25 Mar, 2024 1 commit
- add support for libcudart.so for CUDA devices (adds Jetson support) · dfc6721b
  Jeremy authored Mar 25, 2024
  
  dfc6721b
12 Mar, 2024 1 commit
- Adapt our build for imported server.cpp · 85129d3a
  Daniel Hiltgen authored Mar 12, 2024
  
  85129d3a
07 Mar, 2024 1 commit
- fix some typos (#2973) · 23ebe8fe
  John authored Mar 07, 2024
```
Signed-off-by: hishope <csqiye@126.com>
```
  23ebe8fe
29 Feb, 2024 1 commit

Omit build date from gzip headers · 76e5d9ec

Bernhard M. Wiedemann authored Feb 29, 2024

See https://reproducible-builds.org/ for why this is good.

This patch was done while working on reproducible builds for openSUSE.

76e5d9ec

02 Feb, 2024 1 commit

Harden generate patching model · e1f50377

Daniel Hiltgen authored Feb 01, 2024

Only apply patches if we have any, and make sure to cleanup
every file we patched at the end to leave the tree clean

e1f50377

25 Jan, 2024 1 commit
- Fix clearing kv cache between requests with the same prompt (#2186) · a64570dc
  Jeffrey Morgan authored Jan 25, 2024
```
* Fix clearing kv cache between requests with the same prompt

* fix powershell script
```
  a64570dc
20 Jan, 2024 2 commits
- Add compute capability 5.0, 7.5, and 8.0 · a447a083
  Daniel Hiltgen authored Jan 20, 2024
  
  a447a083
- sign dylibs on macOS (#2101) · 4c54f0dd
  Jeffrey Morgan authored Jan 19, 2024
  
  4c54f0dd
19 Jan, 2024 1 commit
- use `gzip` for runner embedding (#2067) · dc88cc39
  Jeffrey Morgan authored Jan 19, 2024
  
  dc88cc39
17 Jan, 2024 1 commit
- Add multiple CPU variants for Intel Mac · 1b249748
  Daniel Hiltgen authored Jan 12, 2024
```
This also refines the build process for the ext_server build.
```
  1b249748
13 Jan, 2024 2 commits
- add `gcc -lstdc++` flag for linux cpu (#1974) · 288ef8ff
  Jeffrey Morgan authored Jan 13, 2024
  
  288ef8ff
- use g++ to build `libext_server.so` on linux (#1972) · 4cf17990
  Jeffrey Morgan authored Jan 13, 2024
  
  4cf17990
11 Jan, 2024 1 commit

Build multiple CPU variants and pick the best · d88c527b

Daniel Hiltgen authored Jan 07, 2024

This reduces the built-in linux version to not use any vector extensions
which enables the resulting builds to run under Rosetta on MacOS in
Docker. Then at runtime it checks for the actual CPU vector
extensions and loads the best CPU library available

d88c527b

05 Jan, 2024 1 commit
- remove unused generate patches (#1810) · 3367b5f3
  Bruce MacDonald authored Jan 05, 2024
  
  3367b5f3
04 Jan, 2024 3 commits
- Cleaup stale submodule · 9983fa5f
  Daniel Hiltgen authored Jan 04, 2024
```
If the tree has a stale submodule, make sure we clean it up first
```
  9983fa5f
- Code shuffle to clean up the llm dir · 77d96da9
  Daniel Hiltgen authored Jan 04, 2024
  
  77d96da9
- update cmake flags for `amd64` macOS (#1780) · 29340c2e
  Jeffrey Morgan authored Jan 03, 2024
```
* update cmake flags for intel macOS

* remove `LLAMA_K_QUANTS`

* put back `CMAKE_OSX_DEPLOYMENT_TARGET` and disable `LLAMA_F16C`
```
  29340c2e
02 Jan, 2024 3 commits

Rename the ollama cmakefile · 738a8d12
Daniel Hiltgen authored Dec 24, 2023

738a8d12

Switch windows build to fully dynamic · d966b730

Daniel Hiltgen authored Dec 23, 2023

Refactor where we store build outputs, and support a fully dynamic loading
model on windows so the base executable has no special dependencies thus
doesn't require a special PATH.

d966b730

Refactor how we augment llama.cpp · 9a70aecc

Daniel Hiltgen authored Dec 22, 2023

This changes the model for llama.cpp inclusion so we're not applying a patch,
but instead have the C++ code directly in the ollama tree, which should make it
easier to refine and update over time.

9a70aecc

22 Dec, 2023 2 commits
- Quiet down llama.cpp logging by default · e5202eb6
  Daniel Hiltgen authored Dec 22, 2023
```
By default builds will now produce non-debug and non-verbose binaries.
To enable verbose logs in llama.cpp and debug symbols in the
native code, set `CGO_CFLAGS=-g`
```
  e5202eb6
- Remove CPU build, fixup linux build script · fa24e73b
  Daniel Hiltgen authored Dec 21, 2023
  
  fa24e73b
20 Dec, 2023 1 commit

Revamp the dynamic library shim · 7555ea44

Daniel Hiltgen authored Dec 20, 2023

This switches the default llama.cpp to be CPU based, and builds the GPU variants
as dynamically loaded libraries which we can select at runtime.

This also bumps the ROCm library to version 6 given 5.7 builds don't work
on the latest ROCm library that just shipped.

7555ea44

19 Dec, 2023 3 commits
- Build linux using ubuntu 20.04 · 89bbaafa
  Daniel Hiltgen authored Dec 18, 2023
```
This changes the container-based linux build to use an older Ubuntu
distro to improve our compatibility matrix for older user machines
```
  89bbaafa
- Adapted rocm support to cgo based llama.cpp · 35934b2e
  Daniel Hiltgen authored Nov 29, 2023
  
  35934b2e
- Add cgo implementation for llama.cpp · d4cd6957
  Daniel Hiltgen authored Nov 13, 2023
```
Run the server.cpp directly inside the Go runtime via cgo
while retaining the LLM Go abstractions.
```
  d4cd6957