Commits · de2fbdec991ac52ff015818b19482fdff22e2deb · OpenDAS / ollama

11 Jan, 2024 3 commits

Build multiple CPU variants and pick the best · d88c527b

Daniel Hiltgen authored Jan 07, 2024

This reduces the built-in linux version to not use any vector extensions
which enables the resulting builds to run under Rosetta on MacOS in
Docker. Then at runtime it checks for the actual CPU vector
extensions and loads the best CPU library available

d88c527b

DRY out the Dockefile.build · 052b33b8
Daniel Hiltgen authored Jan 06, 2024

052b33b8

Support multiple variants for a given llm lib type · 8da7bef0

Daniel Hiltgen authored Jan 05, 2024

In some cases we may want multiple variants for a given GPU type or CPU.
This adds logic to have an optional Variant which we can use to select
an optimal library, but also allows us to try multiple variants in case
some fail to load.

This can be useful for scenarios such as ROCm v5 vs v6 incompatibility
or potentially CPU features.

8da7bef0

05 Jan, 2024 1 commit
- update build · f9961c70
  Michael Yang authored Jan 04, 2024
  
  f9961c70
22 Dec, 2023 1 commit
- Remove CPU build, fixup linux build script · fa24e73b
  Daniel Hiltgen authored Dec 21, 2023
  
  fa24e73b
20 Dec, 2023 1 commit

Revamp the dynamic library shim · 7555ea44

Daniel Hiltgen authored Dec 20, 2023

This switches the default llama.cpp to be CPU based, and builds the GPU variants
as dynamically loaded libraries which we can select at runtime.

This also bumps the ROCm library to version 6 given 5.7 builds don't work
on the latest ROCm library that just shipped.

7555ea44

19 Dec, 2023 3 commits

Build linux using ubuntu 20.04 · 89bbaafa

Daniel Hiltgen authored Dec 18, 2023

This changes the container-based linux build to use an older Ubuntu
distro to improve our compatibility matrix for older user machines

89bbaafa

Adapted rocm support to cgo based llama.cpp · 35934b2e
Daniel Hiltgen authored Nov 29, 2023

35934b2e

Use build tags to generate accelerated binaries for CUDA and ROCm on Linux. · f8ef4439

65a authored Oct 16, 2023

The build tags rocm or cuda must be specified to both go generate and go build.
ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well
as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the
CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also
used to switch VRAM detection between cuda and rocm implementations, using
added "accelerator_foo.go" files which contain architecture specific functions
and variables. accelerator_none is used when no tags are set, and a helper
function addRunner will ignore it if it is the chosen accelerator. Fix go
generate commands, thanks @deadmeu for testing.

f8ef4439

13 Oct, 2023 2 commits
- use lower glibc versions in `Dockerfile.build` · d890890f
  Jeffrey Morgan authored Oct 13, 2023
  
  d890890f
- update `Dockerfile.build` for linux binary builds · 6f58c776
  Jeffrey Morgan authored Oct 12, 2023
  
  6f58c776
03 Oct, 2023 1 commit
- update `Dockerfile` to pass `GOFLAGS` · dc87e9c9
  Jeffrey Morgan authored Oct 03, 2023
  
  dc87e9c9
29 Sep, 2023 1 commit
- update build_darwin.sh · 92d454ec
  Michael Yang authored Sep 26, 2023
  
  92d454ec
22 Sep, 2023 1 commit

Add `Dockerfile.build` for building linux binaries (#558) · f997e29e

Jeffrey Morgan authored Sep 22, 2023



Add `Dockerfile.build` for building linux binaries

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f997e29e