Commits · 89bf98bcf2bcbbb018c9374c53137e9c7ab20f10 · OpenDAS / ollama

23 May, 2024 1 commit
- Tidy up developer guide a little · 1b2d1560
  Daniel Hiltgen authored May 23, 2024
  
  1b2d1560
01 May, 2024 1 commit
- chore: fix typo in docs/development.md (#4073) · 68755f1f
  alwqx authored May 02, 2024
  
  68755f1f
09 Apr, 2024 2 commits

Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) · 1524f323
Blake Mizerany authored Apr 09, 2024

1524f323

build.go: introduce a friendlier way to build Ollama (#3548) · fccf3eec

Blake Mizerany authored Apr 09, 2024

This commit introduces a friendlier way to build Ollama dependencies
and the binary without abusing `go generate`, which removes the
unnecessary extra steps it brings with it.

This script also provides nicer feedback to the user about what is
happening during the build process.

At the end, it prints a helpful message to the user about what to do
next (e.g. run the new local Ollama).

fccf3eec

26 Mar, 2024 1 commit
- remove need for `$VSINSTALLDIR` since build will fail if `ninja` cannot be found (#3350) · 856b8ec1
  Jeffrey Morgan authored Mar 26, 2024
  
  856b8ec1
25 Mar, 2024 1 commit
- Fix ROCm link in `development.md` · f38b705d
  Jeffrey Morgan authored Mar 25, 2024
  
  f38b705d
09 Mar, 2024 1 commit
- Doc how to set up ROCm builds on windows · 0fdebb34
  Daniel Hiltgen authored Mar 09, 2024
  
  0fdebb34
07 Mar, 2024 3 commits

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var that defaults to `~/.ollama`. The logic was already
idempotent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both Windows
and Linux. Given the large size of ROCm's tensor files, we split the
dependency out. It's bundled into the installer on Windows, and a
separate download on Linux. The Linux install script is now smart: it
detects the presence of AMD GPUs and looks to see if ROCm v6 is already
present, and if not, downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11
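The "check each GPU against what ROCm supports" step in the commit above can be sketched roughly as follows. This is an assumption-laden illustration, not the project's discovery code: the function names, the backend labels, and the architecture strings in `main` are all placeholders, and the real implementation reads its GPU list from sysfs.

```go
package main

import "fmt"

// supportedGPUs keeps only the detected GPU architectures that appear
// in the supported set. Both inputs are supplied by the caller.
func supportedGPUs(detected []string, supported map[string]bool) []string {
	var ok []string
	for _, arch := range detected {
		if supported[arch] {
			ok = append(ok, arch)
		}
	}
	return ok
}

// chooseBackend returns "rocm" when at least one detected GPU is
// supported, and "cpu" otherwise, so an unsupported GPU degrades to
// CPU instead of crashing later.
func chooseBackend(detected []string, supported map[string]bool) string {
	if len(supportedGPUs(detected, supported)) > 0 {
		return "rocm"
	}
	return "cpu"
}

func main() {
	// Placeholder data for illustration only.
	supported := map[string]bool{"gfx1030": true, "gfx1100": true}
	fmt.Println(chooseBackend([]string{"gfx803"}, supported))
	fmt.Println(chooseBackend([]string{"gfx803", "gfx1030"}, supported))
}
```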

update go to 1.22 in other places (#2975) · d481fb3c
Jeffrey Morgan authored Mar 07, 2024

d481fb3c
fix some typos (#2973) · 23ebe8fe
John authored Mar 07, 2024
```
Signed-off-by: hishope <csqiye@126.com>
```
23ebe8fe

21 Jan, 2024 1 commit

Make CPU builds parallel and customizable AMD GPUs · df54c723

Daniel Hiltgen authored Jan 21, 2024

The Linux build now supports parallel CPU builds to speed things up.
This also exposes AMD GPU targets as an optional setting for advanced
users who want to alter our default set.

df54c723

20 Jan, 2024 1 commit
- Add compute capability 5.0, 7.5, and 8.0 · a447a083
  Daniel Hiltgen authored Jan 20, 2024
  
  a447a083
18 Jan, 2024 3 commits
- Go bump to v1.21 to pick up slog · ecbfc018
  Daniel Hiltgen authored Jan 18, 2024
  
  ecbfc018
- Mechanical switch from log to slog · fedd705a
  Daniel Hiltgen authored Jan 18, 2024
```
A few obvious levels were adjusted, but generally everything mapped to "info" level.
```
  fedd705a
- Refine the linux cuda/rocm developer docs · 9cd20b0e
  Daniel Hiltgen authored Jan 18, 2024
  
  9cd20b0e
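For readers unfamiliar with the `log` to `slog` switch in fedd705a, here is a minimal sketch of what such a mechanical conversion looks like using Go's standard `log/slog` package (available since Go 1.21). The message text, the `model` key, and the helper `slogLine` are made up for illustration; the timestamp is dropped only so the output is deterministic.

```go
package main

import (
	"bytes"
	"fmt"
	"log"
	"log/slog"
)

// slogLine emits one structured info-level record and returns the
// rendered text. The time attribute is removed so the line is stable.
func slogLine(model string) string {
	var buf bytes.Buffer
	opts := &slog.HandlerOptions{
		ReplaceAttr: func(groups []string, a slog.Attr) slog.Attr {
			if a.Key == slog.TimeKey && len(groups) == 0 {
				return slog.Attr{} // a zero Attr discards the attribute
			}
			return a
		},
	}
	logger := slog.New(slog.NewTextHandler(&buf, opts))
	logger.Info("loading model", "model", model)
	return buf.String()
}

func main() {
	// Before: an unstructured message with the value formatted inline.
	log.Printf("loading model %s", "example")

	// After: a leveled message with the value as a key/value attribute.
	fmt.Print(slogLine("example"))
}
```

The structured form keeps the message constant and moves the variable parts into attributes, which is what makes level filtering and machine parsing practical.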
11 Jan, 2024 1 commit

Build multiple CPU variants and pick the best · d88c527b

Daniel Hiltgen authored Jan 07, 2024

This reduces the built-in Linux build to use no vector extensions,
which lets the resulting binaries run under Rosetta on macOS in
Docker. At runtime, it then checks for the actual CPU vector
extensions and loads the best CPU library available.

d88c527b
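The runtime selection described in d88c527b can be sketched as below. This is a simplified illustration under stated assumptions: the `cpuFeatures` type, the `bestVariant` function, and the variant names are hypothetical, and the sketch considers only AVX and AVX2. In a real program the flags could come from a feature-detection package such as `golang.org/x/sys/cpu`.

```go
package main

import "fmt"

// cpuFeatures holds the vector extensions detected at runtime.
type cpuFeatures struct {
	AVX  bool
	AVX2 bool
}

// bestVariant picks the most capable library variant the CPU can run,
// falling back to a baseline build with no vector extensions.
// The returned names are placeholders for this sketch.
func bestVariant(f cpuFeatures) string {
	switch {
	case f.AVX2:
		return "cpu_avx2"
	case f.AVX:
		return "cpu_avx"
	default:
		return "cpu"
	}
}

func main() {
	// Under an emulator without vector extensions, the baseline wins.
	fmt.Println(bestVariant(cpuFeatures{}))
	fmt.Println(bestVariant(cpuFeatures{AVX: true, AVX2: true}))
}
```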

25 Dec, 2023 1 commit
- Add windows native build instructions · e201efa1
  Daniel Hiltgen authored Dec 24, 2023
  
  e201efa1
22 Dec, 2023 1 commit

Quiet down llama.cpp logging by default · e5202eb6

Daniel Hiltgen authored Dec 22, 2023

By default, builds will now produce non-debug and non-verbose binaries.
To enable verbose logs in llama.cpp and debug symbols in the
native code, set `CGO_CFLAGS=-g`.

e5202eb6

19 Dec, 2023 1 commit

Refine build to support CPU only · 1b991d0b

Daniel Hiltgen authored Dec 13, 2023

If someone checks out the ollama repo and doesn't install the CUDA
library, this will ensure they can build a CPU-only version.
1b991d0b

01 Oct, 2023 1 commit
- add some missing code directives in docs (#664) · 4fc10acc
  Jiayu Liu authored Oct 02, 2023
  
  4fc10acc
20 Sep, 2023 4 commits
- embed libraries using cmake · 6c6a31a1
  Michael Yang authored Sep 20, 2023
  
  6c6a31a1
- remove libcuda.so · fc6ec356
  Bruce MacDonald authored Sep 20, 2023
  
  fc6ec356
- only package 11.8 runner · 1255bc9b
  Bruce MacDonald authored Sep 20, 2023
  
  1255bc9b
- pack in cuda libs · 4e8be787
  Bruce MacDonald authored Sep 20, 2023
  
  4e8be787
14 Sep, 2023 1 commit

support for packaging in multiple cuda runners (#509) · 2540c918

Bruce MacDonald authored Sep 14, 2023



* enable packaging multiple cuda versions
* use nvcc cuda version if available

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

2540c918
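One way to "use nvcc cuda version if available", as the commit above puts it, is to parse the `release X.Y` token that `nvcc --version` prints. The following is a hedged sketch rather than the build script's actual logic: the function name `parseNvccVersion` and the fallback message are assumptions.

```go
package main

import (
	"fmt"
	"os/exec"
	"regexp"
)

// releaseRe matches the "release 11.8" style token in nvcc's output.
var releaseRe = regexp.MustCompile(`release (\d+\.\d+)`)

// parseNvccVersion extracts the major.minor CUDA version from the text
// printed by `nvcc --version`. It reports false when no version is found.
func parseNvccVersion(output string) (string, bool) {
	m := releaseRe.FindStringSubmatch(output)
	if m == nil {
		return "", false
	}
	return m[1], true
}

func main() {
	out, err := exec.Command("nvcc", "--version").Output()
	if err != nil {
		// nvcc is missing or failed; a build would use its own default.
		fmt.Println("nvcc not available; falling back to a default CUDA version")
		return
	}
	if v, ok := parseNvccVersion(string(out)); ok {
		fmt.Println("detected CUDA version:", v)
	} else {
		fmt.Println("could not parse nvcc output")
	}
}
```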

12 Sep, 2023 1 commit

first pass at linux gpu support (#454) · f2216370

Bruce MacDonald authored Sep 12, 2023



* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f2216370

30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

25 Aug, 2023 1 commit
- update README.md · 041f9ad1
  Michael Yang authored Aug 25, 2023
  
  041f9ad1
08 Aug, 2023 1 commit
- docs: format with `prettier` · 1f78e409
  Jeffrey Morgan authored Aug 08, 2023
  
  1f78e409
24 Jul, 2023 1 commit
- update development.md · 24e43e32
  Michael Yang authored Jul 24, 2023
  
  24e43e32
21 Jul, 2023 1 commit
- Note that CGO must be enabled in dev docs · 52f04e39
  Bruce MacDonald authored Jul 21, 2023
  
  52f04e39
18 Jul, 2023 1 commit
- Some simple modelfile examples · 3d9498dc
  Matt Williams authored Jul 17, 2023
```
Signed-off-by: Matt Williams <m@technovangelist.com>
```
  3d9498dc
07 Jul, 2023 1 commit
- add publish script · 1358e27b
  Jeffrey Morgan authored Jul 07, 2023
  
  1358e27b
28 Jun, 2023 4 commits
- update development.md · 98119569
  Michael Yang authored Jun 28, 2023
  
  98119569
- move desktop docs to `desktop/` · 9ba58c8a
  Jeffrey Morgan authored Jun 28, 2023
  
  9ba58c8a
- move desktop docs to `desktop/` · 9f868d82
  Jeffrey Morgan authored Jun 28, 2023
  
  9f868d82
- poetry development · 4018b3c5
  Bruce MacDonald authored Jun 28, 2023
  
  4018b3c5
27 Jun, 2023 3 commits
- simplify loading · ecfb4aba
  Bruce MacDonald authored Jun 27, 2023
  
  ecfb4aba
- Update development.md · 2906cbab
  Michael Chiang authored Jun 27, 2023
  
  2906cbab
- Update development.md · 9d14e751
  Michael Chiang authored Jun 27, 2023
  
  9d14e751