Commits · 5e7fd6906f4653fa671aa5d2e2d4dd5bdf17fd36 · OpenDAS / ollama

19 Dec, 2023 1 commit

Bruce MacDonald authored Nov 24, 2023



- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in the case automatic pull fails
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>

811b1f03

21 Nov, 2023 1 commit
- update llama.cpp · a00fac4e
  Michael Yang authored Nov 21, 2023
  
  a00fac4e
24 Oct, 2023 1 commit
- fix metal assertion errors · b0c9cd0f
  Jeffrey Morgan authored Oct 24, 2023
  
  b0c9cd0f
23 Oct, 2023 1 commit
- update default log target · c9167494
  Michael Yang authored Oct 23, 2023
  
  c9167494
06 Oct, 2023 1 commit
- rename server subprocess (#700) · 5d22319a
  Bruce MacDonald authored Oct 06, 2023
  
  - this makes it easier to see that the subprocess is associated with ollama
  
  5d22319a
21 Sep, 2023 1 commit
- silence warm up log · 058d0cd0
  Michael Yang authored Sep 21, 2023
  
  058d0cd0
20 Sep, 2023 1 commit
- embed libraries using cmake · 6c6a31a1
  Michael Yang authored Sep 20, 2023
  
  6c6a31a1
18 Sep, 2023 1 commit

subprocess improvements (#524) · 66003e1d

Bruce MacDonald authored Sep 18, 2023

* subprocess improvements

- increase start-up timeout
- when a runner fails to start, fail rather than timing out
- try runners in order rather than choosing 1 runner
- embed metal runner in metal dir rather than gpu
- refactor logging and error messages

* Update llama.go

* Update llama.go

* simplify by using glob

66003e1d

12 Sep, 2023 1 commit

first pass at linux gpu support (#454) · f2216370

Bruce MacDonald authored Sep 12, 2023



* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f2216370

07 Sep, 2023 1 commit
- GGUF support (#441) · 09dd2aef
  Bruce MacDonald authored Sep 07, 2023
  
  09dd2aef
06 Sep, 2023 1 commit
- macos `amd64` compatibility fixes · 213ffdb5
  Jeffrey Morgan authored Sep 05, 2023
  
  213ffdb5
05 Sep, 2023 2 commits
- metal: add missing barriers for mul-mat (#469) · d18282bf
  Bruce MacDonald authored Sep 05, 2023
  
  d18282bf
- generate binary dependencies based on GOARCH on macos (#459) · 7fa6e516
  Jeffrey Morgan authored Sep 05, 2023
  
  7fa6e516
30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79