Commits · ee4fd16f2c02a643e70c5393f7bb27cfda58671f · OpenDAS / ollama

20 Sep, 2023 6 commits
- rename generate.go · a9ed7cc6
  Michael Yang authored Sep 20, 2023
  
  a9ed7cc6
- embed libraries using cmake · 6c6a31a1
  Michael Yang authored Sep 20, 2023
  
  6c6a31a1
- remove libcuda.so · fc6ec356
  Bruce MacDonald authored Sep 20, 2023
  
  fc6ec356
- only package 11.8 runner · 1255bc9b
  Bruce MacDonald authored Sep 20, 2023
  
  1255bc9b
- use cuda_version · b9bb5ca2
  Bruce MacDonald authored Sep 20, 2023
  
  b9bb5ca2
- pack in cuda libs · 4e8be787
  Bruce MacDonald authored Sep 20, 2023
  
  4e8be787
18 Sep, 2023 1 commit

subprocess improvements (#524) · 66003e1d

Bruce MacDonald authored Sep 18, 2023

* subprocess improvements

- increase start-up timeout
- when runner fails to start fail rather than timing out
- try runners in order rather than choosing 1 runner
- embed metal runner in metal dir rather than gpu
- refactor logging and error messages

* Update llama.go

* Update llama.go

* simplify by using glob

66003e1d

14 Sep, 2023 1 commit

support for packaging in multiple cuda runners (#509) · 2540c918

Bruce MacDonald authored Sep 14, 2023



* enable packaging multiple cuda versions
* use nvcc cuda version if available

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

2540c918

12 Sep, 2023 2 commits
- fix ggml arm64 cuda build (#520) · f59c4d03
  Bruce MacDonald authored Sep 12, 2023
  
  f59c4d03
- first pass at linux gpu support (#454) · f2216370
  Bruce MacDonald authored Sep 12, 2023
```
* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
```
  f2216370
07 Sep, 2023 1 commit
- GGUF support (#441) · 09dd2aef
  Bruce MacDonald authored Sep 07, 2023
  
  09dd2aef
06 Sep, 2023 2 commits
- set minimum `CMAKE_OSX_DEPLOYMENT_TARGET` to 11.0 · 61dda6a5
  Jeffrey Morgan authored Sep 06, 2023
  
  61dda6a5
- macos `amd64` compatibility fixes · 213ffdb5
  Jeffrey Morgan authored Sep 05, 2023
  
  213ffdb5
05 Sep, 2023 2 commits
- metal: add missing barriers for mul-mat (#469) · d18282bf
  Bruce MacDonald authored Sep 05, 2023
  
  d18282bf
- generate binary dependencies based on GOARCH on macos (#459) · 7fa6e516
  Jeffrey Morgan authored Sep 05, 2023
  
  7fa6e516
30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

26 Aug, 2023 1 commit
- add missing entries for 34B · 177b69a2
  Jeffrey Morgan authored Aug 25, 2023
  
  177b69a2
25 Aug, 2023 1 commit
- patch llama.cpp for 34B · 7a378f8b
  Michael Yang authored Aug 25, 2023
  
  7a378f8b
14 Aug, 2023 1 commit
- update llama.cpp · f7b61333
  Michael Yang authored Aug 14, 2023
  
  f7b61333
13 Aug, 2023 1 commit
- update `llama.cpp` to `f64d44a` · 22885aea
  Jeffrey Morgan authored Aug 12, 2023
  
  22885aea
10 Aug, 2023 1 commit
- partial decode ggml bin for more info · fccf8d17
  Michael Yang authored Jul 21, 2023
  
  fccf8d17
03 Aug, 2023 1 commit
- update llama.cpp · c5bcf328
  Michael Yang authored Aug 03, 2023
  
  c5bcf328
01 Aug, 2023 1 commit
- update llama.cpp · 7a1c3e62
  Michael Yang authored Aug 01, 2023
  
  7a1c3e62
28 Jul, 2023 1 commit
- update `llama.cpp` to `d91f3f0` · dffc8b6e
  Jeffrey Morgan authored Jul 28, 2023
  
  dffc8b6e
27 Jul, 2023 1 commit
- update llama.cpp · 18ffeeec
  Michael Yang authored Jul 25, 2023
  
  18ffeeec
20 Jul, 2023 1 commit
- update llama.cpp to e782c9e735f93ab4767ffc37462c523b73a17ddc · a83eaa7a
  Michael Yang authored Jul 19, 2023
  
  a83eaa7a
11 Jul, 2023 1 commit
- vendor llama.cpp · 442dec1c
  Michael Yang authored Jul 11, 2023
  
  442dec1c