Commits · 42998d797d79c790354625d387b50e692c1e27f0 · OpenDAS / ollama

30 Aug, 2023 1 commit
- subprocess llama.cpp server (#401) · 42998d79
  Bruce MacDonald authored Aug 30, 2023

  * remove c code
  * pack llama.cpp
  * use request context for llama_cpp
  * let llama_cpp decide the number of threads to use
  * stop llama runner when app stops
  * remove sample count and duration metrics
  * use go generate to get libraries
  * tmp dir for running llm

  42998d79
29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
22 Aug, 2023 3 commits
- build release mode · 95187d7e
  Michael Yang authored Aug 22, 2023
  
  95187d7e
- fix `FROM` instruction erroring when referring to a file · a9f6c566
  Jeffrey Morgan authored Aug 22, 2023
  
  a9f6c566
- Strip protocol from model path (#377) · 0a892419
  Ryan Baker authored Aug 21, 2023
  
  0a892419
15 Aug, 2023 1 commit
- use loaded llm for embeddings · 326de489
  Bruce MacDonald authored Aug 15, 2023
  
  326de489
11 Aug, 2023 1 commit
- add maximum retries when pushing (#334) · d9cf18e2
  Patrick Devine authored Aug 11, 2023
  
  d9cf18e2
10 Aug, 2023 4 commits
- clean up cli flags · 040a5b97
  Jeffrey Morgan authored Aug 10, 2023
  
  040a5b97
- implement loading ggml lora adapters through the modelfile · 6de5d032
  Michael Yang authored Aug 03, 2023
  
  6de5d032
- partial decode ggml bin for more info · fccf8d17
  Michael Yang authored Jul 21, 2023
  
  fccf8d17
- embeddings endpoint · 4b3507f0
  Bruce MacDonald authored Aug 08, 2023
```
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  4b3507f0
09 Aug, 2023 3 commits
- allow for concurrent pulls of the same files · 868e3b31
  Bruce MacDonald authored Jul 25, 2023
  
  868e3b31
- fix build errors · 09d8bf67
  Bruce MacDonald authored Aug 09, 2023
  
  09d8bf67
- use content type `application/x-ndjson` for streaming responses · cff002b8
  Jeffrey Morgan authored Aug 08, 2023
  
  cff002b8
08 Aug, 2023 3 commits
- add `0.0.0.0` as an allowed origin by default · a027a7dd
  Jeffrey Morgan authored Aug 08, 2023
```
Fixes #282
```
  a027a7dd
- pr comments · 21ddcaa1
  Bruce MacDonald authored Aug 08, 2023
```
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
```
  21ddcaa1
- embed text document in modelfile · a6f6d18f
  Bruce MacDonald authored Aug 04, 2023
  
  a6f6d18f
07 Aug, 2023 2 commits
- automatically set num_keep if num_keep < 0 · 4dc5b117
  Michael Yang authored Aug 07, 2023

  num_keep defines how many tokens to keep in the context when truncating
  inputs. If left at its default value of -1, the server calculates
  num_keep as the length of the system instructions.

  4dc5b117
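
The num_keep behavior described in commit 4dc5b117 can be illustrated with a minimal Go sketch. This is not the code from the commit: the function names `resolveNumKeep` and `truncate`, the integer token representation, and the exact truncation strategy (keep the first num_keep tokens, then fill the remaining window with the most recent tokens) are assumptions made for illustration only.

```go
package main

import "fmt"

// resolveNumKeep is a hypothetical helper: a negative num_keep (the default
// of -1) is replaced by the number of tokens in the system instructions.
func resolveNumKeep(numKeep, systemTokens int) int {
	if numKeep < 0 {
		return systemTokens
	}
	return numKeep
}

// truncate is a hypothetical helper showing one way to shorten an input that
// exceeds the context window: the first numKeep tokens are preserved and the
// rest of the window is filled with the most recent tokens.
func truncate(tokens []int, numCtx, numKeep int) []int {
	if len(tokens) <= numCtx {
		return tokens
	}
	if numKeep < 0 {
		numKeep = 0
	}
	if numKeep > numCtx {
		numKeep = numCtx
	}
	out := make([]int, 0, numCtx)
	out = append(out, tokens[:numKeep]...)
	tail := numCtx - numKeep
	out = append(out, tokens[len(tokens)-tail:]...)
	return out
}

func main() {
	tokens := []int{1, 2, 3, 4, 5, 6, 7, 8, 9, 10}
	// Assume the system instructions occupy the first 3 tokens.
	numKeep := resolveNumKeep(-1, 3)
	fmt.Println(truncate(tokens, 6, numKeep)) // prints [1 2 3 8 9 10]
}
```

With the default of -1, the system instructions survive truncation while older conversation tokens are dropped first.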

- pass flags to `serve` to allow setting allowed-origins + host and port · fb593b7b
  cmiller01 authored Aug 07, 2023

  * resolves: https://github.com/jmorganca/ollama/issues/300 and https://github.com/jmorganca/ollama/issues/282
  * example usage:
  ```
  ollama serve --port 9999 --allowed-origins "http://foo.example.com,http://192.0.0.1"
  ```

  fb593b7b
03 Aug, 2023 1 commit
- server: compare options correctly · e3fb1fd3
  Jeffrey Morgan authored Aug 03, 2023
  
  e3fb1fd3
02 Aug, 2023 1 commit
- server: reset digest at end of generate · 03cff3a2
  Jeffrey Morgan authored Aug 02, 2023
  
  03cff3a2
01 Aug, 2023 4 commits
- use head to check heartbeat · 76599436
  Bruce MacDonald authored Aug 01, 2023
  
  76599436
- read runner parameter options from map · 1c5a8770
  Bruce MacDonald authored Aug 01, 2023
```
- read runner options from map to see what was specified explicitly and overwrite zero values
```
  1c5a8770
- allow specifying zero values in modelfile · daa0d1de
  Bruce MacDonald authored Jul 31, 2023
  
  daa0d1de
- cache loaded model · 528bafa5
  Jeffrey Morgan authored Jul 31, 2023
  
  528bafa5
31 Jul, 2023 1 commit
- log prediction failures · 671eec6d
  Bruce MacDonald authored Jul 31, 2023
  
  671eec6d
27 Jul, 2023 3 commits
- add session expiration · f62a8827
  Michael Yang authored Jul 19, 2023
  
  f62a8827
- add load duration · 32aec66e
  Michael Yang authored Jul 18, 2023
  
  32aec66e
- session id · 35af37a2
  Michael Yang authored Jul 18, 2023
  
  35af37a2
25 Jul, 2023 1 commit
- download models when creating from modelfile · 4c1caa37
  Bruce MacDonald authored Jul 25, 2023
  
  4c1caa37
24 Jul, 2023 1 commit
- add copy command (#191) · 4cb42ca5
  Patrick Devine authored Jul 24, 2023
  
  4cb42ca5
22 Jul, 2023 2 commits
- use gin-contrib/cors middleware · 8609db77
  Michael Yang authored Jul 21, 2023
  
  8609db77
- change error handler behavior and fix error when a model isn't found (#173) · 6d6b0d33
  Patrick Devine authored Jul 21, 2023
  
  6d6b0d33
21 Jul, 2023 1 commit
- allow pushing/pulling to insecure registries (#157) · 9f6e9786
  Patrick Devine authored Jul 21, 2023
  
  9f6e9786
20 Jul, 2023 6 commits
- add rm command for models (#151) · e7a393de
  Patrick Devine authored Jul 20, 2023
  
  e7a393de
- fix stream errors · 1f27d7f1
  Michael Yang authored Jul 20, 2023
  
  1f27d7f1
- suppress error when running list before pulling image · 09dc6273
  Bruce MacDonald authored Jul 20, 2023
  
  09dc6273
- remove unused code · 3ec4ebc5
  Bruce MacDonald authored Jul 20, 2023
  
  3ec4ebc5
- separate prompt into template and system · df146c41
  Michael Yang authored Jul 17, 2023
  
  df146c41
- allow relative paths in `FROM` instruction · 2d305fa9
  Jeffrey Morgan authored Jul 19, 2023
  
  2d305fa9