Commits · ec2a31e9b3785ff21f1f0a8df4b4cd351f43f8ea · OpenDAS / ollama

08 Nov, 2023 1 commit

support raw generation requests (#952) · ec2a31e9

Bruce MacDonald authored Nov 08, 2023

- add the optional `raw` generate request parameter to bypass prompt formatting and response context
-add raw request to docs

ec2a31e9

03 Nov, 2023 2 commits
- Restore system prompt on requests and default `num_keep` to `0` · 17678b72
  Jeffrey Morgan authored Nov 03, 2023
  
  17678b72
- Set `NumKeep` to `4` by default (#982) · 06589a3b
  Jeffrey Morgan authored Nov 02, 2023
  
  06589a3b
02 Nov, 2023 1 commit
- update default NumKeep · 6db3691b
  Michael Yang authored Nov 02, 2023
  
  6db3691b
19 Oct, 2023 1 commit

do not reload the running llm when runtime params change (#840) · fe6f3b48

Bruce MacDonald authored Oct 19, 2023

- only reload the running llm if the model has changed, or the options for loading the running model have changed
- rename loaded llm to runner to differentiate from loaded model image
- remove logic which keeps the first system prompt in the generation context

fe6f3b48

13 Oct, 2023 1 commit

improve api error handling (#781) · 6fe17813

Bruce MacDonald authored Oct 13, 2023

- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM

6fe17813

12 Oct, 2023 1 commit
- validate api options fields from map (#711) · 7804b8fa
  Bruce MacDonald authored Oct 12, 2023
  
  7804b8fa
11 Oct, 2023 1 commit
- optional parameter to not stream response (#639) · 274d5a5f
  Bruce MacDonald authored Oct 11, 2023
```
* update streaming request accept header
* add optional stream param to request bodies
```
  274d5a5f
05 Oct, 2023 1 commit
- output type parsed from modelfile (#678) · 2130c070
  Bruce MacDonald authored Oct 05, 2023
  
  2130c070
02 Oct, 2023 1 commit

Relay default values to llama runner (#672) · 1fbf3585

Bruce MacDonald authored Oct 02, 2023



* include seed in params for llama.cpp server and remove empty filter for temp

* relay default predict options to llama.cpp

- reorganize options to match predict request for readability

* omit empty stop

---------
Co-authored-by: hallh <hallh@users.noreply.github.com>

1fbf3585

29 Sep, 2023 1 commit
- remove unused push/pull params (#650) · a1b2d95f
  Bruce MacDonald authored Sep 29, 2023
  
  a1b2d95f
28 Sep, 2023 1 commit
- use int64 consistently · f40b3de7
  Michael Yang authored Sep 28, 2023
  
  f40b3de7
12 Sep, 2023 1 commit

first pass at linux gpu support (#454) · f2216370

Bruce MacDonald authored Sep 12, 2023



* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f2216370

06 Sep, 2023 1 commit
- add show command (#474) · 790d24eb
  Patrick Devine authored Sep 06, 2023
  
  790d24eb
31 Aug, 2023 1 commit
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
  0f541a03
30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
17 Aug, 2023 1 commit
- ignore nil map values · f723bf08
  Michael Yang authored Aug 17, 2023
  
  f723bf08
10 Aug, 2023 4 commits
- s/parmeter/parameter/ · f27bc261
  Michael Yang authored Aug 10, 2023
  
  f27bc261
- fix could not convert int · 81d8d7b7
  Michael Yang authored Aug 10, 2023
  
  81d8d7b7
- Token auth (#314) · be989d89
  Patrick Devine authored Aug 10, 2023
  
  be989d89
- embeddings endpoint · 4b3507f0
  Bruce MacDonald authored Aug 08, 2023
```
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  4b3507f0
08 Aug, 2023 2 commits
- pr comments · 21ddcaa1
  Bruce MacDonald authored Aug 08, 2023
```
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
```
  21ddcaa1
- allow overriding `template` and `system` in `/api/generate` · 8713ac23
  Jeffrey Morgan authored Aug 08, 2023
```
Fixes #297
Fixes #296
```
  8713ac23
07 Aug, 2023 1 commit

automatically set num_keep if num_keep < 0 · 4dc5b117

Michael Yang authored Aug 07, 2023

num_keep defines how many tokens to keep in the context when truncating
inputs. if left to its default value of -1, the server will calculate
num_keep to be the left of the system instructions

4dc5b117

04 Aug, 2023 1 commit
- configurable rope frequency parameters · b9f4d675
  Michael Yang authored Aug 03, 2023
  
  b9f4d675
01 Aug, 2023 2 commits
- read runner parameter options from map · 1c5a8770
  Bruce MacDonald authored Aug 01, 2023
```
- read runner options from map to see what was specified explicitly and overwrite zero values
```
  1c5a8770
- cache loaded model · 528bafa5
  Jeffrey Morgan authored Jul 31, 2023
  
  528bafa5
28 Jul, 2023 3 commits
- allow specifying stop conditions in modelfile · 184ad8f0
  Bruce MacDonald authored Jul 27, 2023
  
  184ad8f0
- lower batch size to 512 · 822a0e36
  Jeffrey Morgan authored Jul 28, 2023
  
  822a0e36
- add stop conditions · fadf75f9
  Michael Yang authored Jul 27, 2023
  
  fadf75f9
27 Jul, 2023 8 commits
- add NumGQA · ad3a7d0e
  Michael Yang authored Jul 27, 2023
  
  ad3a7d0e
- increase default batch size to 1024 · 688661ab
  Jeffrey Morgan authored Jul 27, 2023
  
  688661ab
- sample metrics · cca61181
  Michael Yang authored Jul 25, 2023
  
  cca61181
- lock on llm.lock(); decrease batch size · c4904161
  Michael Yang authored Jul 20, 2023
  
  c4904161
- add session expiration · f62a8827
  Michael Yang authored Jul 19, 2023
  
  f62a8827
- update predict code · 3003fc03
  Michael Yang authored Jul 19, 2023
  
  3003fc03
- add load duration · 32aec66e
  Michael Yang authored Jul 18, 2023
  
  32aec66e
- session id · 35af37a2
  Michael Yang authored Jul 18, 2023
  
  35af37a2
25 Jul, 2023 1 commit
- download models when creating from modelfile · 4c1caa37
  Bruce MacDonald authored Jul 25, 2023
  
  4c1caa37