Commits · 785b4eb5bf9179dfbd798f59ca154bf4449d8f06 · OpenDAS / ollama

16 Oct, 2023 2 commits
- server: print version on start · 1af493c5
  Michael Yang authored Oct 13, 2023
  
  1af493c5
- deprecate modelfile embed command (#759) · a0c3e989
  Bruce MacDonald authored Oct 16, 2023
  
  a0c3e989
12 Oct, 2023 1 commit
- validate api options fields from map (#711) · 7804b8fa
  Bruce MacDonald authored Oct 12, 2023
  
  7804b8fa
11 Oct, 2023 1 commit
- optional parameter to not stream response (#639) · 274d5a5f
  Bruce MacDonald authored Oct 11, 2023
```
* update streaming request accept header
* add optional stream param to request bodies
```
  274d5a5f
06 Oct, 2023 1 commit
- not found error before pulling model (#718) · af4cf558
  Bruce MacDonald authored Oct 06, 2023
  
  af4cf558
30 Sep, 2023 1 commit
- Document response stream chunk delimiter. (#632) · 1d0ebe67
  Jay Nakrani authored Sep 30, 2023
```
Document response stream chunk delimiter.
```
  1d0ebe67
29 Sep, 2023 1 commit
- remove unused push/pull params (#650) · a1b2d95f
  Bruce MacDonald authored Sep 29, 2023
  
  a1b2d95f
27 Sep, 2023 1 commit
- prune empty directories · 8608eb47
  Michael Yang authored Sep 26, 2023
  
  8608eb47
23 Sep, 2023 1 commit
- check other request fields before load short circuit in `/api/generate` · 9b12a511
  Jeffrey Morgan authored Sep 22, 2023
  
  9b12a511
22 Sep, 2023 1 commit
- close llm on interrupt (#577) · 5d71bda4
  Bruce MacDonald authored Sep 22, 2023
  
  5d71bda4
21 Sep, 2023 4 commits

Michael Yang authored Sep 21, 2023

HEAD request should respond like their GET counterparts except without a
response body.

c9866943

remove tmp directories created by previous servers (#559) · 4cba75ef

Bruce MacDonald authored Sep 21, 2023



* remove tmp directories created by previous servers

* clean up on server stop

* Update routes.go

* Update server/routes.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* create top-level temp ollama dir

* check file exists before creating

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
Co-authored-by: Michael Yang <mxyng@pm.me>

4cba75ef

refactor default allow origins · 1fabba47
Michael Yang authored Sep 21, 2023
```
this should be less error prone
```
1fabba47

20 Sep, 2023 1 commit
- only package 11.8 runner · 1255bc9b
  Bruce MacDonald authored Sep 20, 2023
  
  1255bc9b
18 Sep, 2023 1 commit
- Cmd changes (#541) · 80dd44e8
  Patrick Devine authored Sep 18, 2023
  
  80dd44e8
12 Sep, 2023 1 commit

first pass at linux gpu support (#454) · f2216370

Bruce MacDonald authored Sep 12, 2023



* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f2216370

11 Sep, 2023 1 commit
- add autoprune to remove unused layers (#491) · e7e91cd7
  Patrick Devine authored Sep 11, 2023
  
  e7e91cd7
06 Sep, 2023 1 commit
- add show command (#474) · 790d24eb
  Patrick Devine authored Sep 06, 2023
  
  790d24eb
03 Sep, 2023 1 commit
- fix num_keep · 681f3c4c
  Michael Yang authored Sep 03, 2023
  
  681f3c4c
31 Aug, 2023 2 commits
- fix list models for windows · eeb40a67
  Michael Yang authored Aug 30, 2023
  
  eeb40a67
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
  0f541a03
30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
22 Aug, 2023 3 commits
- build release mode · 95187d7e
  Michael Yang authored Aug 22, 2023
  
  95187d7e
- fix `FROM` instruction erroring when referring to a file · a9f6c566
  Jeffrey Morgan authored Aug 22, 2023
  
  a9f6c566
- Strip protocol from model path (#377) · 0a892419
  Ryan Baker authored Aug 21, 2023
  
  0a892419
15 Aug, 2023 1 commit
- use loaded llm for embeddings · 326de489
  Bruce MacDonald authored Aug 15, 2023
  
  326de489
11 Aug, 2023 1 commit
- add maximum retries when pushing (#334) · d9cf18e2
  Patrick Devine authored Aug 11, 2023
  
  d9cf18e2
10 Aug, 2023 4 commits
- clean up cli flags · 040a5b97
  Jeffrey Morgan authored Aug 10, 2023
  
  040a5b97
- implement loading ggml lora adapters through the modelfile · 6de5d032
  Michael Yang authored Aug 03, 2023
  
  6de5d032
- partial decode ggml bin for more info · fccf8d17
  Michael Yang authored Jul 21, 2023
  
  fccf8d17
- embeddings endpoint · 4b3507f0
  Bruce MacDonald authored Aug 08, 2023
```
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  4b3507f0
09 Aug, 2023 3 commits
- allow for concurrent pulls of the same files · 868e3b31
  Bruce MacDonald authored Jul 25, 2023
  
  868e3b31
- fix build errors · 09d8bf67
  Bruce MacDonald authored Aug 09, 2023
  
  09d8bf67
- use content type `application/x-ndjson` for streaming responses · cff002b8
  Jeffrey Morgan authored Aug 08, 2023
  
  cff002b8
08 Aug, 2023 3 commits
- add `0.0.0.0` as an allowed origin by default · a027a7dd
  Jeffrey Morgan authored Aug 08, 2023
```
Fixes #282
```
  a027a7dd
- pr comments · 21ddcaa1
  Bruce MacDonald authored Aug 08, 2023
```
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
```
  21ddcaa1
- embed text document in modelfile · a6f6d18f
  Bruce MacDonald authored Aug 04, 2023
  
  a6f6d18f
07 Aug, 2023 1 commit

automatically set num_keep if num_keep < 0 · 4dc5b117

Michael Yang authored Aug 07, 2023

num_keep defines how many tokens to keep in the context when truncating
inputs. if left to its default value of -1, the server will calculate
num_keep to be the left of the system instructions

4dc5b117