Commits · ec2a31e9b3785ff21f1f0a8df4b4cd351f43f8ea · OpenDAS / ollama

08 Nov, 2023 1 commit

support raw generation requests (#952) · ec2a31e9

Bruce MacDonald authored Nov 08, 2023

- add the optional `raw` generate request parameter to bypass prompt formatting and response context
-add raw request to docs

ec2a31e9

03 Nov, 2023 2 commits
- Restore system prompt on requests and default `num_keep` to `0` · 17678b72
  Jeffrey Morgan authored Nov 03, 2023
  
  17678b72
- Set `NumKeep` to `4` by default (#982) · 06589a3b
  Jeffrey Morgan authored Nov 02, 2023
  
  06589a3b
02 Nov, 2023 2 commits
- update default NumKeep · 6db3691b
  Michael Yang authored Nov 02, 2023
  
  6db3691b
- use http.Method · 60bb3c03
  Michael Yang authored Nov 02, 2023
  
  60bb3c03
27 Oct, 2023 1 commit

allow for a configurable ollama model storage directory (#897) · 5c3491f4

Bruce MacDonald authored Oct 27, 2023



* allow for a configurable ollama models directory

- set OLLAMA_MODELS in the environment that ollama is running in to change where model files are stored
- update docs
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
Co-Authored-By: Jay Nakrani <dhananjaynakrani@gmail.com>
Co-Authored-By: Akhil Acharya <akhilcacharya@gmail.com>
Co-Authored-By: Sasha Devol <sasha.devol@protonmail.com>

5c3491f4

26 Oct, 2023 1 commit
- client: fix trailing slash · 28c3f288
  Michael Yang authored Oct 26, 2023
  
  28c3f288
20 Oct, 2023 1 commit
- fix: ollama host for hostname · 459f4a78
  Michael Yang authored Oct 20, 2023
  
  459f4a78
19 Oct, 2023 1 commit

do not reload the running llm when runtime params change (#840) · fe6f3b48

Bruce MacDonald authored Oct 19, 2023

- only reload the running llm if the model has changed, or the options for loading the running model have changed
- rename loaded llm to runner to differentiate from loaded model image
- remove logic which keeps the first system prompt in the generation context

fe6f3b48

13 Oct, 2023 2 commits

fix memory check · 92189a58
Michael Yang authored Oct 12, 2023

92189a58

improve api error handling (#781) · 6fe17813

Bruce MacDonald authored Oct 13, 2023

- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM

6fe17813

12 Oct, 2023 1 commit
- validate api options fields from map (#711) · 7804b8fa
  Bruce MacDonald authored Oct 12, 2023
  
  7804b8fa
11 Oct, 2023 2 commits
- add format bytes · b599946b
  Michael Yang authored Oct 11, 2023
  
  b599946b
- optional parameter to not stream response (#639) · 274d5a5f
  Bruce MacDonald authored Oct 11, 2023
```
* update streaming request accept header
* add optional stream param to request bodies
```
  274d5a5f
09 Oct, 2023 1 commit
- handle client proxy · 2cfffea0
  Michael Yang authored Oct 09, 2023
  
  2cfffea0
05 Oct, 2023 1 commit
- output type parsed from modelfile (#678) · 2130c070
  Bruce MacDonald authored Oct 05, 2023
  
  2130c070
04 Oct, 2023 1 commit
- increase streaming buffer size (#692) · 9e2de1bd
  Bruce MacDonald authored Oct 04, 2023
  
  9e2de1bd
02 Oct, 2023 1 commit

Relay default values to llama runner (#672) · 1fbf3585

Bruce MacDonald authored Oct 02, 2023



* include seed in params for llama.cpp server and remove empty filter for temp

* relay default predict options to llama.cpp

- reorganize options to match predict request for readability

* omit empty stop

---------
Co-authored-by: hallh <hallh@users.noreply.github.com>

1fbf3585

29 Sep, 2023 1 commit
- remove unused push/pull params (#650) · a1b2d95f
  Bruce MacDonald authored Sep 29, 2023
  
  a1b2d95f
28 Sep, 2023 1 commit
- use int64 consistently · f40b3de7
  Michael Yang authored Sep 28, 2023
  
  f40b3de7
14 Sep, 2023 1 commit
- DRAFT: add a simple python client to access ollama (#522) · 8efbc5df
  Patrick Devine authored Sep 14, 2023
  
  8efbc5df
12 Sep, 2023 1 commit

first pass at linux gpu support (#454) · f2216370

Bruce MacDonald authored Sep 12, 2023



* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>

f2216370

06 Sep, 2023 1 commit
- add show command (#474) · 790d24eb
  Patrick Devine authored Sep 06, 2023
  
  790d24eb
31 Aug, 2023 1 commit
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
  0f541a03
30 Aug, 2023 1 commit

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
28 Aug, 2023 1 commit
- loosen http status code checks · 246dc654
  Michael Yang authored Aug 26, 2023
  
  246dc654
26 Aug, 2023 1 commit
- default host to `127.0.0.1`, fixes #424 · 22ab7f5f
  Jeffrey Morgan authored Aug 26, 2023
  
  22ab7f5f
22 Aug, 2023 1 commit
- add version · 2c7f956b
  Michael Yang authored Aug 21, 2023
  
  2c7f956b
17 Aug, 2023 2 commits
- ignore nil map values · f723bf08
  Michael Yang authored Aug 17, 2023
  
  f723bf08
- parse protocol for `OLLAMA_HOST` · 54bb49a5
  Jeffrey Morgan authored Aug 17, 2023
  
  54bb49a5
16 Aug, 2023 2 commits

set default `OLLAMA_HOST` to `http://localhost:11434` · 5ee61164
Jeffrey Morgan authored Aug 16, 2023

5ee61164

cmd: support OLLAMA_CLIENT_HOST environment variable (#262) · 67e593e3

Blake Mizerany authored Aug 16, 2023



* cmd: support OLLAMA_HOST environment variable

This commit adds support for the OLLAMA_HOST environment
variable. This variable can be used to specify the host to which
the client should connect. This is useful when the client is
running somewhere other than the host where the server is running.

The new api.FromEnv function is used to read configure clients from the
environment. Clients wishing to use the environment variable being
consistent with the Ollama CLI can use this new function.

* Update api/client.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update api/client.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

67e593e3

10 Aug, 2023 4 commits
- s/parmeter/parameter/ · f27bc261
  Michael Yang authored Aug 10, 2023
  
  f27bc261
- fix could not convert int · 81d8d7b7
  Michael Yang authored Aug 10, 2023
  
  81d8d7b7
- Token auth (#314) · be989d89
  Patrick Devine authored Aug 10, 2023
  
  be989d89
- embeddings endpoint · 4b3507f0
  Bruce MacDonald authored Aug 08, 2023
```
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  4b3507f0
08 Aug, 2023 2 commits
- pr comments · 21ddcaa1
  Bruce MacDonald authored Aug 08, 2023
```
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
```
  21ddcaa1
- allow overriding `template` and `system` in `/api/generate` · 8713ac23
  Jeffrey Morgan authored Aug 08, 2023
```
Fixes #297
Fixes #296
```
  8713ac23
07 Aug, 2023 1 commit

automatically set num_keep if num_keep < 0 · 4dc5b117

Michael Yang authored Aug 07, 2023

num_keep defines how many tokens to keep in the context when truncating
inputs. if left to its default value of -1, the server will calculate
num_keep to be the left of the system instructions

4dc5b117