Commits · 56497663c8bc7de73f4ff8d53235664a044c9d95 · OpenDAS / ollama

11 Oct, 2023 2 commits
- add format bytes · b599946b
  Michael Yang authored Oct 11, 2023
  
  b599946b
- optional parameter to not stream response (#639) · 274d5a5f
  Bruce MacDonald authored Oct 11, 2023
```
* update streaming request accept header
* add optional stream param to request bodies
```
  274d5a5f
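The optional stream parameter described in this commit could be handled along the following lines. This is a minimal sketch, not the project's code: the `generateRequest` type, its fields, and the `wantsStream` helper are hypothetical, and it assumes that omitting `stream` leaves streaming enabled.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// generateRequest is a hypothetical request body. Stream is a pointer so that
// "field omitted" (nil) can be told apart from an explicit false.
type generateRequest struct {
	Model  string `json:"model"`
	Prompt string `json:"prompt"`
	Stream *bool  `json:"stream,omitempty"`
}

// parseRequest decodes a JSON request body.
func parseRequest(body []byte) (generateRequest, error) {
	var r generateRequest
	err := json.Unmarshal(body, &r)
	return r, err
}

// wantsStream reports whether the response should be streamed.
// Assumption: streaming is the default when the field is absent.
func wantsStream(r generateRequest) bool {
	return r.Stream == nil || *r.Stream
}

func main() {
	bodies := []string{
		`{"model":"m","prompt":"hi"}`,
		`{"model":"m","prompt":"hi","stream":false}`,
	}
	for _, body := range bodies {
		r, err := parseRequest([]byte(body))
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println("stream:", wantsStream(r))
	}
}
```

Using a pointer keeps existing clients working unchanged: only a request that explicitly sends `"stream": false` opts out.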
09 Oct, 2023 1 commit
- handle client proxy · 2cfffea0
  Michael Yang authored Oct 09, 2023
  
  2cfffea0
05 Oct, 2023 1 commit
- output type parsed from modelfile (#678) · 2130c070
  Bruce MacDonald authored Oct 05, 2023
  
  2130c070
04 Oct, 2023 1 commit
- increase streaming buffer size (#692) · 9e2de1bd
  Bruce MacDonald authored Oct 04, 2023
  
  9e2de1bd
02 Oct, 2023 1 commit

- Relay default values to llama runner (#672) · 1fbf3585
  Bruce MacDonald authored Oct 02, 2023

  * include seed in params for llama.cpp server and remove empty filter for temp
  * relay default predict options to llama.cpp
  * reorganize options to match predict request for readability
  * omit empty stop

  ---------
  Co-authored-by: hallh <hallh@users.noreply.github.com>

  1fbf3585

29 Sep, 2023 1 commit
- remove unused push/pull params (#650) · a1b2d95f
  Bruce MacDonald authored Sep 29, 2023
  
  a1b2d95f
28 Sep, 2023 1 commit
- use int64 consistently · f40b3de7
  Michael Yang authored Sep 28, 2023
  
  f40b3de7
14 Sep, 2023 1 commit
- DRAFT: add a simple python client to access ollama (#522) · 8efbc5df
  Patrick Devine authored Sep 14, 2023
  
  8efbc5df
12 Sep, 2023 1 commit

- first pass at linux gpu support (#454) · f2216370
  Bruce MacDonald authored Sep 12, 2023

  * linux gpu support
  * handle multiple gpus
  * add cuda docker image (#488)

  ---------
  Co-authored-by: Michael Yang <mxyng@pm.me>

  f2216370

06 Sep, 2023 1 commit
- add show command (#474) · 790d24eb
  Patrick Devine authored Sep 06, 2023
  
  790d24eb
31 Aug, 2023 1 commit
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
  0f541a03
30 Aug, 2023 1 commit

- subprocess llama.cpp server (#401) · 42998d79
  Bruce MacDonald authored Aug 30, 2023

  * remove c code
  * pack llama.cpp
  * use request context for llama_cpp
  * let llama_cpp decide the number of threads to use
  * stop llama runner when app stops
  * remove sample count and duration metrics
  * use go generate to get libraries
  * tmp dir for running llm

  42998d79

29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
28 Aug, 2023 1 commit
- loosen http status code checks · 246dc654
  Michael Yang authored Aug 26, 2023
  
  246dc654
26 Aug, 2023 1 commit
- default host to `127.0.0.1`, fixes #424 · 22ab7f5f
  Jeffrey Morgan authored Aug 26, 2023
  
  22ab7f5f
22 Aug, 2023 1 commit
- add version · 2c7f956b
  Michael Yang authored Aug 21, 2023
  
  2c7f956b
17 Aug, 2023 2 commits
- ignore nil map values · f723bf08
  Michael Yang authored Aug 17, 2023
  
  f723bf08
- parse protocol for `OLLAMA_HOST` · 54bb49a5
  Jeffrey Morgan authored Aug 17, 2023
  
  54bb49a5
16 Aug, 2023 2 commits

- set default `OLLAMA_HOST` to `http://localhost:11434` · 5ee61164
  Jeffrey Morgan authored Aug 16, 2023

  5ee61164

- cmd: support OLLAMA_CLIENT_HOST environment variable (#262) · 67e593e3
  Blake Mizerany authored Aug 16, 2023

  * cmd: support OLLAMA_HOST environment variable

  This commit adds support for the OLLAMA_HOST environment
  variable, which specifies the host the client should connect to.
  This is useful when the client runs somewhere other than the host
  where the server is running.

  The new api.FromEnv function configures clients from the
  environment. Clients that want to honor the environment variable
  consistently with the Ollama CLI can use this new function.

  * Update api/client.go
  Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

  * Update api/client.go
  Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

  ---------
  Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

  67e593e3
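As an illustration of resolving a client host from `OLLAMA_HOST`, here is a minimal sketch. It is not the implementation of `api.FromEnv`: the `parseHost` helper is hypothetical, and the fallbacks it applies (scheme `http`, host `127.0.0.1`, port `11434`) are assumptions taken from commit titles elsewhere in this log.

```go
package main

import (
	"fmt"
	"net"
	"os"
	"strings"
)

// parseHost splits a raw OLLAMA_HOST value into a scheme and a host:port pair,
// filling in assumed defaults for any part that is missing.
func parseHost(raw string) (string, string) {
	scheme, rest := "http", raw
	// An optional "scheme://" prefix overrides the default scheme.
	if s, r, ok := strings.Cut(raw, "://"); ok {
		scheme, rest = s, r
	}

	host, port, err := net.SplitHostPort(rest)
	if err != nil {
		// No port was given (or the value is empty): use the assumed default
		// port. Brackets around a bare IPv6 literal are trimmed so that
		// JoinHostPort does not add a second pair.
		host, port = strings.Trim(rest, "[]"), "11434"
	}
	if host == "" {
		host = "127.0.0.1"
	}
	return scheme, net.JoinHostPort(host, port)
}

func main() {
	scheme, hostport := parseHost(os.Getenv("OLLAMA_HOST"))
	fmt.Printf("%s://%s\n", scheme, hostport)
}
```

With the variable unset, this sketch falls back to `http://127.0.0.1:11434`; a value such as `https://example.com` keeps its scheme and only gains the default port.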

10 Aug, 2023 4 commits
- s/parmeter/parameter/ · f27bc261
  Michael Yang authored Aug 10, 2023
  
  f27bc261
- fix could not convert int · 81d8d7b7
  Michael Yang authored Aug 10, 2023
  
  81d8d7b7
- Token auth (#314) · be989d89
  Patrick Devine authored Aug 10, 2023
  
  be989d89
- embeddings endpoint · 4b3507f0
  Bruce MacDonald authored Aug 08, 2023
```
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
```
  4b3507f0
08 Aug, 2023 2 commits
- pr comments · 21ddcaa1
  Bruce MacDonald authored Aug 08, 2023
```
- default to embeddings enabled
- move embedding logic for loaded model to request
- allow embedding full directory
- close llm on reload
```
  21ddcaa1
- allow overriding `template` and `system` in `/api/generate` · 8713ac23
  Jeffrey Morgan authored Aug 08, 2023
```
Fixes #297
Fixes #296
```
  8713ac23
07 Aug, 2023 1 commit

- automatically set num_keep if num_keep < 0 · 4dc5b117
  Michael Yang authored Aug 07, 2023

  num_keep defines how many tokens to keep in the context when truncating
  inputs. If left at its default value of -1, the server will calculate
  num_keep to be the length of the system instructions.

  4dc5b117

04 Aug, 2023 1 commit
- configurable rope frequency parameters · b9f4d675
  Michael Yang authored Aug 03, 2023
  
  b9f4d675
01 Aug, 2023 3 commits
- use head to check heartbeat · 76599436
  Bruce MacDonald authored Aug 01, 2023
  
  76599436
- read runner parameter options from map · 1c5a8770
  Bruce MacDonald authored Aug 01, 2023
```
- read runner options from map to see what was specified explicitly and overwrite zero values
```
  1c5a8770
- cache loaded model · 528bafa5
  Jeffrey Morgan authored Jul 31, 2023
  
  528bafa5
31 Jul, 2023 1 commit
- check server is running before running command · e72fe794
  Bruce MacDonald authored Jul 31, 2023
  
  e72fe794
28 Jul, 2023 3 commits
- allow specifying stop conditions in modelfile · 184ad8f0
  Bruce MacDonald authored Jul 27, 2023
  
  184ad8f0
- lower batch size to 512 · 822a0e36
  Jeffrey Morgan authored Jul 28, 2023
  
  822a0e36
- add stop conditions · fadf75f9
  Michael Yang authored Jul 27, 2023
  
  fadf75f9
27 Jul, 2023 4 commits
- add NumGQA · ad3a7d0e
  Michael Yang authored Jul 27, 2023
  
  ad3a7d0e
- increase default batch size to 1024 · 688661ab
  Jeffrey Morgan authored Jul 27, 2023
  
  688661ab
- sample metrics · cca61181
  Michael Yang authored Jul 25, 2023
  
  cca61181
- lock on llm.lock(); decrease batch size · c4904161
  Michael Yang authored Jul 20, 2023
  
  c4904161