Commits · 6602e793c011805bec36d7d5b1f27537fe2f2353 · OpenDAS / ollama

10 May, 2024 1 commit

Use `--quantize` flag and `quantize` api parameter (#4321) · 6602e793

Jeffrey Morgan authored May 10, 2024

* rename `--quantization` to `--quantize`

* backwards

* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>

---------
Co-authored-by: Michael Yang <mxyng@pm.me>

6602e793

09 May, 2024 3 commits
- omit empty done reason · c02db932
  Bruce MacDonald authored May 09, 2024
  
  c02db932
- add done_reason to the api (#4235) · cfa84b84
  Bruce MacDonald authored May 09, 2024
  
  cfa84b84
- use model defaults for `num_gqa`, `rope_frequency_base` and `rope_frequency_scale` (#1983) · d5eec16d
  Jeffrey Morgan authored May 09, 2024
  
  d5eec16d
07 May, 2024 1 commit

api: fill up API documentation (#3596) · d77c1c5f

Eli Bendersky authored May 07, 2024

* api: fill up API documentation

Followup for #2878

Now that the documentation is more complete, mention it in the README.

Updates #2840

* fix typo/lint

* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

d77c1c5f

06 May, 2024 1 commit
- Add MarshalJSON to Duration (#3284) · af47413d
  Jackie Li authored May 06, 2024
```
---------
Co-authored-by: Patrick Devine <patrick@infrahq.com>
```
  af47413d
29 Apr, 2024 1 commit
- better checking for OLLAMA_HOST variable (#3661) · 9009bedf
  Patrick Devine authored Apr 29, 2024
  
  9009bedf
25 Apr, 2024 1 commit
- llm: limit generation to 10x context size to avoid run on generations (#3918) · 993cf8bf
  Jeffrey Morgan authored Apr 25, 2024
```
* llm: limit generation to 10x context size to avoid run on generations

* add comment

* simplify condition statement
```
  993cf8bf
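
The cap described in this commit can be illustrated with a minimal sketch. `capNumPredict` is a hypothetical helper, not a function taken from the repository, and treating a negative value as "no explicit limit" is an assumption made for the example.

```go
package main

import "fmt"

// capNumPredict is a hypothetical helper, not a function from the ollama
// source. It assumes a negative numPredict means "no explicit limit" and
// clamps the number of generated tokens to 10x the context size.
func capNumPredict(numPredict, numCtx int) int {
	limit := 10 * numCtx
	if numPredict < 0 || numPredict > limit {
		return limit
	}
	return numPredict
}

func main() {
	fmt.Println(capNumPredict(-1, 2048))    // unlimited request is capped: 20480
	fmt.Println(capNumPredict(128, 2048))   // small request is unchanged: 128
	fmt.Println(capNumPredict(50000, 2048)) // oversized request is capped: 20480
}
```

With a single comparison against `10 * numCtx`, a request that never emits a stop token still terminates after a bounded number of tokens.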
23 Apr, 2024 1 commit

Request and model concurrency · 34b9db5a

Daniel Hiltgen authored Mar 30, 2024

This change adds support for multiple concurrent requests, as well as
loading multiple models by spawning multiple runners. The default
settings are currently set at 1 concurrent request per model and only 1
loaded model at a time, but these can be adjusted by setting
OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.

34b9db5a
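
As a rough illustration of the defaults described in this commit message, the sketch below reads the two variables and falls back to 1 for each. The type and function names are hypothetical, and falling back to the default on malformed or non-positive values is an assumption, not confirmed behavior of the server.

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// concurrencySettings is a hypothetical container for the two limits named
// in the commit message; it is not a type from the ollama codebase.
type concurrencySettings struct {
	NumParallel     int // concurrent requests per model
	MaxLoadedModels int // models kept loaded at the same time
}

// positiveIntOr parses raw as a positive integer and returns fallback when
// raw is empty, malformed, or not positive (an assumption for this sketch).
func positiveIntOr(raw string, fallback int) int {
	n, err := strconv.Atoi(raw)
	if err != nil || n < 1 {
		return fallback
	}
	return n
}

// loadSettings reads both variables through getenv so that callers can
// substitute a fake lookup. Both default to 1, as the commit describes.
func loadSettings(getenv func(string) string) concurrencySettings {
	return concurrencySettings{
		NumParallel:     positiveIntOr(getenv("OLLAMA_NUM_PARALLEL"), 1),
		MaxLoadedModels: positiveIntOr(getenv("OLLAMA_MAX_LOADED_MODELS"), 1),
	}
}

func main() {
	s := loadSettings(os.Getenv)
	fmt.Printf("parallel=%d max_loaded=%d\n", s.NumParallel, s.MaxLoadedModels)
}
```

Passing the lookup function as a parameter keeps the parsing logic testable without touching the process environment.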

21 Apr, 2024 1 commit
- chore: use errors.New to replace fmt.Errorf will much better (#3789) · 62be2050
  Cheng authored Apr 21, 2024
  
  62be2050
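
The idea behind this chore is the usual Go convention: `errors.New` for a fixed message, `fmt.Errorf` only when the message needs formatting. The sketch below uses hypothetical names and messages that are not taken from the repository.

```go
package main

import (
	"errors"
	"fmt"
)

// errModelRequired is a hypothetical sentinel error. Its message is static,
// so errors.New is enough and no format verbs are involved.
var errModelRequired = errors.New("model is required")

// notFound needs to interpolate a value, so fmt.Errorf is the right tool.
func notFound(name string) error {
	return fmt.Errorf("model %q not found", name)
}

// validate returns the sentinel when the name is empty.
func validate(name string) error {
	if name == "" {
		return errModelRequired
	}
	return nil
}

func main() {
	err := validate("")
	fmt.Println(errors.Is(err, errModelRequired)) // true
	fmt.Println(notFound("llama3"))               // model "llama3" not found
}
```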
10 Apr, 2024 1 commit
- api: start adding documentation to package api (#2878) · ad90b9ab
  Eli Bendersky authored Apr 10, 2024
```
* api: start adding documentation to package api

Updates #2840

* Fix lint typo report
```
  ad90b9ab
09 Apr, 2024 1 commit
- fix: rope · 01114b45
  Michael Yang authored Apr 09, 2024
  
  01114b45
08 Apr, 2024 2 commits
- cgo quantize · 9502e566
  Michael Yang authored Apr 05, 2024
  
  9502e566
- no blob create if already exists · e1c9a2a0
  Michael Yang authored Apr 05, 2024
  
  e1c9a2a0
06 Apr, 2024 1 commit
- no rope parameters · be517e49
  Michael Yang authored Apr 05, 2024
  
  be517e49
26 Mar, 2024 1 commit
- change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) · 1b272d5b
  Patrick Devine authored Mar 26, 2024
  
  1b272d5b
13 Mar, 2024 1 commit
- Default Keep Alive environment variable (#3094) · 47cfe58a
  Patrick Devine authored Mar 13, 2024
```
---------
Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>
```
  47cfe58a
01 Mar, 2024 1 commit
- Fix embeddings load model behavior (#2848) · 3b4bab3d
  Jeffrey Morgan authored Feb 29, 2024
  
  3b4bab3d
25 Feb, 2024 1 commit
- Update types.go (#2744) · e95b8967
  Ikko Eltociear Ashimine authored Feb 26, 2024
```
specfied -> specified
```
  e95b8967
20 Feb, 2024 1 commit
- use http.DefaultClient (#2530) · 897b2134
  Michael Yang authored Feb 20, 2024
```
default client already handles proxy
```
  897b2134
13 Feb, 2024 1 commit
- Fix infinite keep_alive (#2480) · caf2b13c
  bnorick authored Feb 13, 2024
  
  caf2b13c
26 Jan, 2024 1 commit
- add keep_alive to generate/chat/embedding api endpoints (#2146) · b5cf31b4
  Patrick Devine authored Jan 26, 2024
  
  b5cf31b4
25 Jan, 2024 1 commit
- Save and load sessions (#2063) · 7c40a678
  Patrick Devine authored Jan 25, 2024
  
  7c40a678
18 Jan, 2024 2 commits
- add model to ModelResponse · 745b5934
  Michael Yang authored Jan 18, 2024
  
  745b5934
- api: add model for all requests · a38d88d8
  Michael Yang authored Jan 11, 2024
```
prefer using req.Model and fallback to req.Name
```
  a38d88d8
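
The fallback described in this commit message ("prefer using req.Model and fallback to req.Name") might look roughly like the sketch below. The `request` type and `resolveModel` helper are hypothetical stand-ins; only the field names `Model` and `Name` come from the message, and the JSON tags are assumed.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// request is a hypothetical stand-in for an API request type that carries
// both the newer Model field and the older Name field.
type request struct {
	Model string `json:"model"`
	Name  string `json:"name"`
}

// resolveModel prefers Model and falls back to Name when Model is empty.
func resolveModel(req request) string {
	if req.Model != "" {
		return req.Model
	}
	return req.Name
}

func main() {
	bodies := []string{
		`{"model": "llama3"}`,
		`{"name": "mistral"}`,
		`{"model": "llama3", "name": "mistral"}`,
	}
	for _, body := range bodies {
		var req request
		if err := json.Unmarshal([]byte(body), &req); err != nil {
			fmt.Println("decode error:", err)
			continue
		}
		fmt.Println(resolveModel(req))
	}
}
```

Clients that still send only `name` keep working, while clients that send both get the value of `model`.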
11 Jan, 2024 1 commit
- remove client.py · 5ffbbea1
  Michael Yang authored Jan 11, 2024
  
  5ffbbea1
05 Jan, 2024 1 commit
- add show info command and fix the modelfile · 22e93efa
  Patrick Devine authored Jan 04, 2024
  
  22e93efa
04 Jan, 2024 1 commit
- Add embeddings to API (#1773) · 0d6e3565
  Brian Murray authored Jan 04, 2024
  
  0d6e3565
27 Dec, 2023 2 commits
- clean up cache api option · 55978c1d
  Jeffrey Morgan authored Dec 27, 2023
  
  55978c1d
- enable `cache_prompt` by default · d4ebdadb
  Jeffrey Morgan authored Dec 27, 2023
  
  d4ebdadb
22 Dec, 2023 1 commit
- Add Cache flag to api (#1642) · 10da41d6
  K0IN authored Dec 22, 2023
  
  10da41d6
18 Dec, 2023 1 commit
- send empty messages on last chat response (#1530) · d99fa6ce
  Bruce MacDonald authored Dec 18, 2023
  
  d99fa6ce
12 Dec, 2023 1 commit
- add image support to the chat api (#1490) · d9e60f63
  Patrick Devine authored Dec 12, 2023
  
  d9e60f63
11 Dec, 2023 1 commit

Multimodal support (#1216) · 910e9401

Patrick Devine authored Dec 11, 2023

---------
Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local>

910e9401

09 Dec, 2023 1 commit
- Don't expose model information in `/api/generate` · 9e1406e4
  Jeffrey Morgan authored Dec 09, 2023
  
  9e1406e4
05 Dec, 2023 4 commits
- return model configuration in generate · 5d75505e
  Michael Yang authored Dec 01, 2023
  
  5d75505e
- chat api endpoint (#1392) · 195e3d9d
  Bruce MacDonald authored Dec 05, 2023
  
  195e3d9d
- api: add version api handler · 0db4706e
  Michael Yang authored Nov 22, 2023
  
  0db4706e
- Revert "chat api (#991)" while context variable is fixed · 00d06619
  Jeffrey Morgan authored Dec 04, 2023
```
This reverts commit 7a0899d6.
```
  00d06619
04 Dec, 2023 1 commit

chat api (#991) · 7a0899d6

Bruce MacDonald authored Dec 04, 2023

- update chat docs
- add messages chat endpoint
- remove deprecated context and template generate parameters from docs
- context and template are still supported for the time being and will continue to work as expected
- add partial response to chat history

7a0899d6