- 04 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 14 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 10 May, 2024 1 commit
-
-
Jeffrey Morgan authored
* rename `--quantization` to `--quantize` * backwards * Update api/types.go Co-authored-by:
Michael Yang <mxyng@pm.me> --------- Co-authored-by:
Michael Yang <mxyng@pm.me>
-
- 09 May, 2024 3 commits
-
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
Jeffrey Morgan authored
-
- 07 May, 2024 1 commit
-
-
Eli Bendersky authored
* api: fill up API documentation Followup for #2878 Now that the documentation is more complete, mention it in the README. Updates #2840 * fix typo/lint * Update README.md Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> --------- Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
- 06 May, 2024 1 commit
-
-
Jackie Li authored
--------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
- 29 Apr, 2024 1 commit
-
-
Patrick Devine authored
-
- 25 Apr, 2024 1 commit
-
-
Jeffrey Morgan authored
* llm: limit generation to 10x context size to avoid run on generations * add comment * simplify condition statement
-
- 23 Apr, 2024 1 commit
-
-
Daniel Hiltgen authored
This change adds support for multiple concurrent requests, as well as loading multiple models by spawning multiple runners. The default settings are currently set at 1 concurrent request per model and only 1 loaded model at a time, but these can be adjusted by setting OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
-
- 21 Apr, 2024 1 commit
-
-
Cheng authored
-
- 10 Apr, 2024 1 commit
-
-
Eli Bendersky authored
* api: start adding documentation to package api Updates #2840 * Fix lint typo report
-
- 09 Apr, 2024 1 commit
-
-
Michael Yang authored
-
- 08 Apr, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 06 Apr, 2024 1 commit
-
-
Michael Yang authored
-
- 26 Mar, 2024 1 commit
-
-
Patrick Devine authored
-
- 13 Mar, 2024 1 commit
-
-
Patrick Devine authored
--------- Co-authored-by:Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>
-
- 01 Mar, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 25 Feb, 2024 1 commit
-
-
Ikko Eltociear Ashimine authored
specfied -> specified
-
- 20 Feb, 2024 1 commit
-
-
Michael Yang authored
default client already handles proxy
-
- 13 Feb, 2024 1 commit
-
-
bnorick authored
-
- 26 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 25 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 18 Jan, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
prefer using req.Model and fallback to req.Name
-
- 11 Jan, 2024 1 commit
-
-
Michael Yang authored
-
- 05 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 04 Jan, 2024 1 commit
-
-
Brian Murray authored
-
- 27 Dec, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 22 Dec, 2023 1 commit
-
-
K0IN authored
-
- 18 Dec, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 12 Dec, 2023 1 commit
-
-
Patrick Devine authored
-
- 11 Dec, 2023 1 commit
-
-
Patrick Devine authored
--------- Co-authored-by:Matt Apperson <mattapperson@Matts-MacBook-Pro.local>
-
- 09 Dec, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 05 Dec, 2023 3 commits
-
-
Michael Yang authored
-
Bruce MacDonald authored
-
Michael Yang authored
-