- 30 Sep, 2023 (1 commit)
  - Jay Nakrani authored: Document response stream chunk delimiter. (A stream-consumption sketch follows below.)
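Assuming the documented delimiter is the newline between JSON chunks (Ollama's streaming responses are newline-delimited JSON objects), a minimal Go sketch of consuming such a stream could look like this; the endpoint, model name, and field names are illustrative of the public `/api/generate` API rather than quoted from the commit:

```go
// Minimal sketch: consume a newline-delimited JSON response stream.
// Assumes the server emits one JSON object per line, terminated by '\n'
// (the delimiter the commit above documents).
package main

import (
	"bufio"
	"bytes"
	"encoding/json"
	"fmt"
	"log"
	"net/http"
)

type chunk struct {
	Response string `json:"response"`
	Done     bool   `json:"done"`
}

func main() {
	body := bytes.NewBufferString(`{"model":"llama2","prompt":"Why is the sky blue?"}`)
	resp, err := http.Post("http://localhost:11434/api/generate", "application/json", body)
	if err != nil {
		log.Fatal(err)
	}
	defer resp.Body.Close()

	scanner := bufio.NewScanner(resp.Body) // splits on '\n' by default
	for scanner.Scan() {
		var c chunk
		if err := json.Unmarshal(scanner.Bytes(), &c); err != nil {
			log.Fatal(err)
		}
		fmt.Print(c.Response)
		if c.Done {
			break
		}
	}
	if err := scanner.Err(); err != nil {
		log.Fatal(err)
	}
}
```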
- 28 Sep, 2023 (1 commit)
  - Aaron Coffey authored
- 27 Sep, 2023 (3 commits)
  - Jeffrey Morgan authored
  - Bruce MacDonald authored
  - James Braza authored
- 25 Sep, 2023 (5 commits)
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
- 20 Sep, 2023 (4 commits)
  - Michael Yang authored
  - Bruce MacDonald authored
  - Bruce MacDonald authored
  - Bruce MacDonald authored
- 14 Sep, 2023 (2 commits)
  - Bruce MacDonald authored:
    * enable packaging multiple cuda versions
    * use nvcc cuda version if available (see the detection sketch below)
    Co-authored-by: Michael Yang <mxyng@pm.me>
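The second bullet suggests probing the local toolchain for its CUDA version. The commit's actual code is not shown in this log; as a minimal Go sketch of the idea, assuming `nvcc` is on `PATH` and prints its usual `release X.Y` banner:

```go
// Minimal sketch: prefer the CUDA version reported by nvcc when it is
// installed. Illustrates the idea in the commit above, not its code.
package main

import (
	"fmt"
	"os/exec"
	"regexp"
)

// cudaVersion (hypothetical helper) returns the CUDA version parsed from
// `nvcc --version`, or ok=false when nvcc is missing or unrecognized.
func cudaVersion() (version string, ok bool) {
	out, err := exec.Command("nvcc", "--version").Output()
	if err != nil {
		return "", false
	}
	// nvcc prints a line like: "Cuda compilation tools, release 12.2, V12.2.91"
	m := regexp.MustCompile(`release (\d+\.\d+)`).FindSubmatch(out)
	if m == nil {
		return "", false
	}
	return string(m[1]), true
}

func main() {
	if v, ok := cudaVersion(); ok {
		fmt.Println("using nvcc CUDA version:", v)
	} else {
		fmt.Println("nvcc not available; falling back to a packaged CUDA version")
	}
}
```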
  - Matt Williams authored:
    * Update API docs
    * strange TOC was getting auto generated
    * Update docs/api.md
    * Update docs/api.md
    * Update docs/api.md
    * Update api.md
    Signed-off-by: Matt Williams <m@technovangelist.com>
    Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
    Co-authored-by: Michael Chiang <mchiang0610@users.noreply.github.com>
- 12 Sep, 2023 (1 commit)
  - Bruce MacDonald authored:
    * linux gpu support
    * handle multiple gpus
    * add cuda docker image (#488)
    Co-authored-by: Michael Yang <mxyng@pm.me>
- 06 Sep, 2023 (1 commit)
  - Ackermann Yuriy authored
- 30 Aug, 2023 (2 commits)
  - Bruce MacDonald authored:
    * remove c code
    * pack llama.cpp
    * use request context for llama_cpp (see the cancellation sketch below)
    * let llama_cpp decide the number of threads to use
    * stop llama runner when app stops
    * remove sample count and duration metrics
    * use go generate to get libraries
    * tmp dir for running llm
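The "use request context" and "stop llama runner when app stops" bullets describe Go's standard context-cancellation pattern. A minimal sketch of that pattern, with `nextToken` as a hypothetical stand-in for a llama.cpp prediction step (this is not the repository's runner code):

```go
// Minimal sketch of the cancellation idea: a generation loop that checks
// the HTTP request's context so it stops when the client disconnects or
// the app shuts down.
package main

import (
	"context"
	"fmt"
	"net/http"
)

func nextToken() string { return "token " } // placeholder for llama.cpp inference

func generate(ctx context.Context, emit func(string)) error {
	for i := 0; i < 256; i++ {
		select {
		case <-ctx.Done():
			return ctx.Err() // request cancelled or server stopping
		default:
		}
		emit(nextToken())
	}
	return nil
}

func main() {
	http.HandleFunc("/api/generate", func(w http.ResponseWriter, r *http.Request) {
		// r.Context() is cancelled when the client goes away.
		if err := generate(r.Context(), func(tok string) { fmt.Fprint(w, tok) }); err != nil {
			fmt.Println("generation stopped:", err)
		}
	})
	http.ListenAndServe(":11434", nil)
}
```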
  - Quinn Slack authored:
    The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop": ["\n"]`, generation should stop on any token containing `\n` (and trim `\n` from the output), not only on a token that exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, callers of the generate API would have to know the LLM's tokenizer and enumerate many tokens in the `stop` list. Fixes https://github.com/jmorganca/ollama/issues/295. (A sketch of this substring check follows below.)
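A minimal Go sketch of the substring semantics described above: generation halts once the accumulated output contains any stop sequence, even one spanning token boundaries, and the output is trimmed at the match. `checkStop` and the sample tokens are illustrative, not the repository's implementation:

```go
// Minimal sketch of substring-based stop sequences: generation halts when
// the accumulated output contains any stop sequence, and the output is
// trimmed at the earliest match, even if it spans token boundaries.
package main

import (
	"fmt"
	"strings"
)

// checkStop (hypothetical helper) reports whether output contains any stop
// sequence, returning the output trimmed at the earliest match.
func checkStop(output string, stop []string) (string, bool) {
	cut := -1
	for _, s := range stop {
		if i := strings.Index(output, s); i >= 0 && (cut == -1 || i < cut) {
			cut = i
		}
	}
	if cut == -1 {
		return output, false
	}
	return output[:cut], true
}

func main() {
	// Note the third token contains "\n" without being exactly "\n".
	tokens := []string{"Hello", ", wor", "ld!\nSec", "ond line"}
	stop := []string{"\n"}

	var out strings.Builder
	for _, tok := range tokens {
		out.WriteString(tok)
		if trimmed, done := checkStop(out.String(), stop); done {
			fmt.Printf("%q\n", trimmed) // "Hello, world!"
			return
		}
	}
	fmt.Printf("%q\n", out.String())
}
```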
- 27 Aug, 2023 (1 commit)
  - Jeffrey Morgan authored
- 25 Aug, 2023 (1 commit)
  - Michael Yang authored
- 15 Aug, 2023 (1 commit)
  - Bruce MacDonald authored
- 14 Aug, 2023 (5 commits)
  - Bruce MacDonald authored
  - Bruce MacDonald authored
  - Bruce MacDonald authored
  - Bruce MacDonald authored
  - Güvenç Usanmaz authored: Corrected the base_url value used when creating the Ollama object.
- 11 Aug, 2023 (6 commits)
  - Matt Williams authored. Signed-off-by: Matt Williams <m@technovangelist.com>
  - Matt Williams authored. Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
  - Matt Williams authored. Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
  - Matt Williams authored. Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
  - Matt Williams authored. Signed-off-by: Matt Williams <m@technovangelist.com>
  - Arturas Smorgun authored. Co-authored-by: Michael Yang <mxyng@pm.me>
- 10 Aug, 2023 (4 commits)
  - Arturas Smorgun authored: This needs to be adjustable for some models; see https://github.com/jmorganca/ollama/issues/320 for more context.
  - Jeffrey Morgan authored
  - Jeffrey Morgan authored
  - Michael Yang authored
- 09 Aug, 2023 (2 commits)
  - Bruce MacDonald authored
  - Bruce MacDonald authored