1. 06 Sep, 2023 1 commit
  2. 05 Sep, 2023 2 commits
    • fix parameter inheritance · 06ef90c0
      Michael Yang authored
      parameters are not inherited because they are processed differently
      from other layers. fix this by explicitly merging the inherited params
      into the new params. parameter values defined in the new modelfile
      override those defined in the inherited modelfile. array values are
      replaced instead of appended.
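      A minimal Go sketch of the merge described above (hypothetical names,
      not the project's actual code): inherited values are copied first, new
      values override them, and slice values are replaced wholesale.

      ```go
      package main

      import "fmt"

      // mergeParams copies the inherited parameters, then overlays the
      // params from the new Modelfile. Because the overlay assigns whole
      // values, an inherited slice is replaced, not appended to.
      func mergeParams(inherited, updated map[string]any) map[string]any {
      	merged := make(map[string]any, len(inherited)+len(updated))
      	for k, v := range inherited {
      		merged[k] = v
      	}
      	for k, v := range updated {
      		merged[k] = v
      	}
      	return merged
      }

      func main() {
      	base := map[string]any{"temperature": 0.8, "stop": []string{"###"}}
      	child := map[string]any{"stop": []string{"<|end|>"}}
      	fmt.Println(mergeParams(base, child))
      	// map[stop:[<|end|>] temperature:0.8]
      }
      ```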
    • use slices.DeleteFunc · e9f6df7d
      Michael Yang authored
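      The commit has no body, but the stdlib behavior is straightforward; a
      small sketch of slices.DeleteFunc (Go 1.21+), with an assumed example
      input since the actual call site is not shown here:

      ```go
      package main

      import (
      	"fmt"
      	"slices"
      )

      func main() {
      	// slices.DeleteFunc removes every element for which the predicate
      	// returns true, replacing a hand-written filter loop.
      	names := []string{"model", "", "template", ""}
      	names = slices.DeleteFunc(names, func(s string) bool { return s == "" })
      	fmt.Println(names) // [model template]
      }
      ```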
  3. 03 Sep, 2023 1 commit
  4. 01 Sep, 2023 1 commit
    • do not HTML-escape prompt · 62d29b21
      Quinn Slack authored
      The `html/template` package automatically HTML-escapes interpolated strings in templates. This behavior is undesirable because it causes prompts like `<h1>hello` to be escaped to `&lt;h1&gt;hello` before being passed to the LLM.
      
      The included test case passes, but before the code change, it failed:
      
      ```
      --- FAIL: TestModelPrompt
          images_test.go:21: got "a&lt;h1&gt;b", want "a<h1>b"
      ```
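      The fix is presumably a switch from html/template to text/template,
      which performs no escaping; a self-contained sketch of the difference:

      ```go
      package main

      import (
      	htmltemplate "html/template"
      	"os"
      	texttemplate "text/template"
      )

      func main() {
      	prompt := "a<h1>b"

      	// html/template escapes interpolated strings for safe HTML output.
      	htmltemplate.Must(htmltemplate.New("t").Parse("{{.}}\n")).Execute(os.Stdout, prompt)
      	// a&lt;h1&gt;b

      	// text/template leaves the prompt verbatim for the LLM.
      	texttemplate.Must(texttemplate.New("t").Parse("{{.}}\n")).Execute(os.Stdout, prompt)
      	// a<h1>b
      }
      ```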
  5. 31 Aug, 2023 4 commits
  6. 30 Aug, 2023 2 commits
    • subprocess llama.cpp server (#401) · 42998d79
      Bruce MacDonald authored
      * remove c code
      * pack llama.cpp
      * use request context for llama_cpp
      * let llama_cpp decide the number of threads to use
      * stop llama runner when app stops
      * remove sample count and duration metrics
      * use go generate to get libraries
      * tmp dir for running llm
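      A rough sketch of the subprocess approach, assuming a hypothetical
      binary path and flag (this is not the project's actual runner code):
      the server is tied to a context so it stops when the app or request
      does.

      ```go
      package main

      import (
      	"context"
      	"log"
      	"os"
      	"os/exec"
      )

      // runLlamaServer launches a llama.cpp server binary as a subprocess.
      // exec.CommandContext kills the process when ctx is cancelled, so the
      // runner stops when the app shuts down or the request context ends.
      func runLlamaServer(ctx context.Context, binPath, modelPath string) error {
      	cmd := exec.CommandContext(ctx, binPath, "--model", modelPath)
      	cmd.Stdout = os.Stdout
      	cmd.Stderr = os.Stderr
      	if err := cmd.Start(); err != nil {
      		return err
      	}
      	return cmd.Wait()
      }

      func main() {
      	ctx, cancel := context.WithCancel(context.Background())
      	defer cancel()
      	if err := runLlamaServer(ctx, "./llama-server", "./model.gguf"); err != nil {
      		log.Fatal(err)
      	}
      }
      ```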
    • treat stop as stop sequences, not exact tokens (#442) · f4432e1d
      Quinn Slack authored
      The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.
      
      Fixes https://github.com/jmorganca/ollama/issues/295.
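      A minimal sketch of substring-based stop handling (hypothetical helper,
      not the project's exact code): the accumulated output, not individual
      tokens, is scanned for each stop sequence, and the match is trimmed.

      ```go
      package main

      import (
      	"fmt"
      	"strings"
      )

      // truncateAtStop returns the output cut at the first occurrence of any
      // stop sequence, and whether a stop sequence was found. This works even
      // when the tokenizer emits the sequence inside a larger token.
      func truncateAtStop(output string, stops []string) (string, bool) {
      	for _, stop := range stops {
      		if i := strings.Index(output, stop); i >= 0 {
      			return output[:i], true
      		}
      	}
      	return output, false
      }

      func main() {
      	out, stopped := truncateAtStop("hello\nworld", []string{"\n"})
      	fmt.Printf("%q %v\n", out, stopped) // "hello" true
      }
      ```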
  7. 29 Aug, 2023 1 commit
  8. 28 Aug, 2023 4 commits
  9. 26 Aug, 2023 1 commit
  10. 22 Aug, 2023 7 commits
  11. 18 Aug, 2023 2 commits
    • retry on unauthorized chunk push · 3b49315f
      Michael Yang authored
      The token issued for authorized requests has a lifetime of 1h. If an
      upload exceeds 1h, a chunk push will fail since the token is created
      on the initial "start upload" request.
      
      This replaces the Pipe with a SectionReader, which is simpler and
      implements Seek, a requirement for makeRequestWithRetry. This is
      slightly worse than using a Pipe since the progress update is tied
      directly to the chunk size instead of being controlled separately.
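      A sketch of why Seek matters here (hypothetical helper, simplified from
      the description above): a SectionReader can be rewound to the start of
      the chunk and resent after a fresh token is obtained.

      ```go
      package main

      import (
      	"bytes"
      	"fmt"
      	"io"
      )

      // pushChunkWithRetry rewinds the chunk before each attempt. Rewinding
      // requires Seek, which io.SectionReader implements and a Pipe does not.
      func pushChunkWithRetry(chunk *io.SectionReader, push func(io.Reader) error) error {
      	var err error
      	for attempt := 0; attempt < 2; attempt++ {
      		if _, err = chunk.Seek(0, io.SeekStart); err != nil {
      			return err
      		}
      		if err = push(chunk); err == nil {
      			return nil
      		}
      		// On a 401 the real flow would obtain a fresh token before retrying.
      	}
      	return err
      }

      func main() {
      	blob := bytes.NewReader([]byte("layer data"))
      	chunk := io.NewSectionReader(blob, 0, 5) // one 5-byte chunk of the blob
      	err := pushChunkWithRetry(chunk, func(r io.Reader) error {
      		b, _ := io.ReadAll(r)
      		fmt.Printf("pushing %q\n", b)
      		return nil
      	})
      	fmt.Println("err:", err)
      }
      ```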
    • copy metadata from source · 7eda70f2
      Michael Yang authored
  12. 17 Aug, 2023 4 commits
  13. 16 Aug, 2023 3 commits
  14. 15 Aug, 2023 3 commits
  15. 14 Aug, 2023 4 commits