- 01 Sep, 2023 1 commit
Quinn Slack authored
The `html/template` package automatically HTML-escapes interpolated strings in templates. This behavior is undesirable because it causes prompts like `<h1>hello` to be escaped to `&lt;h1&gt;hello` before being passed to the LLM. The included test case passes, but before the code change, it failed:

```
--- FAIL: TestModelPrompt
    images_test.go:21: got "a&lt;h1&gt;b", want "a<h1>b"
```
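The difference boils down to `html/template` versus `text/template`, which share an API but differ in escaping. A minimal sketch of the behavior described above (the template `a{{.}}b` is illustrative, not the actual ollama prompt template):

```go
package main

import (
	"bytes"
	"fmt"
	htmltemplate "html/template"
	texttemplate "text/template"
)

// renderHTML interpolates prompt with html/template, which HTML-escapes
// interpolated strings in a text context.
func renderHTML(prompt string) string {
	t := htmltemplate.Must(htmltemplate.New("p").Parse("a{{.}}b"))
	var buf bytes.Buffer
	_ = t.Execute(&buf, prompt)
	return buf.String()
}

// renderText interpolates prompt with text/template, which passes
// strings through unmodified.
func renderText(prompt string) string {
	t := texttemplate.Must(texttemplate.New("p").Parse("a{{.}}b"))
	var buf bytes.Buffer
	_ = t.Execute(&buf, prompt)
	return buf.String()
}

func main() {
	fmt.Println(renderHTML("<h1>")) // a&lt;h1&gt;b
	fmt.Println(renderText("<h1>")) // a<h1>b
}
```

Switching the import to `text/template` preserves the prompt byte-for-byte, which is what an LLM expects.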
- 31 Aug, 2023 2 commits

Michael Yang authored

Michael Yang authored
- 30 Aug, 2023 2 commits
Bruce MacDonald authored
* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm
Quinn Slack authored
The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list. Fixes https://github.com/jmorganca/ollama/issues/295.
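The substring interpretation described above can be sketched as follows. This is a hypothetical helper, not the actual ollama code: it scans the accumulated output for each stop sequence, truncates at the earliest match, and reports whether generation should stop.

```go
package main

import (
	"fmt"
	"strings"
)

// truncateAtStop cuts output at the earliest occurrence of any stop
// sequence, treating stops as substrings of the generated text rather
// than as whole LLM tokens. It returns the trimmed output and whether
// a stop sequence was found.
func truncateAtStop(output string, stop []string) (string, bool) {
	cut := -1
	for _, s := range stop {
		if i := strings.Index(output, s); i >= 0 && (cut < 0 || i < cut) {
			cut = i
		}
	}
	if cut < 0 {
		return output, false
	}
	return output[:cut], true
}

func main() {
	out, stopped := truncateAtStop("hello\nworld", []string{"\n"})
	fmt.Printf("%q %v\n", out, stopped) // "hello" true
}
```

With this approach, `"stop":["\n"]` halts generation on any token that contains a newline, and the newline itself is trimmed from the returned output.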
- 29 Aug, 2023 1 commit

Patrick Devine authored
- 28 Aug, 2023 2 commits

Michael Yang authored

Michael Yang authored
- 26 Aug, 2023 1 commit

Michael Yang authored
- 22 Aug, 2023 6 commits

Michael Yang authored

Michael Yang authored

Michael Yang authored

Michael Yang authored

Jeffrey Morgan authored

Ryan Baker authored
- 18 Aug, 2023 2 commits

Michael Yang authored
The token provided for authorized requests has a lifetime of 1h. If an upload exceeds 1h, a chunk push will fail, since the token is created on a "start upload" request.

This replaces the Pipe with a SectionReader, which is simpler and implements Seek, a requirement for makeRequestWithRetry. This is slightly worse than using a Pipe, since the progress update is tied directly to the chunk size instead of being controlled separately.
Michael Yang authored
- 17 Aug, 2023 3 commits

Michael Yang authored

Michael Yang authored

Michael Yang authored
- 16 Aug, 2023 3 commits

Michael Yang authored

Michael Yang authored

Michael Yang authored
- 15 Aug, 2023 3 commits

Bruce MacDonald authored

Bruce MacDonald authored

Bruce MacDonald authored
- 14 Aug, 2023 5 commits

Michael Yang authored

Michael Yang authored

Bruce MacDonald authored

Bruce MacDonald authored

Bruce MacDonald authored
- re-use previously evaluated embeddings when possible
- change embeddings digest identifier to be based on model name and embedded file path
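The second point could be sketched as below, assuming a SHA-256 digest over the model name and file path; the hash choice and key format are assumptions for illustration, not taken from the commit.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// embeddingDigest derives a cache key for an embedding from the model
// name and the embedded file's path, so repeated requests for the same
// model/file pair can re-use previously computed embeddings.
// (Hypothetical helper; the actual key derivation may differ.)
func embeddingDigest(model, path string) string {
	sum := sha256.Sum256([]byte(model + ":" + path))
	return hex.EncodeToString(sum[:])
}

func main() {
	fmt.Println(embeddingDigest("llama2", "docs/a.txt"))
}
```

Keying on model name plus file path makes the digest stable across requests, which is what allows previously evaluated embeddings to be re-used.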
- 11 Aug, 2023 1 commit

Patrick Devine authored
- 10 Aug, 2023 3 commits

Patrick Devine authored

Michael Yang authored

Michael Yang authored
- 09 Aug, 2023 3 commits

Bruce MacDonald authored

Bruce MacDonald authored

Bruce MacDonald authored
- 08 Aug, 2023 2 commits

Bruce MacDonald authored
- defer closing llm on embedding
- do not override licenses
- remove debugging print line
- reformat model file docs
Bruce MacDonald authored