Commits · e53bc57d4de0ff0b3326bd52db6737b528ebf052 · OpenDAS / ollama

15 Sep, 2023 2 commits
- split uploadBlobChunked · e53bc57d
  Michael Yang authored Sep 14, 2023
  
  e53bc57d
- implement ProgressWriter · f0b398d1
  Michael Yang authored Sep 14, 2023
  
  f0b398d1
14 Sep, 2023 1 commit

Michael Yang authored Sep 14, 2023

This informs the HTTP client that the content length is known and disables
chunked Transfer-Encoding.

daa4f096
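
The effect described in this commit message can be illustrated with Go's `net/http`. This is a minimal sketch, not the code from the commit: `newUploadRequest`, `opaqueReader`, and the URL are hypothetical names chosen for illustration. It assumes the body is a reader type that `http.NewRequest` cannot size on its own, so `ContentLength` has to be set explicitly for the client to send a `Content-Length` header instead of using chunked transfer encoding.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"net/http"
)

// opaqueReader wraps data in a reader type that http.NewRequest does not
// recognize, so the request length is not detected automatically.
// (Hypothetical helper for this sketch.)
func opaqueReader(data []byte) io.Reader {
	return io.LimitReader(bytes.NewReader(data), int64(len(data)))
}

// newUploadRequest builds a PUT request for a body whose size is known up
// front. The name and signature are assumptions, not taken from the commit.
func newUploadRequest(url string, body io.Reader, size int64) (*http.Request, error) {
	req, err := http.NewRequest(http.MethodPut, url, body)
	if err != nil {
		return nil, err
	}
	// With a non-nil body and ContentLength == 0 the length counts as
	// unknown and the client falls back to chunked transfer encoding.
	// Setting the field tells the client the exact size.
	req.ContentLength = size
	return req, nil
}

func main() {
	data := []byte("hello")
	req, err := newUploadRequest("http://example.invalid/upload", opaqueReader(data), int64(len(data)))
	if err != nil {
		panic(err)
	}
	// No network call is made here; only the request fields are inspected.
	fmt.Println("content length:", req.ContentLength)
}
```

Setting the length on the request, rather than buffering the body into a `bytes.Buffer`, keeps large uploads streaming from their source.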

13 Sep, 2023 1 commit
- remove unused · e6881cab
  Michael Yang authored Sep 13, 2023
  
  e6881cab
12 Sep, 2023 3 commits
- fix model type for 70b · 0c5a4543
  Michael Yang authored Sep 12, 2023
  
  0c5a4543
- fix falcon decode · 7dee25a0
  Michael Yang authored Sep 12, 2023
```
get model and file type from bin file
```
  7dee25a0
- first pass at linux gpu support (#454) · f2216370
  Bruce MacDonald authored Sep 12, 2023
```
* linux gpu support
* handle multiple gpus
* add cuda docker image (#488)
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
```
  f2216370
11 Sep, 2023 2 commits
- create the blobs directory correctly (#508) · 45ac07cd
  Patrick Devine authored Sep 11, 2023
  
  45ac07cd
- add autoprune to remove unused layers (#491) · e7e91cd7
  Patrick Devine authored Sep 11, 2023
  
  e7e91cd7
09 Sep, 2023 1 commit
- add model format to config layer (#497) · 3920e153
  Jeffrey Morgan authored Sep 09, 2023
  
  3920e153
08 Sep, 2023 1 commit
- fix nil pointer dereference · de227b62
  Michael Yang authored Sep 07, 2023
  
  de227b62
07 Sep, 2023 3 commits
- fix retry on unauthorized chunk · bf146fb0
  Michael Yang authored Sep 07, 2023
  
  bf146fb0
- fix get auth token · f0f49435
  Michael Yang authored Sep 07, 2023
  
  f0f49435
- GGUF support (#441) · 09dd2aef
  Bruce MacDonald authored Sep 07, 2023
  
  09dd2aef
06 Sep, 2023 3 commits
- fix model manifests (#477) · 83c6be16
  Michael Yang authored Sep 06, 2023
  
  83c6be16
- add show command (#474) · 790d24eb
  Patrick Devine authored Sep 06, 2023
  
  790d24eb
- create manifests directory · a1ecdd36
  Michael Yang authored Sep 05, 2023
  
  a1ecdd36
05 Sep, 2023 2 commits

fix parameter inheritance · 06ef90c0

Michael Yang authored Sep 05, 2023

parameters were not inherited because they are processed differently from
other layers. fix this by explicitly merging the inherited params into
the new params. parameter values defined in the new modelfile
override those defined in the inherited modelfile. list values are
replaced instead of appended

06ef90c0
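
The merge rule in this message (new values override inherited ones, and list values are replaced rather than appended) can be sketched as follows. This is an illustration under stated assumptions, not the commit's code: `mergeParams` is a hypothetical name, and parameters are modeled as `map[string][]string`, which may differ from the project's actual types.

```go
package main

import "fmt"

// mergeParams returns a new map holding the inherited parameters overlaid
// with the current ones. Hypothetical helper for this sketch.
func mergeParams(inherited, current map[string][]string) map[string][]string {
	merged := make(map[string][]string, len(inherited)+len(current))
	for name, values := range inherited {
		// Copy so the result does not alias the caller's slices.
		merged[name] = append([]string(nil), values...)
	}
	for name, values := range current {
		// Replace the inherited list entirely; do not append to it.
		merged[name] = append([]string(nil), values...)
	}
	return merged
}

func main() {
	inherited := map[string][]string{
		"temperature": {"0.8"},
		"stop":        {"<s>", "</s>"},
	}
	current := map[string][]string{
		"stop": {"User:"},
	}
	merged := mergeParams(inherited, current)
	fmt.Println("temperature:", merged["temperature"])
	fmt.Println("stop:", merged["stop"])
}
```

Here `temperature` is carried over from the inherited set, while `stop` contains only the value from the new set.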

- use slices.DeleteFunc · e9f6df7d
  Michael Yang authored Sep 02, 2023
  
  e9f6df7d

03 Sep, 2023 1 commit
- fix num_keep · 681f3c4c
  Michael Yang authored Sep 03, 2023
  
  681f3c4c
01 Sep, 2023 1 commit

do not HTML-escape prompt · 62d29b21

Quinn Slack authored Sep 01, 2023

The `html/template` package automatically HTML-escapes interpolated strings in templates. This behavior is undesirable because it causes prompts like `<h1>hello` to be escaped to `&lt;h1&gt;hello` before being passed to the LLM.

The included test case passes, but before the code change, it failed:

```
--- FAIL: TestModelPrompt
    images_test.go:21: got "a&lt;h1&gt;b", want "a<h1>b"
```

62d29b21
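
The escaping behavior described in this message can be reproduced by rendering the same template with `html/template` and `text/template`. This is a minimal sketch: `renderHTML` and `renderText` are hypothetical helpers, and using `text/template` is shown as one way to avoid the escaping, without claiming it is exactly what the commit does.

```go
package main

import (
	"bytes"
	"fmt"
	htmltemplate "html/template"
	texttemplate "text/template"
)

// renderHTML executes tmpl with html/template, which HTML-escapes
// interpolated values.
func renderHTML(tmpl, prompt string) (string, error) {
	t, err := htmltemplate.New("prompt").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := t.Execute(&buf, map[string]string{"Prompt": prompt}); err != nil {
		return "", err
	}
	return buf.String(), nil
}

// renderText executes tmpl with text/template, which inserts values verbatim.
func renderText(tmpl, prompt string) (string, error) {
	t, err := texttemplate.New("prompt").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var buf bytes.Buffer
	if err := t.Execute(&buf, map[string]string{"Prompt": prompt}); err != nil {
		return "", err
	}
	return buf.String(), nil
}

func main() {
	const tmpl = "a{{ .Prompt }}b"
	escaped, err := renderHTML(tmpl, "<h1>")
	if err != nil {
		panic(err)
	}
	raw, err := renderText(tmpl, "<h1>")
	if err != nil {
		panic(err)
	}
	fmt.Println(escaped) // a&lt;h1&gt;b
	fmt.Println(raw)     // a<h1>b
}
```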

31 Aug, 2023 4 commits
- windows: fix create modelfile · 1c8fd627
  Michael Yang authored Aug 30, 2023
  
  1c8fd627
- windows: fix delete · ae950b00
  Michael Yang authored Aug 30, 2023
  
  ae950b00
- fix list models for windows · eeb40a67
  Michael Yang authored Aug 30, 2023
  
  eeb40a67
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
  0f541a03
30 Aug, 2023 2 commits

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm

42998d79

treat stop as stop sequences, not exact tokens (#442) · f4432e1d

Quinn Slack authored Aug 30, 2023

The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.

Fixes https://github.com/jmorganca/ollama/issues/295.

f4432e1d
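
The difference between matching stop sequences and matching exact tokens can be sketched like this. It is an illustration only, not the commit's implementation: `truncateAtStop` and `generate` are hypothetical names, and the token stream is simulated with a slice of strings.

```go
package main

import (
	"fmt"
	"strings"
)

// truncateAtStop cuts text at the earliest occurrence of any stop sequence
// and reports whether one was found. The stop sequence itself is trimmed.
func truncateAtStop(text string, stops []string) (string, bool) {
	cut := -1
	for _, stop := range stops {
		if stop == "" {
			continue // an empty sequence would match everywhere
		}
		if i := strings.Index(text, stop); i >= 0 && (cut < 0 || i < cut) {
			cut = i
		}
	}
	if cut < 0 {
		return text, false
	}
	return text[:cut], true
}

// generate appends simulated tokens one at a time and stops as soon as the
// accumulated output contains a stop sequence, even when that sequence is
// only part of a token.
func generate(tokens, stops []string) string {
	var out strings.Builder
	for _, tok := range tokens {
		out.WriteString(tok)
		if text, stopped := truncateAtStop(out.String(), stops); stopped {
			return text
		}
	}
	return out.String()
}

func main() {
	// The third token contains "\n" but is not equal to it, so an exact
	// token comparison would not stop here.
	tokens := []string{"Hello", " world", ".\nNext", " line"}
	fmt.Printf("%q\n", generate(tokens, []string{"\n"}))
}
```

Checking the accumulated text rather than individual tokens also catches a stop sequence that spans a token boundary.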

29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
  8bbff2df
28 Aug, 2023 4 commits
- remove unused parameter · 16b06699
  Michael Yang authored Aug 28, 2023
  
  16b06699
- loosen http status code checks · 246dc654
  Michael Yang authored Aug 26, 2023
  
  246dc654
- chunked pipe · 865fceb7
  Michael Yang authored Aug 26, 2023
  
  865fceb7
- bump chunk size to 95MB · 72266c76
  Michael Yang authored Aug 25, 2023
  
  72266c76
26 Aug, 2023 1 commit
- set default template · 59734ca2
  Michael Yang authored Aug 26, 2023
  
  59734ca2
22 Aug, 2023 7 commits
- remove unused requestContextKey · 32d1a000
  Michael Yang authored Aug 22, 2023
  
  32d1a000
- move upload funcs to upload.go · 04e21282
  Michael Yang authored Aug 22, 2023
  
  04e21282
- use url.URL · 2cc63468
  Michael Yang authored Aug 21, 2023
  
  2cc63468
- build release mode · 95187d7e
  Michael Yang authored Aug 22, 2023
  
  95187d7e
- add version · 2c7f956b
  Michael Yang authored Aug 21, 2023
  
  2c7f956b
- fix `FROM` instruction erroring when referring to a file · a9f6c566
  Jeffrey Morgan authored Aug 22, 2023
  
  a9f6c566
- Strip protocol from model path (#377) · 0a892419
  Ryan Baker authored Aug 21, 2023
  
  0a892419