Commits · e9f6df7dca85abfe473d5b74c25cb45286746483 · OpenDAS / ollama

05 Sep, 2023 2 commits
- use slices.DeleteFunc · e9f6df7d
  Michael Yang authored Sep 02, 2023
  
- generate binary dependencies based on GOARCH on macos (#459) · 7fa6e516
  Jeffrey Morgan authored Sep 05, 2023
  
02 Sep, 2023 1 commit
- Merge pull request #457 from sqs/dont-html-escape-prompt · adaa1308
  Michael Yang authored Sep 01, 2023
```
do not HTML-escape prompt
```
01 Sep, 2023 4 commits

do not HTML-escape prompt · 62d29b21
Quinn Slack authored Sep 01, 2023

The `html/template` package automatically HTML-escapes strings interpolated into templates. This is undesirable for prompts: a prompt like `<h1>hello` is escaped to `&lt;h1&gt;hello` before being passed to the LLM.

The included test case now passes; before the code change, it failed:

```
--- FAIL: TestModelPrompt
    images_test.go:21: got "a&lt;h1&gt;b", want "a<h1>b"
```

update readme (#451) · ed19d10a
Michael Yang authored Sep 01, 2023
```
* update readme

* readme: more run examples
```
Merge pull request #450 from jmorganca/mxyng/update-readme · 36c2f45c
Michael Yang authored Sep 01, 2023
```
update readme
```
update readme · 74222662
Michael Yang authored Sep 01, 2023


31 Aug, 2023 9 commits
- Merge pull request #273 from jmorganca/matt/moreexamples · 6bb8a16c
  Matt Williams authored Aug 31, 2023
```
Create a sentiments example
```
- app: don't package `ggml-metal.metal` · a5dbcf2e
  Jeffrey Morgan authored Aug 31, 2023
  
- Merge pull request #443 from jmorganca/mxyng/fix-list-models · 9304f0e7
  Michael Yang authored Aug 31, 2023
```
windows: fix filepath bugs
```
- Merge pull request #448 from callmephilip/patch-1 · 6578b2f8
  Michael Yang authored Aug 31, 2023
```
fix spelling errors in example prompts
```
- windows: fix create modelfile · 1c8fd627
  Michael Yang authored Aug 30, 2023
  
- windows: fix delete · ae950b00
  Michael Yang authored Aug 30, 2023
  
- fix list models for windows · eeb40a67
  Michael Yang authored Aug 30, 2023
  
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
- fix spelling errors in prompt · 1363f537
  Philip Nuzhnyi authored Aug 31, 2023
  
30 Aug, 2023 6 commits

update `README.md` · bc3e21fd
Jeffrey Morgan authored Aug 30, 2023

update docs for subprocess · a82eb275
Jeffrey Morgan authored Aug 30, 2023

remove test not applicable to subprocess · f964aea9
Bruce MacDonald authored Aug 30, 2023


subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm


treat stop as stop sequences, not exact tokens (#442) · f4432e1d

Quinn Slack authored Aug 30, 2023

The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.

Fixes https://github.com/jmorganca/ollama/issues/295.


Merge pull request #428 from jmorganca/mxyng/upload-chunks · 982c5354
Michael Yang authored Aug 30, 2023
```
update upload chunks
```

29 Aug, 2023 2 commits
- Merge pull request #421 from jmorganca/mxyng/f16-metal · 7df342a6
  Michael Yang authored Aug 29, 2023
```
allow F16 to use metal
```
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
28 Aug, 2023 4 commits
- remove unused parameter · 16b06699
  Michael Yang authored Aug 28, 2023
  
- loosen http status code checks · 246dc654
  Michael Yang authored Aug 26, 2023
  
- chunked pipe · 865fceb7
  Michael Yang authored Aug 26, 2023
  
- bump chunk size to 95MB · 72266c76
  Michael Yang authored Aug 25, 2023
  
27 Aug, 2023 2 commits
- update `orca` to `orca-mini` · d3b838ce
  Jeffrey Morgan authored Aug 27, 2023
  
- Merge pull request #412 from jmorganca/mxyng/update-readme · e639a12f
  Michael Yang authored Aug 26, 2023
```
update README.md
```
26 Aug, 2023 9 commits
- Merge pull request #420 from jmorganca/mxyng/34b-mem-check · e82fcf30
  Michael Yang authored Aug 26, 2023
```
add 34b to mem check
```
- Merge pull request #426 from jmorganca/default-template · 495e8b0a
  Michael Yang authored Aug 26, 2023
```
set default template
```
- set default template · 59734ca2
  Michael Yang authored Aug 26, 2023
  
- default host to `127.0.0.1`, fixes #424 · 22ab7f5f
  Jeffrey Morgan authored Aug 26, 2023
  
- allow F16 to use metal · b25dd179
  Michael Yang authored Aug 26, 2023
```
Warning: F16 uses significantly more memory than a quantized model, so the
standard requirements don't apply.
```
- add 34b to mem check · 304f2b6c
  Michael Yang authored Aug 26, 2023
  
- delete all models (not just 1st) in `ollama rm` (#415) · 2ecc3a33
  Quinn Slack authored Aug 26, 2023
```
Previously, `ollama rm model1 model2 modelN` would only delete `model1`. The other model command-line arguments would be silently ignored. Now, all models mentioned are deleted.
```
- add `codellama` to model list in readme · ee6e1df1
  Jeffrey Morgan authored Aug 25, 2023
  
- add missing entries for 34B · 177b69a2
  Jeffrey Morgan authored Aug 25, 2023
  
25 Aug, 2023 1 commit
- Merge pull request #411 from jmorganca/mxyng/34b · dad63f08
  Michael Yang authored Aug 25, 2023
```
patch llama.cpp for 34B
```