Commits · eeb40a672c80f0cc06b08faaebec67b954202e4f · OpenDAS / ollama

31 Aug, 2023 2 commits
- fix list models for windows · eeb40a67
  Michael Yang authored Aug 30, 2023
  
- s/ListResponseModel/ModelResponse/ · 0f541a03
  Michael Yang authored Aug 30, 2023
  
30 Aug, 2023 2 commits

subprocess llama.cpp server (#401) · 42998d79

Bruce MacDonald authored Aug 30, 2023

* remove c code
* pack llama.cpp
* use request context for llama_cpp
* let llama_cpp decide the number of threads to use
* stop llama runner when app stops
* remove sample count and duration metrics
* use go generate to get libraries
* tmp dir for running llm


treat stop as stop sequences, not exact tokens (#442) · f4432e1d

Quinn Slack authored Aug 30, 2023

The `stop` option to the generate API is a list of sequences that should cause generation to stop. Although these are commonly called "stop tokens", they do not necessarily correspond to LLM tokens (per the LLM's tokenizer). For example, if the caller sends a generate request with `"stop":["\n"]`, then generation should stop on any token containing `\n` (and trim `\n` from the output), not just if the token exactly matches `\n`. If `stop` were interpreted strictly as LLM tokens, then it would require callers of the generate API to know the LLM's tokenizer and enumerate many tokens in the `stop` list.

Fixes https://github.com/jmorganca/ollama/issues/295.
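
The behavior described above can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: `truncateAtStop` and `generate` are hypothetical names, and the token slice stands in for whatever the model actually streams. The sketch assumes the output is cut at the earliest stop sequence found in the accumulated text, so a stop sequence that sits inside a larger token, or spans two tokens, still ends generation.

```go
package main

import (
	"fmt"
	"strings"
)

// truncateAtStop cuts text at the earliest occurrence of any stop sequence
// and reports whether one was found. Empty stop sequences are ignored.
func truncateAtStop(text string, stops []string) (string, bool) {
	cut := -1
	for _, s := range stops {
		if s == "" {
			continue
		}
		if i := strings.Index(text, s); i >= 0 && (cut < 0 || i < cut) {
			cut = i
		}
	}
	if cut < 0 {
		return text, false
	}
	return text[:cut], true
}

// generate is a hypothetical stand-in for a streaming loop: it appends each
// token's text to the output and stops as soon as the accumulated text
// contains a stop sequence, even if no single token equals that sequence.
func generate(tokens []string, stops []string) string {
	var out strings.Builder
	for _, tok := range tokens {
		out.WriteString(tok)
		if trimmed, found := truncateAtStop(out.String(), stops); found {
			return trimmed
		}
	}
	return out.String()
}

func main() {
	// The third token contains "\n" but is not exactly "\n".
	tokens := []string{"Hello", " world", ".\n\n", "Next"}
	fmt.Printf("%q\n", generate(tokens, []string{"\n"})) // prints "Hello world."
}
```

Matching against the accumulated text rather than individual tokens is what lets callers pass plain strings in `stop` without knowing the tokenizer.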


29 Aug, 2023 1 commit
- add model IDs (#439) · 8bbff2df
  Patrick Devine authored Aug 28, 2023
  
28 Aug, 2023 4 commits
- remove unused parameter · 16b06699
  Michael Yang authored Aug 28, 2023
  
- loosen http status code checks · 246dc654
  Michael Yang authored Aug 26, 2023
  
- chunked pipe · 865fceb7
  Michael Yang authored Aug 26, 2023
  
- bump chunk size to 95MB · 72266c76
  Michael Yang authored Aug 25, 2023
  
26 Aug, 2023 1 commit
- set default template · 59734ca2
  Michael Yang authored Aug 26, 2023
  
22 Aug, 2023 7 commits
- remove unused requestContextKey · 32d1a000
  Michael Yang authored Aug 22, 2023
  
- move upload funcs to upload.go · 04e21282
  Michael Yang authored Aug 22, 2023
  
- use url.URL · 2cc63468
  Michael Yang authored Aug 21, 2023
  
- build release mode · 95187d7e
  Michael Yang authored Aug 22, 2023
  
- add version · 2c7f956b
  Michael Yang authored Aug 21, 2023
  
- fix `FROM` instruction erroring when referring to a file · a9f6c566
  Jeffrey Morgan authored Aug 22, 2023
  
- Strip protocol from model path (#377) · 0a892419
  Ryan Baker authored Aug 21, 2023
  
18 Aug, 2023 2 commits

retry on unauthorized chunk push · 3b49315f

Michael Yang authored Aug 18, 2023

The token printed for authorized requests has a lifetime of 1h. If an
upload exceeds 1h, a chunk push will fail since the token is created on
a "start upload" request.

This replaces the Pipe with SectionReader which is simpler and
implements Seek, a requirement for makeRequestWithRetry. This is
slightly worse than using a Pipe since the progress update is directly
tied to the chunk size instead of controlled separately.


copy metadata from source · 7eda70f2

Michael Yang authored Aug 17, 2023


17 Aug, 2023 4 commits
- fmt · 086449b6
  Michael Yang authored Aug 17, 2023
  
- fix push manifest · 3cbc6a5c
  Michael Yang authored Aug 17, 2023
  
- model and file type as strings · a894cc79
  Michael Yang authored Aug 17, 2023
  
- set the scopes correctly (#368) · 14220d98
  Patrick Devine authored Aug 16, 2023
  
16 Aug, 2023 3 commits
- reimplement chunked uploads · 5dfe91be
  Michael Yang authored Aug 14, 2023
  
- push: retry on unauthorized · 9f944c00
  Michael Yang authored Aug 16, 2023
  
- images: remove body copies · 56e87cec
  Michael Yang authored Aug 16, 2023
  
15 Aug, 2023 3 commits
- retry download on network errors · f0d7c2f5
  Bruce MacDonald authored Aug 15, 2023
  
- use loaded llm for embeddings · 326de489
  Bruce MacDonald authored Aug 15, 2023
  
- dont log fatal · 18f2cb04
  Bruce MacDonald authored Aug 15, 2023
  
14 Aug, 2023 6 commits
- close open files · e26085b9
  Michael Yang authored Aug 14, 2023
  
- cross repo mount · f594c8eb
  Michael Yang authored Aug 14, 2023
  
- always remove from in progress map on download · f020e1d5
  Bruce MacDonald authored Aug 14, 2023
  
- use file info for embeddings cache · 2c8b680b
  Bruce MacDonald authored Aug 14, 2023
  
- use model bin digest for embed digest · 99b6b600
  Bruce MacDonald authored Aug 14, 2023
  
- do not regenerate embeddings · e9a9580b
  Bruce MacDonald authored Aug 14, 2023
  - re-use previously evaluated embeddings when possible
  - change embeddings digest identifier to be based on model name and embedded file path
11 Aug, 2023 3 commits
- add maximum retries when pushing (#334) · d9cf18e2
  Patrick Devine authored Aug 11, 2023
  
- create `.ollama` directory if it doesnt exist · 1556162c
  Jeffrey Morgan authored Aug 11, 2023
  
- create `.ollama` directory if it doesnt exist · 148f0225
  Jeffrey Morgan authored Aug 11, 2023
  
10 Aug, 2023 2 commits
- Token auth (#314) · be989d89
  Patrick Devine authored Aug 10, 2023
  
- clean up cli flags · 040a5b97
  Jeffrey Morgan authored Aug 10, 2023
  