Commits · 69eb06c40ec22fe002cfbe1d52b560fce0dcddba · OpenDAS / ollama

07 Aug, 2024 2 commits

manifest: Fix crash on startup when trying to clean up unused files (#5840) · 1829fb61

Jesse Gross authored Aug 05, 2024

Currently if the config field is missing in the manifest file (or
corrupted), Ollama will crash when it tries to read it. This can
happen at startup or when pulling new models.

This data is mostly just used for showing model information so we
can be tolerant of it not being present - it is not required to
run the models. Besides avoiding crashing, this also gives us the
ability to restructure the config in the future by pulling it
into the main manifest file.

1829fb61
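
The tolerant read described in this commit can be sketched roughly as follows. This is a minimal illustration, not Ollama's code: the `Manifest` and `Layer` types, their JSON field names, and the helpers `parseManifest` and `configDigest` are all hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Layer and Manifest are hypothetical, simplified stand-ins for the
// on-disk manifest format; they are not Ollama's actual types.
type Layer struct {
	MediaType string `json:"mediaType"`
	Digest    string `json:"digest"`
}

type Manifest struct {
	Config *Layer  `json:"config,omitempty"` // pointer, so "missing" is representable
	Layers []Layer `json:"layers"`
}

// parseManifest decodes a manifest and wraps any JSON error.
func parseManifest(data []byte) (*Manifest, error) {
	var m Manifest
	if err := json.Unmarshal(data, &m); err != nil {
		return nil, fmt.Errorf("parse manifest: %w", err)
	}
	return &m, nil
}

// configDigest returns the config digest, or "" and false when the
// config field is absent, instead of dereferencing a nil pointer.
func configDigest(m *Manifest) (string, bool) {
	if m == nil || m.Config == nil || m.Config.Digest == "" {
		return "", false
	}
	return m.Config.Digest, true
}

func main() {
	// A manifest with layers but no config field.
	m, err := parseManifest([]byte(`{"layers":[{"mediaType":"example/model","digest":"sha256:aaa"}]}`))
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	if d, ok := configDigest(m); ok {
		fmt.Println("config:", d)
	} else {
		fmt.Println("no config; model info unavailable, layers:", len(m.Layers))
	}
}
```

The point of the sketch is that callers get an explicit "not present" result and can carry on, since the config is only needed for display.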

manifest: Don't prune layers if we can't open a manifest file · 685a5353

Jesse Gross authored Aug 01, 2024

If there is an error when opening a manifest file (corrupted, permission denied, etc.)
then the referenced layers will not be included in the list of active
layers. This causes them to be deleted when pruning happens at startup
or a model is pulled.

In such a situation, we should prefer to preserve data in the hopes that
it can be recovered rather than being aggressive about deletion.

685a5353

06 Aug, 2024 1 commit

Ensure sparse files on windows during download · fc85f50a

Daniel Hiltgen authored Aug 06, 2024

The file.Truncate call on Windows will write the whole file
unless you set the sparse flag, leading to heavy I/O at the
beginning of the download. This should improve our
I/O behavior on Windows and put less stress on the user's disk.

fc85f50a

02 Aug, 2024 2 commits
- use testing tempdirs · a091fadf
  Michael Yang authored Aug 02, 2024
  
  a091fadf
- lint · b732beba
  Michael Yang authored Aug 01, 2024
  
  b732beba
01 Aug, 2024 5 commits
- Refactor and format code. · 8a9f946c
  Vyacheslav Moskalev authored Aug 02, 2024
  
  8a9f946c
- Refactor code. Remove extra variable. · 3b521054
  Vyacheslav Moskalev authored Aug 01, 2024
  
  3b521054
- Better types and naming closer to style. · b0c21658
  Vyacheslav Moskalev authored Aug 01, 2024
  
  b0c21658
- Change the order of context and prompt. · 49a54831
  Vyacheslav Moskalev authored Aug 01, 2024
  
  49a54831
- Fix extra context concatenation in generate handler (#5980). · 6bc5c137
  Vyacheslav Moskalev authored Aug 01, 2024
  
  6bc5c137
31 Jul, 2024 5 commits
- fix modelfile message quotes · d87b4a48
  Michael Yang authored Jul 31, 2024
  
  d87b4a48
- server: fix json marshalling of downloadBlobPart (#6108) · dc77bbcf
  Blake Mizerany authored Jul 31, 2024
  
  dc77bbcf
- convert: only extract large files · eafc607a
  Michael Yang authored Jun 29, 2024
  
  eafc607a
- comments · df993fa3
  Michael Yang authored Jul 08, 2024
  
  df993fa3
- refactor convert · 5e9db9fb
  Michael Yang authored May 31, 2024
  
  5e9db9fb
30 Jul, 2024 2 commits

Add Metrics to `api/embed` response (#5709) · 1b44d873

royjhan authored Jul 30, 2024

* add prompt tokens to embed response

* rm slog

* metrics

* types

* prompt n

* clean up

* reset submodule

* update tests

* test name

* list metrics

1b44d873

Prevent partial loading on mixed GPU brands · 34542099

Daniel Hiltgen authored Jul 22, 2024

In multi-brand GPU setups, if we couldn't fully load the model we
would fall through the scheduler and mistakenly try to load across
a mix of brands. This makes sure we find the set of GPU(s) that
best fits the partial load.

34542099
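
The idea of keeping a partial load within a single GPU brand might look like the sketch below. The `GPU` type, its fields, and `bestGroupForPartialLoad` are hypothetical, and "most total free memory" is an assumed stand-in for whatever fit heuristic the scheduler really uses.

```go
package main

import "fmt"

// GPU is a hypothetical, simplified device record; the field names are
// illustrative and not taken from Ollama's scheduler.
type GPU struct {
	ID         string
	Library    string // e.g. "cuda" or "rocm"
	FreeMemory uint64 // bytes
}

// bestGroupForPartialLoad groups GPUs by library and returns the group
// with the most total free memory, so a partial load never spans brands.
func bestGroupForPartialLoad(gpus []GPU) []GPU {
	groups := make(map[string][]GPU)
	var order []string // first-seen order, so ties resolve deterministically
	for _, g := range gpus {
		if _, seen := groups[g.Library]; !seen {
			order = append(order, g.Library)
		}
		groups[g.Library] = append(groups[g.Library], g)
	}

	var best []GPU
	var bestFree uint64
	for _, lib := range order {
		var free uint64
		for _, g := range groups[lib] {
			free += g.FreeMemory
		}
		if best == nil || free > bestFree {
			best, bestFree = groups[lib], free
		}
	}
	return best
}

func main() {
	gpus := []GPU{
		{ID: "0", Library: "cuda", FreeMemory: 8 << 30},
		{ID: "1", Library: "rocm", FreeMemory: 12 << 30},
		{ID: "2", Library: "cuda", FreeMemory: 8 << 30},
	}
	for _, g := range bestGroupForPartialLoad(gpus) {
		fmt.Println(g.Library, g.ID)
	}
}
```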

26 Jul, 2024 3 commits

server: fix race conditions during download (#5994) · 750c1c55

Blake Mizerany authored Jul 26, 2024

This fixes various data races scattered throughout the download/pull
client where the client was accessing the download state concurrently.

This commit is mostly a hot-fix and will be replaced by a new client one
day soon.

Also, remove the unnecessary opts argument from downloadChunk.

750c1c55
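
The class of bug this commit addresses, several goroutines touching shared download state, is commonly avoided by guarding that state with a lock. The sketch below shows the general technique only; `downloadState` and `downloadAll` are hypothetical and are not the commit's actual fix.

```go
package main

import (
	"fmt"
	"sync"
)

// downloadState is a hypothetical stand-in for shared per-download
// progress. All access goes through methods that hold the mutex.
type downloadState struct {
	mu        sync.Mutex
	completed int64
}

// add records n more completed bytes.
func (s *downloadState) add(n int64) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.completed += n
}

// snapshot returns the completed byte count at this moment.
func (s *downloadState) snapshot() int64 {
	s.mu.Lock()
	defer s.mu.Unlock()
	return s.completed
}

// downloadAll simulates one goroutine per part, each reporting its size
// to the shared state when it finishes.
func downloadAll(parts []int64) int64 {
	var s downloadState
	var wg sync.WaitGroup
	for _, size := range parts {
		wg.Add(1)
		go func(size int64) {
			defer wg.Done()
			s.add(size) // stands in for a chunk finishing
		}(size)
	}
	wg.Wait()
	return s.snapshot()
}

func main() {
	fmt.Println(downloadAll([]int64{10, 20, 30}))
}
```

Running code like this under `go run -race` is the usual way to confirm that no unsynchronized access remains.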

fix nil deref in auth.go · a622c47b

Michael Yang authored Jul 26, 2024

a622c47b

include modelfile messages · 15af5584

Michael Yang authored Jun 19, 2024

15af5584

25 Jul, 2024 1 commit

server: reuse original download URL for images (#5962) · c8af3c2d

Blake Mizerany authored Jul 25, 2024

This changes the registry client to reuse the original download URL
it gets on the first redirect response for all subsequent requests,
preventing thundering herd issues when hot new LLMs are released.

c8af3c2d
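
The redirect-reuse idea can be sketched with the standard `net/http` client: resolve the redirect once, then send every later request straight to the resolved URL. Everything below is an assumption made for illustration, including the helper names `resolveDirectURL`, `fetchParts`, and `demo`, the URL paths, and the use of local test servers in place of a registry and its storage backend.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"sync/atomic"
)

// resolveDirectURL requests the registry URL once without following the
// redirect, and returns the Location the registry points at.
func resolveDirectURL(registryURL string) (string, error) {
	client := &http.Client{
		CheckRedirect: func(*http.Request, []*http.Request) error {
			return http.ErrUseLastResponse // hand back the 3xx response itself
		},
	}
	resp, err := client.Get(registryURL)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	loc, err := resp.Location()
	if err != nil {
		return "", fmt.Errorf("no redirect from %s: %w", registryURL, err)
	}
	return loc.String(), nil
}

// fetchParts resolves the direct URL once and reuses it for every part,
// so the registry sees one request rather than one per part. A real
// client would also set a Range header per part; that is omitted here.
func fetchParts(registryURL string, parts int) error {
	direct, err := resolveDirectURL(registryURL)
	if err != nil {
		return err
	}
	for i := 0; i < parts; i++ {
		resp, err := http.Get(direct)
		if err != nil {
			return err
		}
		resp.Body.Close()
		if resp.StatusCode != http.StatusOK {
			return fmt.Errorf("part %d: unexpected status %s", i, resp.Status)
		}
	}
	return nil
}

// demo runs fetchParts against two local test servers and reports how
// many requests the stand-in registry received.
func demo(parts int) (int64, error) {
	var registryHits atomic.Int64
	storage := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, "blob bytes")
	}))
	defer storage.Close()
	registry := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		registryHits.Add(1)
		http.Redirect(w, r, storage.URL+"/blob", http.StatusTemporaryRedirect)
	}))
	defer registry.Close()

	err := fetchParts(registry.URL+"/blobs/example", parts)
	return registryHits.Load(), err
}

func main() {
	hits, err := demo(4)
	fmt.Println("registry requests:", hits, "error:", err)
}
```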

22 Jul, 2024 10 commits
- fix dupe err message (#5857) · db0968f3
  Josh authored Jul 22, 2024
  
  db0968f3
- comments · 85d9d73a
  Michael Yang authored Jul 08, 2024
  
  85d9d73a
- uint64 · 1954ec59
  Michael Yang authored Jul 03, 2024
  
  1954ec59
- int · 0f191012
  Michael Yang authored Jul 03, 2024
  
  0f191012
- keepalive · 8570c1c0
  Michael Yang authored Jul 03, 2024
  
  8570c1c0
- bool · 55cd3ddc
  Michael Yang authored Jul 03, 2024
  
  55cd3ddc
- models · 66fe77f0
  Michael Yang authored Jul 03, 2024
  
  66fe77f0
- origins · d1a5227c
  Michael Yang authored Jul 03, 2024
  
  d1a5227c
- rfc: dynamic environ lookup · 35b89b2e
  Michael Yang authored Jul 03, 2024
  
  35b89b2e
- server: collect nested tool call objects when parsing (#5824) · b3e5491e
  Jeffrey Morgan authored Jul 22, 2024
  
  b3e5491e
21 Jul, 2024 1 commit
- Remove out of space test temporarily (#5825) · 80ee9b5e
  Jeffrey Morgan authored Jul 21, 2024
  
  80ee9b5e
20 Jul, 2024 1 commit
- Fix generate test flakiness (#5804) · 69a2d4cc
  Jeffrey Morgan authored Jul 19, 2024
  
  69a2d4cc
19 Jul, 2024 1 commit
- server: validate template (#5734) · e8b954c6
  Josh authored Jul 19, 2024
  
  add template validation to modelfile
  
  e8b954c6
18 Jul, 2024 3 commits
- fix parsing tool calls · 43606d6d
  Michael Yang authored Jul 18, 2024
  
  43606d6d
- server: check for empty tools array too (#5779) · 70b1010f
  Jeffrey Morgan authored Jul 18, 2024
  
  70b1010f
- server: only parse tool calls if tools are provided (#5771) · 319fb1ce
  Jeffrey Morgan authored Jul 18, 2024
  
  * server: only parse tool calls if tools are provided
  
  * still set `resp.Message.Content`
  
  319fb1ce
17 Jul, 2024 2 commits
- marshal json automatically for some template values (#5758) · b2554455
  Michael Yang authored Jul 17, 2024
  
  b2554455
- parse tool call as individual objects · 5fd69881
  Michael Yang authored Jul 17, 2024
  
  5fd69881
16 Jul, 2024 1 commit
- remove ToolCall from GenerateResponse · c279f963
  Michael Yang authored Jul 16, 2024
  
  c279f963