- 14 Aug, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 08 Aug, 2024 1 commit
-
-
Jesse Gross authored
Commit 1829fb61 ("manifest: Fix crash on startup when trying to clean up unused files (#5840)") changed the config layer stored in manifests from a pointer to a value. This was done in order to avoid potential nil pointer dereferences after it is deserialized from JSON in the event that the field is missing. This changes the Layers slice to also be stored by value. This enables consistency in handling across the two objects.
-
- 07 Aug, 2024 3 commits
-
-
Jesse Gross authored
When creating a model the config layer is appended to the list of layers and then the last layer is used as the config when writing the manifest. This change directly uses the config layer to write the manifest. There is no behavior change but it is less error prone.
-
Jesse Gross authored
Currently if the config field is missing in the manifest file (or corrupted), Ollama will crash when it tries to read it. This can happen at startup or when pulling new models. This data is mostly just used for showing model information so we can be tolerant of it not being present - it is not required to run the models. Besides avoiding crashing, this also gives us the ability to restructure the config in the future by pulling it into the main manifest file.
-
Jesse Gross authored
If there is an error when opening a manifest file (corrupted, permission denied, etc.) then the referenced layers will not be included in the list of active layers. This causes them to be deleted when pruning happens at startup or a model is pulled. In such a situation, we should prefer to preserve data in the hopes that it can be recovered rather than being agressive about deletion.
-
- 02 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 31 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 26 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 25 Jul, 2024 1 commit
-
-
Blake Mizerany authored
This changes the registry client to reuse the original download URL it gets on the first redirect response for all subsequent requests, preventing thundering herd issues when hot new LLMs are released.
-
- 22 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 19 Jul, 2024 1 commit
-
-
Josh authored
add template validation to modelfile
-
- 16 Jul, 2024 1 commit
-
-
Michael Yang authored
this change is triggered by the presence of "suffix", particularly useful for code completion tasks
-
- 15 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 05 Jul, 2024 1 commit
-
-
Michael Yang authored
-
- 01 Jul, 2024 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 25 Jun, 2024 1 commit
-
-
Blake Mizerany authored
Previously, some costly things were causing the loading of GGUF files and their metadata and tensor information to be VERY slow: * Too many allocations when decoding strings * Hitting disk for each read of each key and value, resulting in a not-okay amount of syscalls/disk I/O. The show API is now down to 33ms from 800ms+ for llama3 on a macbook pro m3. This commit also prevents collecting large arrays of values when decoding GGUFs (if desired). When such keys are encountered, their values are null, and are encoded as such in JSON. Also, this fixes a broken test that was not encoding valid GGUF.
-
- 21 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 13 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 12 Jun, 2024 1 commit
-
-
Michael Yang authored
multiple templates may appear in a model if a model is created from another model that 1) has an autodetected template and 2) defines a custom template
-
- 07 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 06 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 05 Jun, 2024 1 commit
-
-
Blake Mizerany authored
-
- 04 Jun, 2024 5 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 24 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 20 May, 2024 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
particularly useful for zipfiles and f16s
-
Patrick Devine authored
-
- 14 May, 2024 1 commit
-
-
Michael Yang authored
-
- 09 May, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 08 May, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-