- 07 Aug, 2024 2 commits
-
-
Jesse Gross authored
Currently if the config field is missing in the manifest file (or corrupted), Ollama will crash when it tries to read it. This can happen at startup or when pulling new models. This data is mostly just used for showing model information so we can be tolerant of it not being present - it is not required to run the models. Besides avoiding crashing, this also gives us the ability to restructure the config in the future by pulling it into the main manifest file.
-
Jesse Gross authored
If there is an error when opening a manifest file (corrupted, permission denied, etc.) then the referenced layers will not be included in the list of active layers. This causes them to be deleted when pruning happens at startup or a model is pulled. In such a situation, we should prefer to preserve data in the hopes that it can be recovered rather than being aggressive about deletion.
-
- 06 Aug, 2024 1 commit
-
-
Daniel Hiltgen authored
The file.Truncate call on Windows will write the whole file unless you set the sparse flag, leading to heavy I/O at the beginning of download. This should improve our I/O behavior on Windows and put less stress on the user's disk.
-
- 02 Aug, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 01 Aug, 2024 5 commits
-
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
- 31 Jul, 2024 5 commits
-
-
Michael Yang authored
-
Blake Mizerany authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 30 Jul, 2024 2 commits
-
-
royjhan authored
* add prompt tokens to embed response
* rm slog
* metrics
* types
* prompt n
* clean up
* reset submodule
* update tests
* test name
* list metrics
-
Daniel Hiltgen authored
In multi-brand GPU setups, if we couldn't fully load the model we would fall through the scheduler and mistakenly try to load across a mix of brands. This makes sure we find the set of GPU(s) that best fits the partial load.
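The idea of never mixing brands can be sketched as follows. This is a simplified illustration, not the scheduler's actual logic: the `gpu` type, the brand labels, and the assumption that "best fit" means the most total free memory are all hypothetical.

```go
package main

import "fmt"

// gpu is a hypothetical, simplified device description.
type gpu struct {
	Brand  string // illustrative label such as "cuda" or "rocm"
	FreeMB uint64
}

// bestBrandGroup groups GPUs by brand and returns the single-brand group
// with the most total free memory (an assumed fitness measure), so a
// partial load never spans a mix of brands.
func bestBrandGroup(gpus []gpu) []gpu {
	groups := make(map[string][]gpu)
	totals := make(map[string]uint64)
	var order []string // first-seen order keeps the result deterministic
	for _, g := range gpus {
		if _, seen := groups[g.Brand]; !seen {
			order = append(order, g.Brand)
		}
		groups[g.Brand] = append(groups[g.Brand], g)
		totals[g.Brand] += g.FreeMB
	}
	var best string
	found := false
	for _, b := range order {
		if !found || totals[b] > totals[best] {
			best, found = b, true
		}
	}
	if !found {
		return nil
	}
	return groups[best]
}

func main() {
	gpus := []gpu{{"cuda", 8000}, {"rocm", 6000}, {"rocm", 6000}}
	fmt.Println(bestBrandGroup(gpus))
}
```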
-
- 26 Jul, 2024 3 commits
-
-
Blake Mizerany authored
This fixes various data races scattered throughout the download/pull client where the client was accessing the download state concurrently. This commit is mostly a hot-fix and will be replaced by a new client one day soon. Also, remove the unnecessary opts argument from downloadChunk.
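For readers unfamiliar with this class of bug, the sketch below shows one common way to make shared download state safe for concurrent access. It is a generic illustration with hypothetical names (`downloadState`, `runChunks`), not the fix applied in the commit: every read and write of the counter goes through a mutex.

```go
package main

import (
	"fmt"
	"sync"
)

// downloadState is a hypothetical piece of state shared by chunk workers.
type downloadState struct {
	mu        sync.Mutex
	completed int64
}

// add records n more completed bytes while holding the lock.
func (s *downloadState) add(n int64) {
	s.mu.Lock()
	s.completed += n
	s.mu.Unlock()
}

// Completed returns the current total while holding the lock.
func (s *downloadState) Completed() int64 {
	s.mu.Lock()
	defer s.mu.Unlock()
	return s.completed
}

// runChunks starts several workers that update the shared state
// concurrently and returns the final total once all have finished.
func runChunks(workers, writes int) int64 {
	var s downloadState
	var wg sync.WaitGroup
	for i := 0; i < workers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for j := 0; j < writes; j++ {
				s.add(1)
			}
		}()
	}
	wg.Wait()
	return s.Completed()
}

func main() {
	fmt.Println(runChunks(8, 1000))
}
```

Running a program like this with `go run -race` is a quick way to confirm that no unsynchronized access remains.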
-
Michael Yang authored
-
Michael Yang authored
-
- 25 Jul, 2024 1 commit
-
-
Blake Mizerany authored
This changes the registry client to reuse the original download URL it gets on the first redirect response for all subsequent requests, preventing thundering herd issues when hot new LLMs are released.
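The approach can be sketched with the standard `net/http` package. This is an illustration under assumptions, not the registry client itself: `resolveOnce`, `fetchRange`, and `demo` are hypothetical helpers, and local `httptest` servers stand in for the registry and the blob store. The redirect is resolved once, then every ranged request goes straight to the resolved URL.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
	"sync/atomic"
)

// resolveOnce asks the registry URL a single time and returns the redirect
// target without following it.
func resolveOnce(registryURL string) (string, error) {
	client := &http.Client{
		CheckRedirect: func(req *http.Request, via []*http.Request) error {
			return http.ErrUseLastResponse // hand back the 3xx response itself
		},
	}
	resp, err := client.Get(registryURL)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	loc, err := resp.Location()
	if err != nil {
		return "", err
	}
	return loc.String(), nil
}

// fetchRange requests one byte range directly from the resolved URL.
func fetchRange(url, byteRange string) error {
	req, err := http.NewRequest(http.MethodGet, url, nil)
	if err != nil {
		return err
	}
	req.Header.Set("Range", byteRange)
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	_, err = io.Copy(io.Discard, resp.Body)
	return err
}

// demo counts how often each test server is hit for three chunk requests.
func demo() (int32, int32, error) {
	var registryHits, blobHits int32
	blob := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		atomic.AddInt32(&blobHits, 1)
		fmt.Fprint(w, "chunk")
	}))
	defer blob.Close()
	registry := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		atomic.AddInt32(&registryHits, 1)
		http.Redirect(w, r, blob.URL+"/blob", http.StatusTemporaryRedirect)
	}))
	defer registry.Close()

	direct, err := resolveOnce(registry.URL)
	if err != nil {
		return 0, 0, err
	}
	for _, rng := range []string{"bytes=0-99", "bytes=100-199", "bytes=200-299"} {
		if err := fetchRange(direct, rng); err != nil {
			return 0, 0, err
		}
	}
	return atomic.LoadInt32(&registryHits), atomic.LoadInt32(&blobHits), nil
}

func main() {
	r, b, err := demo()
	fmt.Println(r, b, err)
}
```

In this sketch the registry handler is contacted once regardless of how many chunks are fetched, which is the property that reduces load when many clients pull the same model at the same time.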
-
- 22 Jul, 2024 10 commits
-
-
Josh authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Jeffrey Morgan authored
-
- 21 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 20 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 19 Jul, 2024 1 commit
-
-
Josh authored
add template validation to modelfile
-
- 18 Jul, 2024 3 commits
-
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* server: only parse tool calls if tools are provided
* still set `resp.Message.Content`
-
- 17 Jul, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 16 Jul, 2024 1 commit
-
-
Michael Yang authored
-