- 07 Aug, 2024 5 commits
-
-
Kyle Kelley authored
* Use llama3.1 in tools example * Update api.md
-
Jesse Gross authored
Fix crash on startup when trying to clean up unused files (#5840)
-
Jesse Gross authored
Currently if the config field is missing in the manifest file (or corrupted), Ollama will crash when it tries to read it. This can happen at startup or when pulling new models. This data is mostly just used for showing model information so we can be tolerant of it not being present - it is not required to run the models. Besides avoiding crashing, this also gives us the ability to restructure the config in the future by pulling it into the main manifest file.
-
Jesse Gross authored
If there is an error when opening a manifest file (corrupted, permission denied, etc.) then the referenced layers will not be included in the list of active layers. This causes them to be deleted when pruning happens at startup or a model is pulled. In such a situation, we should prefer to preserve data in the hopes that it can be recovered rather than being agressive about deletion.
-
Jeffrey Morgan authored
-
- 06 Aug, 2024 4 commits
-
-
Jeffrey Morgan authored
-
Chua Chee Seng authored
-
Daniel Hiltgen authored
Ensure sparse files on windows during download
-
Daniel Hiltgen authored
The file.Truncate call on windows will write the whole file unless you set the sparse flag, leading to heavy I/O at the beginning of download. This should improve our I/O behavior on windows and put less stress on the users disk.
-
- 05 Aug, 2024 9 commits
-
-
royjhan authored
-
Michael Yang authored
fix concurrency test
-
Michael Yang authored
-
Daniel Hiltgen authored
Implement linux NUMA detection
-
Michael Yang authored
use testing tempdirs
-
Daniel Hiltgen authored
If the system has multiple numa nodes, enable numa support in llama.cpp If we detect numactl in the path, use that, else use the basic "distribute" mode.
-
Michael Yang authored
line feed
-
frob authored
Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.
-
Michael Yang authored
-
- 04 Aug, 2024 1 commit
-
-
sryu1 authored
-
- 03 Aug, 2024 1 commit
-
-
Ivan Charapanau authored
-
- 02 Aug, 2024 5 commits
-
-
Michael Yang authored
-
Michael Yang authored
enable gofmt/gofumpt/goimports/tenv
-
royjhan authored
* Update docs * token bug corrected * Update docs/openai.md * Update docs/openai.md * add suffix * merge conflicts * merge conflicts
-
royjhan authored
-
Michael Yang authored
-
- 01 Aug, 2024 13 commits
-
-
Kim Hallberg authored
-
royjhan authored
* docs without usage * no usage * rm metric note
-
royjhan authored
-
royjhan authored
* add prompt tokens to embed response * rm slog * metrics * types * prompt n * clean up * reset submodule * add tokens to v1/embeddings * separate usage
-
royjhan authored
* OpenAI Docs * Update docs/openai.md Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Remove newline --------- Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
Michael Yang authored
Fix context in /api/generate grows too much (#5980).
-
Michael Yang authored
refactor convert
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Vyacheslav Moskalev authored
-
Michael Yang authored
fix modelfile message quotes
-
- 31 Jul, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
patches: phi3 optional sliding window attention
-