- 20 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Taras Tsugrii authored
-
- 19 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
This should resolve the problem where we don't fully unload from the GPU when we go idle.
-
- 16 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
Also fixes a few fit-and-finish items for better developer experience
-
- 15 Feb, 2024 2 commits
-
-
Daniel Hiltgen authored
Even though we weren't setting it to on, somewhere in the cmake config it was getting toggled on. By explicitly setting it to off, we get `/arch:AVX` as intended.
-
Daniel Hiltgen authored
This focuses on Windows first, but coudl be used for Mac and possibly linux in the future.
-
- 14 Feb, 2024 4 commits
-
-
Jeffrey Morgan authored
This reverts commit 6920964b.
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
This reverts commit 2f9ed52b.
-
Jeffrey Morgan authored
-
- 12 Feb, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
This wires up some new logic to start using sysfs to discover AMD GPU information and detects old cards we can't yet support so we can fallback to CPU mode.
-
Jeffrey Morgan authored
-
- 09 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
Make sure that when a shutdown signal comes, we shutdown quickly instead of waiting for a potentially long exchange to wrap up.
-
- 08 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
When we store our libraries in a temp dir, a reaper might clean them when we are idle, so make sure to check for them before we reload.
-
- 06 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 02 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
Only apply patches if we have any, and make sure to cleanup every file we patched at the end to leave the tree clean
-
- 01 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Michael Yang authored
-
- 31 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This requires an upstream change to support graceful termination, carried as a patch.
-
- 29 Jan, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 25 Jan, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Fix clearing kv cache between requests with the same prompt * fix powershell script
-
mraiser authored
-
- 24 Jan, 2024 1 commit
-
-
Michael Yang authored
-
- 23 Jan, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
For old macs, accelerate seems to cause crashes, but for AVX2 capable macs, it does not.
-
- 22 Jan, 2024 4 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
This wires up logging in llama.cpp to always go to stderr, and also turns up logging if OLLAMA_DEBUG is set.
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
- 21 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
Detect potential error scenarios so we can fallback to CPU mode without hitting asserts.
-
Daniel Hiltgen authored
The linux build now support parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advaced users who want to alter our default set.
-
Jeffrey Morgan authored
-
- 20 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
- 19 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Self Denial authored
-