"tests/vscode:/vscode.git/clone" did not exist on "b5e1facc85219770b6e85a9cfb2ec554167ccedc"
- 14 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
This reverts commit 2f9ed52b.
-
Jeffrey Morgan authored
-
- 12 Feb, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
This wires up some new logic to start using sysfs to discover AMD GPU information and detects old cards we can't yet support so we can fallback to CPU mode.
-
Jeffrey Morgan authored
-
- 09 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
Make sure that when a shutdown signal comes, we shutdown quickly instead of waiting for a potentially long exchange to wrap up.
-
- 08 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
When we store our libraries in a temp dir, a reaper might clean them when we are idle, so make sure to check for them before we reload.
-
- 06 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 02 Feb, 2024 1 commit
-
-
Daniel Hiltgen authored
Only apply patches if we have any, and make sure to cleanup every file we patched at the end to leave the tree clean
-
- 01 Feb, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Michael Yang authored
-
- 31 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This requires an upstream change to support graceful termination, carried as a patch.
-
- 29 Jan, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 25 Jan, 2024 3 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Fix clearing kv cache between requests with the same prompt * fix powershell script
-
mraiser authored
-
- 24 Jan, 2024 1 commit
-
-
Michael Yang authored
-
- 23 Jan, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
For old macs, accelerate seems to cause crashes, but for AVX2 capable macs, it does not.
-
- 22 Jan, 2024 4 commits
-
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
This wires up logging in llama.cpp to always go to stderr, and also turns up logging if OLLAMA_DEBUG is set.
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
- 21 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
Detect potential error scenarios so we can fallback to CPU mode without hitting asserts.
-
Daniel Hiltgen authored
The linux build now support parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advaced users who want to alter our default set.
-
Jeffrey Morgan authored
-
- 20 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
- 19 Jan, 2024 4 commits
-
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Self Denial authored
-
Self Denial authored
Update gpu.go initGPUHandles() to declare gpuHandles variable before reading it. This resolves an "invalid memory address or nil pointer dereference" error. Update dyn_ext_server.c to avoid setting the RTLD_DEEPBIND flag under __TERMUX__ (Android).
-
- 18 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
A few obvious levels were adjusted, but generally everything mapped to "info" level.
-
- 17 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This also refines the build process for the ext_server build.
-
- 16 Jan, 2024 2 commits
-
-
Daniel Hiltgen authored
Upstream llama.cpp has added a new dependency with the NVIDIA CUDA Driver Libraries (libcuda.so) which is part of the driver distribution, not the general cuda libraries, and is not available as an archive, so we can not statically link it. This may introduce some additional compatibility challenges which we'll need to keep an eye on.
-
Bruce MacDonald authored
- prompt cache causes inferance to hang after some time
-
- 14 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Alexander F. Rødseth authored
-
Jeffrey Morgan authored
-