- 18 Jan, 2024 6 commits
-
-
Sachin Sachdeva authored
Updated readme with the web link for haystack ollama integration
-
Daniel Hiltgen authored
Let gpu.go and gen_linux.sh also find CUDA on Arch Linux
-
Daniel Hiltgen authored
Refine the linux cuda/rocm developer docs
-
Daniel Hiltgen authored
Add multiple CPU variants for Intel Mac
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
The runners are x86 so we can only run binaries that match.
-
- 17 Jan, 2024 4 commits
-
-
Daniel Hiltgen authored
This also refines the build process for the ext_server build.
-
Alexander F. Rødseth authored
-
Michael Yang authored
fix: normalize name path before splitting
-
Michael Yang authored
-
- 16 Jan, 2024 7 commits
-
-
Daniel Hiltgen authored
Bump llama.cpp to b1842 and add new cuda lib dep
-
Daniel Hiltgen authored
Upstream llama.cpp has added a new dependency with the NVIDIA CUDA Driver Libraries (libcuda.so) which is part of the driver distribution, not the general cuda libraries, and is not available as an archive, so we can not statically link it. This may introduce some additional compatibility challenges which we'll need to keep an eye on.
-
Daniel Hiltgen authored
Add macos cross-compile CI coverage
-
Michael Yang authored
remove client.py
-
Bruce MacDonald authored
- prompt cache causes inferance to hang after some time
-
Patrick Devine authored
-
Michael Yang authored
fix: request retry with error
-
- 15 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
improve cuda detection (rel. issue #1704)
-
- 14 Jan, 2024 6 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Fix typo in arm mac arch script
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Fix intel mac build
-
Alexander F. Rødseth authored
-
Jeffrey Morgan authored
-
- 13 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
Make sure we're building an x86 ext_server lib when cross-compiling
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 12 Jan, 2024 10 commits
-
-
Michael Yang authored
remove double newlines in /set parameter
-
Michael Yang authored
add max context length check
-
Michael Yang authored
-
Michael Yang authored
this fixes a subtle bug with makeRequestWithRetry where an HTTP status error on a retried request will potentially not return the right err
-
Fabian Preiss authored
-
Patrick Devine authored
-
Michael Yang authored
-
Michael Yang authored
Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Tristram Oaten authored
After executing the `userdel ollama` command, I saw this message: ```sh $ sudo userdel ollama userdel: group ollama not removed because it has other members. ``` Which reminded me that I had to remove the dangling group too. For completeness, the uninstall instructions should do this too. Thanks!
-
Michael Yang authored
-
- 11 Jan, 2024 3 commits
-
-
Michael Yang authored
-
Daniel Hiltgen authored
Fix up the CPU fallback selection
-
Daniel Hiltgen authored
The memory changes and multi-variant change had some merge glitches I missed. This fixes them so we actually get the cpu llm lib and best variant for the given system.
-