- 22 Jan, 2024 9 commits
-
Daniel Hiltgen authored
Refine debug logging for llm
-
Daniel Hiltgen authored
Debug logging on init failure
-
Daniel Hiltgen authored
This wires up logging in llama.cpp to always go to stderr, and also turns up logging if OLLAMA_DEBUG is set.
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Michael Yang authored
fix: remove overwritten model layers
-
Meng Zhuo authored
-
Daniel Hiltgen authored
Make CPU builds parallel and AMD GPU targets customizable
-
Daniel Hiltgen authored
Probe GPUs before backend init
-
- 21 Jan, 2024 5 commits
-
Daniel Hiltgen authored
Detect potential error scenarios so we can fallback to CPU mode without hitting asserts.
-
Daniel Hiltgen authored
The Linux build now supports parallel CPU builds to speed things up. This also exposes AMD GPU targets as an optional setting for advanced users who want to alter our default set.
-
Daniel Hiltgen authored
Combine the 2 Dockerfiles and add ROCm
-
Daniel Hiltgen authored
This renames Dockerfile.build to Dockerfile and adds new stages to support 2 modes of building: the build_linux.sh script uses intermediate stages to extract the artifacts for ./dist, and the default build generates a container image usable by both CUDA and ROCm cards. This required transitioning the x86 base to the ROCm image to avoid layer bloat.
-
Jeffrey Morgan authored
-
- 20 Jan, 2024 2 commits
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 19 Jan, 2024 10 commits
-
Michael Yang authored
If create overrides a manifest, first add the older manifest's layers to the delete map so they can be cleaned up.
-
Daniel Hiltgen authored
More WSL paths
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
Switch to local dlopen symbols
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Fix CPU-only build under Android Termux environment.
-
Self Denial authored
-
Self Denial authored
Update gpu.go initGPUHandles() to declare gpuHandles variable before reading it. This resolves an "invalid memory address or nil pointer dereference" error. Update dyn_ext_server.c to avoid setting the RTLD_DEEPBIND flag under __TERMUX__ (Android).
-
Michael Yang authored
fix show handler
-
- 18 Jan, 2024 14 commits
-
Michael Yang authored
-
Michael Yang authored
api: add model for all requests
-
Michael Yang authored
ci: update setup-go action
-
Michael Yang authored
-
Michael Yang authored
prefer using req.Model and fallback to req.Name
-
Daniel Hiltgen authored
Mechanical switch from log to slog
-
Michael Yang authored
install: pin fedora to max 37
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
A few obvious levels were adjusted, but generally everything mapped to "info" level.
-
Mike Bird authored
-
Sachin Sachdeva authored
Updated readme with the web link for haystack ollama integration
-
Daniel Hiltgen authored
Let gpu.go and gen_linux.sh also find CUDA on Arch Linux
-
Daniel Hiltgen authored
Refine the Linux CUDA/ROCm developer docs
-
Daniel Hiltgen authored
Add multiple CPU variants for Intel Mac
-