- 27 Jan, 2024 2 commits
-
Daniel Hiltgen authored
ROCm: Correct the response string in rocm_get_version function
-
Jagadish Krishnamoorthy authored
-
- 26 Jan, 2024 17 commits
-
Patrick Devine authored
-
Daniel Hiltgen authored
Detect lack of AVX and fall back to CPU mode
-
Daniel Hiltgen authored
We build the GPU libraries with AVX enabled so that, when not all layers fit on the GPU, mixed mode still performs well. On a virtualization/emulation system that lacks AVX, this used to cause an illegal instruction error and a crash. We now log a warning in the server log and use CPU mode instead, so we don't crash.
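The fallback described above can be pictured with a small sketch. This is only an illustration in Python, not the project's implementation: `has_avx` and `choose_mode` are hypothetical names, and the check assumes Linux-style `/proc/cpuinfo` text where CPU features appear on a `flags` line.

```python
import logging


def has_avx(cpuinfo_text: str) -> bool:
    """Return True if a 'flags' line in /proc/cpuinfo-style text lists avx."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        # Compare whole tokens so that "avx2" alone does not count as "avx".
        if key.strip() == "flags" and "avx" in value.split():
            return True
    return False


def choose_mode(cpuinfo_text: str, gpu_available: bool) -> str:
    """Pick "gpu" or "cpu"; hypothetical helper mirroring the described fallback."""
    if gpu_available and not has_avx(cpuinfo_text):
        # The GPU libraries are built with AVX, so loading them would crash here.
        logging.warning("CPU lacks AVX; falling back to CPU mode")
        return "cpu"
    return "gpu" if gpu_available else "cpu"


# Sample inputs standing in for the contents of /proc/cpuinfo.
with_avx = "processor\t: 0\nflags\t\t: fpu sse sse2 avx avx2\n"
without_avx = "processor\t: 0\nflags\t\t: fpu sse sse2\n"
print(choose_mode(with_avx, gpu_available=True))
print(choose_mode(without_avx, gpu_available=True))
```

The key point is that the check happens before any AVX-built library is loaded, so the unsupported instruction is never executed.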
-
Michael Yang authored
fix build
-
Michael Yang authored
-
Michael Yang authored
download: add inactivity monitor
-
Daniel Hiltgen authored
Add back ROCm container support
-
Daniel Hiltgen authored
Ignore AMD integrated GPUs
-
Daniel Hiltgen authored
Fix crash on cuda ml init failure
-
Daniel Hiltgen authored
This adds ROCm support back as a discrete image.
-
Daniel Hiltgen authored
Detect and ignore integrated GPUs reported by ROCm.
-
Daniel Hiltgen authored
The new driver lookup code was triggering after an init failure because of a missing return.
-
Daniel Hiltgen authored
Switch back to ubuntu base
-
Daniel Hiltgen authored
The size increase for ROCm support in the standard image is problematic. We'll revisit multiple tags for ROCm support in a follow-up PR.
-
Michael Yang authored
build cuda and rocm
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 25 Jan, 2024 11 commits
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Fix clearing kv cache between requests with the same prompt
* fix powershell script
-
Patrick Devine authored
-
Michael Yang authored
stub generate outputs for lint
-
Michael Yang authored
refactor tensor read
-
Jeffrey Morgan authored
-
Michael Yang authored
-
- 24 Jan, 2024 5 commits
-
Daniel Hiltgen authored
More logging for gpu management
-
Michael Yang authored
-
Daniel Hiltgen authored
Fix an ordering glitch of dlerr/dlclose and add more logging to help root-cause some crashes users are hitting. This also renames the function pointers to use the underlying function names instead of simplified names, for readability.
-
Daniel Hiltgen authored
Report more information about GPUs in verbose mode
-
Jeffrey Morgan authored
-
- 23 Jan, 2024 5 commits
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
This adds calls to both the CUDA and ROCm management libraries to discover more attributes of the GPU(s) detected in the system, and wires up runtime verbosity selection. When users hit problems with GPUs, we can ask them to run with `OLLAMA_DEBUG=1 ollama serve` and share the results.
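Runtime verbosity selection of this kind can be sketched as follows. This is an illustrative Python example, not the project's code: `configure_logging` is a hypothetical name, and treating any non-empty `OLLAMA_DEBUG` value as "debug on" is an assumption, since the description only shows `OLLAMA_DEBUG=1`.

```python
import logging
import os


def configure_logging(env=os.environ) -> int:
    """Set the root logger level from OLLAMA_DEBUG and return the chosen level."""
    # Assumption: any non-empty value enables debug output.
    level = logging.DEBUG if env.get("OLLAMA_DEBUG") else logging.INFO
    logging.getLogger().setLevel(level)
    return level


# Passing a plain dict instead of os.environ keeps the example self-contained.
configure_logging({"OLLAMA_DEBUG": "1"})
logging.debug("extra GPU attributes would be reported at this level")
```

Reading the variable at startup means users can turn on detailed GPU reporting without rebuilding or changing any configuration file.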
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Set a default version using git describe
-