- 23 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This adds calls to both the CUDA and ROCm management libraries to discover more attributes of the GPU(s) detected in the system, and wires up runtime verbosity selection. When users hit problems with GPUs, we can ask them to run with `OLLAMA_DEBUG=1 ollama serve` and share the results.
-
- 10 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
When multiple management libraries are installed on a system, not every one will be compatible with the current driver. This change improves our management library discovery algorithm to build up a set of candidate libraries based on glob patterns, and then try each of them until one loads without error.
-
- 09 Jan, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 07 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
If we try to load the CUDA library on an old GPU, it panics and crashes the server. This checks the compute capability before we load the library so we can gracefully fall back to CPU mode.
-
- 06 Jan, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 05 Jan, 2024 1 commit
-
-
Jeffrey Morgan authored
* gpu: read memory info from all cuda devices
* add `LOOKUP_SIZE` constant
* better constant name
* address comments
-
- 03 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
This refines the `gpu` package error handling and fixes a bug with the system memory lookup on Windows.
-
- 19 Dec, 2023 4 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
If someone checks out the ollama repo and doesn't install the CUDA library, this will ensure they can build a CPU-only version.
-
Daniel Hiltgen authored
-