- 31 Jan, 2024 1 commit
-
-
Richard Macarthy authored
-
- 30 Jan, 2024 3 commits
-
-
Bruce MacDonald authored
-
Maximilian Weber authored
Added - [Ollama for R - rollama](https://github.com/JBGruber/rollama) in Libraries in README.md
-
Daniel Hiltgen authored
Add container hints for troubleshooting
-
- 29 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
Some users are new to containers and unsure where the server logs go
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Don't disable GPUs on arm without AVX
-
- 28 Jan, 2024 5 commits
-
-
Daniel Hiltgen authored
AVX is an x86 feature, so ARM should be excluded from the check.
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Harden for zero detected GPUs
-
Daniel Hiltgen authored
At least with the ROCm libraries, its possible to have the library present with zero GPUs. This fix avoids a divide by zero bug in llm.go when we try to calculate GPU memory with zero GPUs.
-
- 27 Jan, 2024 3 commits
-
-
Daniel Hiltgen authored
Add support for CUDA 5.0 cards
-
Daniel Hiltgen authored
ROCm: Correct the response string in rocm_get_version function
-
Jagadish Krishnamoorthy authored
-
- 26 Jan, 2024 17 commits
-
-
Patrick Devine authored
-
Daniel Hiltgen authored
Detect lack of AVX and fallback to CPU mode
-
Daniel Hiltgen authored
We build the GPU libraries with AVX enabled to ensure that if not all layers fit on the GPU we get better performance in a mixed mode. If the user is using a virtualization/emulation system that lacks AVX this used to result in an illegal instruction error and crash before this fix. Now we will report a warning in the server log, and just use CPU mode to ensure we don't crash.
-
Michael Yang authored
fix build
-
Michael Yang authored
-
Michael Yang authored
download: add inactivity monitor
-
Daniel Hiltgen authored
Add back ROCm container support
-
Daniel Hiltgen authored
Ignore AMD integrated GPUs
-
Daniel Hiltgen authored
Fix crash on cuda ml init failure
-
Daniel Hiltgen authored
This adds ROCm support back as a discrete image.
-
Daniel Hiltgen authored
Detect and ignore integrated GPUs reported by rocm.
-
Daniel Hiltgen authored
The new driver lookup code was triggering after init failure due to a missing return
-
Daniel Hiltgen authored
Switch back to ubuntu base
-
Daniel Hiltgen authored
The size increase for rocm support in the standard image is problematic We'll revisit multiple tags for rocm support in a follow up PR.
-
Michael Yang authored
build cuda and rocm
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 25 Jan, 2024 8 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* Fix clearing kv cache between requests with the same prompt * fix powershell script
-
Patrick Devine authored
-
Michael Yang authored
stub generate outputs for lint
-