- 11 Sep, 2024 2 commits
-
-
Petr Mironychev authored
-
Daniel Hiltgen authored
This adds back a check which was lost many releases back to verify /dev/kfd permissions which when lacking, can lead to confusing failure modes of: "rocBLAS error: Could not initialize Tensile host: No devices found" This implementation does not hard fail the serve command but instead will fall back to CPU with an error log. In the future we can include this in the GPU discovery UX to show detected but unsupported devices we discovered.
-
- 10 Sep, 2024 5 commits
-
-
Michael Yang authored
add *_proxy to env map for debugging
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
* Quiet down dockers new lint warnings Docker has recently added lint warnings to build. This cleans up those warnings. * Fix go lint regression
-
Patrick Devine authored
-
- 08 Sep, 2024 2 commits
-
-
Jeffrey Morgan authored
-
RAPID ARCHITECT authored
-
- 07 Sep, 2024 3 commits
-
-
frob authored
-
Jeffrey Morgan authored
Includes small improvements to document layout and code blocks
-
Yaroslav authored
-
- 06 Sep, 2024 5 commits
-
-
nickthecook authored
-
imoize authored
-
Daniel Hiltgen authored
When we determine a GPU is too small for any layers, it's not always clear why. This will help troubleshoot those scenarios.
-
frob authored
-
Patrick Devine authored
-
- 05 Sep, 2024 14 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This reverts commit a60d9b89.
-
Daniel Hiltgen authored
With the new very large parameter models, some users are willing to wait for a very long time for models to load.
-
Daniel Hiltgen authored
Provide a mechanism for users to set aside an amount of VRAM on each GPU to make room for other applications they want to start after Ollama, or workaround memory prediction bugs
-
Daniel Hiltgen authored
-
Michael Yang authored
llama3.1 memory
-
Zeyo authored
-
Michael authored
* Update gpu.md Seems strange that the laptop versions of 3050 and 3050 Ti would be supported but not the non-notebook, but this is what the page (https://developer.nvidia.com/cuda-gpus ) says. Signed-off-by:bean5 <2052646+bean5@users.noreply.github.com> * Update gpu.md Remove notebook reference --------- Signed-off-by:
bean5 <2052646+bean5@users.noreply.github.com>
-
Tobias Heinze authored
-
Vitaly Zdanevich authored
-
王卿 authored
Replace "command -v" with encapsulated functionality
-
Augustinas Malinauskas authored
Added Enchanted with Apple Vision Pro support
-
Silas Marvin authored
-
Arda Günsüren authored
-
- 04 Sep, 2024 9 commits
-
-
jk011ru authored
-
Pascal Patry authored
-
Rune Berg authored
-
Tomoya Fujita authored
-
Teïlo M authored
-
亢奋猫 authored
-
Mitar authored
-
Carter authored
change "dorrect" to "correct"
-
Erkin Alp Güney authored
-