- 10 Sep, 2024 1 commit
-
-
Patrick Devine authored
-
- 08 Sep, 2024 2 commits
-
-
Jeffrey Morgan authored
-
RAPID ARCHITECT authored
-
- 07 Sep, 2024 3 commits
-
-
frob authored
-
Jeffrey Morgan authored
Includes small improvements to document layout and code blocks
-
Yaroslav authored
-
- 06 Sep, 2024 5 commits
-
-
nickthecook authored
-
imoize authored
-
Daniel Hiltgen authored
When we determine a GPU is too small for any layers, it's not always clear why. This will help troubleshoot those scenarios.
-
frob authored
-
Patrick Devine authored
-
- 05 Sep, 2024 14 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This reverts commit a60d9b89.
-
Daniel Hiltgen authored
With the new very large parameter models, some users are willing to wait for a very long time for models to load.
-
Daniel Hiltgen authored
Provide a mechanism for users to set aside an amount of VRAM on each GPU to make room for other applications they want to start after Ollama, or to work around memory prediction bugs.
-
Daniel Hiltgen authored
-
Michael Yang authored
llama3.1 memory
-
Zeyo authored
-
Michael authored
* Update gpu.md
Seems strange that the laptop versions of 3050 and 3050 Ti would be supported but not the non-notebook, but this is what the page (https://developer.nvidia.com/cuda-gpus) says.
Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>
* Update gpu.md
Remove notebook reference
---------
Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>
-
Tobias Heinze authored
-
Vitaly Zdanevich authored
-
王卿 authored
Replace "command -v" with encapsulated functionality
-
Augustinas Malinauskas authored
Added Enchanted with Apple Vision Pro support
-
Silas Marvin authored
-
Arda Günsüren authored
-
- 04 Sep, 2024 13 commits
-
-
jk011ru authored
-
Pascal Patry authored
-
Rune Berg authored
-
Tomoya Fujita authored
-
Teïlo M authored
-
亢奋猫 authored
-
Mitar authored
-
Carter authored
change "dorrect" to "correct"
-
Erkin Alp Güney authored
-
Sam authored
-
Viz authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
It looks like driver 525 (i.e., CUDA driver 12.0) has problems with the CUDA v12 library we compile against, so run v11 on those older drivers if detected.
-
- 03 Sep, 2024 2 commits
-
-
Daniel Hiltgen authored
On systems with low system memory, we can hit allocation failures that are difficult to diagnose without debug logs. This will make it easier to spot.
-
Mateusz Migas authored
-