- 17 Oct, 2023 4 commits
-
-
Michael Yang authored
fix: wrong format string type
-
Michael Yang authored
fix: regression unsupported metal types
-
Andreas Wäscher authored
-
Alexander F. Rødseth authored
-
- 16 Oct, 2023 13 commits
-
-
Michael Yang authored
-
Michael Yang authored
Add oterm to community integrations
-
Michael Yang authored
-
Michael Yang authored
Add ellama community integration
-
Michael Yang authored
Update install.sh
-
Victor Vieux authored
-
Michael Yang authored
omitting `--n-gpu-layers` means Metal is used on macOS, which isn't correct: ollama uses `num_gpu=0` to explicitly disable the GPU for file types that are not implemented in Metal
-
Bruce MacDonald authored
-
Michael Yang authored
fix memory check
-
Michael Yang authored
server: print version on start
-
Michael Yang authored
-
Bruce MacDonald authored
-
Sergey Kostyaev authored
-
- 15 Oct, 2023 8 commits
-
-
Yiorgis Gozadinos authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 14 Oct, 2023 3 commits
-
-
Jeffrey Morgan authored
-
Matt Williams authored
add how to quantize doc
-
Matt Williams authored
Signed-off-by: Matt Williams <m@technovangelist.com>
-
- 13 Oct, 2023 12 commits
-
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
fix: offloading on low end GPUs
-
Michael Yang authored
-
Michael Yang authored
-
Bruce MacDonald authored
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Bruce MacDonald authored
- remove new lines from llama.cpp error messages relayed to client
- check api option types and return error on wrong type
- change num layers from 95% VRAM to 92% VRAM
-