- 19 Dec, 2023 13 commits
-
-
Daniel Hiltgen authored
If someone checks out the ollama repo and doesn't install the CUDA library, this will ensure they can build a CPU only version
-
Daniel Hiltgen authored
A simple test case that verifies llava:7b can read text in an image
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This changes the container-based linux build to use an older Ubuntu distro to improve our compatibility matrix for older user machines
-
Daniel Hiltgen authored
-
65a authored
The build tags rocm or cuda must be specified to both go generate and go build. ROCm builds should have both ROCM_PATH set (and the ROCM SDK present) as well as CLBlast installed (for GGML) and CLBlast_DIR set in the environment to the CLBlast cmake directory (likely /usr/lib/cmake/CLBlast). Build tags are also used to switch VRAM detection between cuda and rocm implementations, using added "accelerator_foo.go" files which contain architecture specific functions and variables. accelerator_none is used when no tags are set, and a helper function addRunner will ignore it if it is the chosen accelerator. Fix go generate commands, thanks @deadmeu for testing.
-
Daniel Hiltgen authored
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.
-
Bruce MacDonald authored
-
Bruce MacDonald authored
- remove ggml runner - automatically pull gguf models when ggml detected - tell users to update to gguf in the case automatic pull fails Co-Authored-By:Jeffrey Morgan <jmorganca@gmail.com>
-
Matt Williams authored
Added cmdh to community section in README
-
Matt Williams authored
Lets get rid of these old modelfile examples
-
pgibler authored
-
Matt Williams authored
Signed-off-by:Matt Williams <m@technovangelist.com>
-
- 18 Dec, 2023 5 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Patrick Devine authored
-
Jeffrey Morgan authored
-
- 15 Dec, 2023 6 commits
-
-
Patrick Devine authored
-
Augustinas Malinauskas authored
* feat(docs): README with Enchanted iOS app * Update README.md --------- Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
Ian Purton authored
* Added Bionic GPT as a front end. * Update README.md --------- Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
Bruce MacDonald authored
-
Patrick Devine authored
-
Patrick Devine authored
-
- 14 Dec, 2023 2 commits
-
-
Bruce MacDonald authored
this info has not been returned from these endpoints in some time
-
Bruce MacDonald authored
* restore model load duration on generate response - set model load duration on generate and chat done response - calculate createAt time when response created * remove checkpoints predict opts * Update routes.go
-
- 13 Dec, 2023 5 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
* add multimodal docs * add chat api docs * consistency between `/api/generate` and `/api/chat` * simplify docs
-
pepperoni21 authored
-
- 12 Dec, 2023 6 commits
-
-
Patrick Devine authored
-
Michael Yang authored
remove per-model types
-
Jeffrey Morgan authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
- remove parallel
-
ruecat authored
-
- 11 Dec, 2023 3 commits
-
-
Jorge Torres authored
-
Patrick Devine authored
--------- Co-authored-by:Matt Apperson <mattapperson@Matts-MacBook-Pro.local>
-
Michael Yang authored
mostly replaced by decoding tensors except ggml models which only support llama
-