- 20 Mar, 2025 1 commit
-
-
rylativity authored
* updates parser/parser.go to allow arbitrary roles in Modelfile MESSAGE blocks
-
- 14 Feb, 2025 1 commit
-
-
Michael Yang authored
feat: add new Ollama engine using ggml through cgo This change introduces a new way to run pretrained models. It introduces 3 high level interfaces and a bunch of smaller helper interfaces to facilitate this. - `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go` - `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go` - `ml.Tensor` defines the interface for a tensor and tensor operations This is the first implementation of the new engine. Follow up PRs will implement more features: - non-greedy sampling (#8410) - integration with Ollama and KV caching (#8301) - more model support (#9080) with more coming soon Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 21 Jan, 2025 1 commit
-
-
frob authored
-
- 16 Jan, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 15 Jan, 2025 1 commit
-
-
Patrick Devine authored
-
- 11 Jan, 2025 1 commit
-
-
Patrick Devine authored
-
- 08 Jan, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 01 Jan, 2025 1 commit
-
-
Patrick Devine authored
Replaces `POST /api/create` to use JSON instead of a Modelfile. This is a breaking change.
-
- 10 Dec, 2024 1 commit
-
-
Stefan Weil authored
-
- 14 Nov, 2024 1 commit
-
-
Patrick Devine authored
-
- 06 Nov, 2024 1 commit
-
-
Jesse Gross authored
Now that server.cpp is gone, we don't need to keep passing arguments that were only ignored and only kept for compatibility.
-
- 02 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 27 Jul, 2024 1 commit
-
-
Tibor Schmidt authored
-
- 01 Jul, 2024 2 commits
- 27 Jun, 2024 2 commits
- 13 Jun, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
This reverts commit 66ab4877. this change broke utf-8 scanning of multi-byte runes
-
- 05 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 04 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 20 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 01 May, 2024 9 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 25 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 18 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
A few obvious levels were adjusted, but generally everything mapped to "info" level.
-
- 05 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 05 Dec, 2023 1 commit
-
-
Michael Yang authored
-
- 16 Oct, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 10 Aug, 2023 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-