"vscode:/vscode.git/clone" did not exist on "ce597c639c66e0d049e0a42c33e56db07844f21f"
- 03 Apr, 2025 1 commit
-
-
Bruce MacDonald authored
Mistral is a popular research lab making open source models. This updates the forward pass of llama architecture models to support both llama models and mistral models by accounting for additional metadata present in mistral models, and finding the correct dimensions for the output projection.
-
- 21 Mar, 2025 1 commit
-
-
Parth Sareen authored
This reverts commit ffbfe833.
-
- 20 Mar, 2025 1 commit
-
-
rylativity authored
* updates parser/parser.go to allow arbitrary roles in Modelfile MESSAGE blocks
-
- 14 Feb, 2025 1 commit
-
-
Michael Yang authored
feat: add new Ollama engine using ggml through cgo This change introduces a new way to run pretrained models. It introduces 3 high level interfaces and a bunch of smaller helper interfaces to facilitate this. - `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go` - `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go` - `ml.Tensor` defines the interface for a tensor and tensor operations This is the first implementation of the new engine. Follow up PRs will implement more features: - non-greedy sampling (#8410) - integration with Ollama and KV caching (#8301) - more model support (#9080) with more coming soon Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 21 Jan, 2025 1 commit
-
-
frob authored
-
- 16 Jan, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 15 Jan, 2025 1 commit
-
-
Patrick Devine authored
-
- 11 Jan, 2025 1 commit
-
-
Patrick Devine authored
-
- 08 Jan, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 01 Jan, 2025 1 commit
-
-
Patrick Devine authored
Replaces `POST /api/create` to use JSON instead of a Modelfile. This is a breaking change.
-
- 10 Dec, 2024 1 commit
-
-
Stefan Weil authored
-
- 14 Nov, 2024 1 commit
-
-
Patrick Devine authored
-
- 06 Nov, 2024 1 commit
-
-
Jesse Gross authored
Now that server.cpp is gone, we don't need to keep passing arguments that were only ignored and only kept for compatibility.
-
- 02 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 27 Jul, 2024 1 commit
-
-
Tibor Schmidt authored
-
- 01 Jul, 2024 2 commits
- 27 Jun, 2024 2 commits
- 13 Jun, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
This reverts commit 66ab4877. this change broke utf-8 scanning of multi-byte runes
-
- 05 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 04 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 20 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 01 May, 2024 9 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 25 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 18 Jan, 2024 1 commit
-
-
Daniel Hiltgen authored
A few obvious levels were adjusted, but generally everything mapped to "info" level.
-
- 05 Jan, 2024 1 commit
-
-
Patrick Devine authored
-
- 05 Dec, 2023 1 commit
-
-
Michael Yang authored
-
- 16 Oct, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 10 Aug, 2023 1 commit
-
-
Michael Yang authored
-