- 07 Jan, 2026 1 commit
-
-
Devon Rifkin authored
In #13525, I accidentally broke templates' ability to automatically render tool call function arguments as JSON. We do need these to be proper maps because we need templates to be able to call range, which can't be done on custom types.
-
- 11 Dec, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 18 Nov, 2025 1 commit
-
-
Michael Yang authored
* migrate to golangci-lint v2 * copyloopvar
-
- 03 Oct, 2025 2 commits
-
-
Patrick Devine authored
-
Patrick Devine authored
-
- 07 Jul, 2025 1 commit
-
-
Parth Sareen authored
-
- 14 Feb, 2025 1 commit
-
-
Michael Yang authored
feat: add new Ollama engine using ggml through cgo This change introduces a new way to run pretrained models. It introduces 3 high level interfaces and a bunch of smaller helper interfaces to facilitate this. - `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go` - `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go` - `ml.Tensor` defines the interface for a tensor and tensor operations This is the first implementation of the new engine. Follow up PRs will implement more features: - non-greedy sampling (#8410) - integration with Ollama and KV caching (#8301) - more model support (#9080) with more coming soon Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 18 Oct, 2024 1 commit
-
-
Patrick Devine authored
Co-authored-by:
jmorganca <jmorganca@gmail.com> Co-authored-by:
Michael Yang <mxyng@pm.me> Co-authored-by:
Jesse Gross <jesse@ollama.com>
-
- 02 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 20 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 16 Jul, 2024 1 commit
-
-
Michael Yang authored
this change is triggered by the presence of "suffix", particularly useful for code completion tasks
-
- 12 Jul, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 11 Jul, 2024 3 commits
-
-
Michael Yang authored
This reverts commit 19753c18. for compat. messages will be added at a later date
-
Michael Yang authored
-
Michael Yang authored
-
- 05 Jul, 2024 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 01 Jul, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-