1. 15 Sep, 2025 1 commit
    • add qwen3-coder tool support · 47991940
      Devon Rifkin authored
      The format qwen3-coder uses is unusual, both in rendering and in
      parsing. To implement parsing, I wrote a custom parser in a similar
      style to harmony. For rendering, I found that the logic would be much
      more difficult to follow in a template, so I introduced the concept of
      a built-in renderer: one that uses Go code, rather than a template, to
      generate prompts.
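
      To give a rough idea of the shape this takes, here is a minimal
      sketch; all names below are illustrative, not the actual ollama API:

      ```go
      package renderers

      // Message and Tool are minimal stand-ins for ollama's chat types;
      // the real definitions live elsewhere in the codebase.
      type Message struct {
      	Role    string
      	Content string
      }

      type Tool struct {
      	Name        string
      	Description string
      }

      // Renderer is a hypothetical interface for a built-in renderer:
      // Go code that turns a conversation into a prompt string, in place
      // of a text/template.
      type Renderer interface {
      	Render(msgs []Message, tools []Tool) (string, error)
      }

      // qwen3CoderRenderer would hold the qwen3-coder-specific logic that
      // was too awkward to express in a template.
      type qwen3CoderRenderer struct{}

      func (qwen3CoderRenderer) Render(msgs []Message, tools []Tool) (string, error) {
      	// ... emit the qwen3-coder chat format: system prompt, tool
      	// definitions, per-turn markers, and so on.
      	return "", nil
      }

      // registry maps the name given in a Modelfile's RENDERER directive
      // to its implementation.
      var registry = map[string]Renderer{
      	"qwen3-coder": qwen3CoderRenderer{},
      }
      ```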
      
      I set us up for future built-in parsers and renderers by making it so
      they can be specified in a Modelfile like so:
      
      ```
      RENDERER "qwen3-coder"
      PARSER "qwen3-coder"
      ```
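
      In context, a complete Modelfile for such a model might look like
      this (the `FROM` line is illustrative):

      ```
      FROM ./qwen3-coder.gguf
      RENDERER "qwen3-coder"
      PARSER "qwen3-coder"
      ```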
      
      These need to be provided explicitly because the architecture alone is
      not enough to determine what format the model expects to receive and
      what format we expect it to output (e.g., qwen3-coder uses the
      `qwen3moe` architecture, which other qwen3-family models share as
      well).
      
      I haven't converted harmony to be one of these "built-ins" yet, since
      some of it is in flux with the changes @ParthSareen has been making to
      move harmony to the runner. It is likely that many other built-ins will
      need to move to the runner as well, but I'm able to slightly defer that
      decision since qwen3-coder doesn't have thinking (and therefore doesn't
      need to be in the runner to make structured outputs work). I expect to
      unify harmony with this approach very soon.
      
      Whether a particular model supports tools or thinking was previously
      inferred from its template, but for models without a template the
      parser itself now declares what it supports. If future models reuse
      the same parsing format but have different capabilities, we'll want to
      parameterize the parsers and give them different names to be specified
      as a `PARSER`.
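
      A hedged sketch of what that declaration could look like (the
      interface and method names here are hypothetical):

      ```go
      package parsers

      // Capabilities describes what a built-in parser understands; a
      // hypothetical stand-in for however this is actually modeled.
      type Capabilities struct {
      	Tools    bool
      	Thinking bool
      }

      // ToolCall is a stand-in for ollama's tool-call type.
      type ToolCall struct {
      	Name      string
      	Arguments map[string]any
      }

      // Parser is a hypothetical built-in parser interface. With no
      // template to inspect, the parser itself reports what it supports.
      type Parser interface {
      	Capabilities() Capabilities
      	// Parse splits raw model output into plain content and tool calls.
      	Parse(output string) (content string, calls []ToolCall, err error)
      }

      type qwen3CoderParser struct{}

      func (qwen3CoderParser) Capabilities() Capabilities {
      	// qwen3-coder supports tool calling but not thinking.
      	return Capabilities{Tools: true, Thinking: false}
      }

      func (qwen3CoderParser) Parse(output string) (string, []ToolCall, error) {
      	// ... walk the qwen3-coder output format, extracting tool calls.
      	return output, nil, nil
      }
      ```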
      
      Misc changes:
      
      - I worked on the renderer by diffing its output against the
        reference implementation's. To make this easier, I extended
        <https://github.com/ollama/ollama/pull/11875> to also support
        returning the prompt via the OpenAI compat layer.
  2. 08 May, 2025 1 commit
  3. 06 May, 2025 1 commit
    • Move quantization to new backend (#10363) · 42481045
      Daniel Hiltgen authored
      * Move quantization logic to GGML via new backend
      
      This moves the model-aware logic to Go code and calls GGML's quantization code for model creation.
      
      * Remove "add model quantizations"
      
      This is no longer needed now that quantization is implemented in Go+GGML code directly.
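
      Conceptually, the split looks something like this: Go owns the
      per-tensor policy, and GGML performs the numeric work. A minimal
      sketch, with all names illustrative rather than taken from the
      actual code:

      ```go
      package quantize

      import "fmt"

      // TensorType identifies a quantization format (illustrative subset).
      type TensorType int

      const (
      	TensorTypeQ4KM TensorType = iota
      	TensorTypeQ6K
      )

      // Tensor is a stand-in for the converter's tensor handle.
      type Tensor struct {
      	Name string
      	Data []float32
      }

      // keepHighPrecision encodes the kind of model-aware policy that now
      // lives in Go: some tensors are quantized less aggressively.
      func keepHighPrecision(name string) bool {
      	return name == "output.weight" || name == "token_embd.weight"
      }

      // ggmlQuantize stands in for the cgo call into GGML's quantization
      // routines, which perform the actual numeric conversion.
      func ggmlQuantize(t Tensor, tt TensorType) error {
      	fmt.Printf("quantizing %s as type %d via GGML\n", t.Name, tt)
      	return nil
      }

      // QuantizeModel applies the policy tensor by tensor.
      func QuantizeModel(tensors []Tensor, target TensorType) error {
      	for _, t := range tensors {
      		tt := target
      		if keepHighPrecision(t.Name) {
      			tt = TensorTypeQ6K
      		}
      		if err := ggmlQuantize(t, tt); err != nil {
      			return err
      		}
      	}
      	return nil
      }
      ```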
  4. 05 May, 2025 1 commit
  5. 21 Mar, 2025 1 commit
  6. 20 Mar, 2025 1 commit
  7. 14 Feb, 2025 1 commit
    • next ollama runner (#7913) · 58245413
      Michael Yang authored
      feat: add new Ollama engine using ggml through cgo
      
      This change introduces a new way to run pretrained models. It introduces three high-level interfaces and a number of smaller helper interfaces to facilitate this.
      
      - `model.Model` defines the interface for a model architecture. Models such as `llama` and `mllama`, which are provided as examples, can implement the model's forward propagation in the `Forward` method. This method will be called to generate completions. This interface can be found in `model/model.go`
      - `ml.Backend` defines the interface for a backend tensor library, in this case `ggml`. Among other things, a Backend is responsible for loading a pretrained model into hardware (GPU, CPU, etc.) and providing an interface for Models to access loaded tensors. This interface can be found in `ml/backend.go`
      - `ml.Tensor` defines the interface for a tensor and tensor operations (see the sketch after this list)
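
      A condensed sketch of how the three interfaces relate (simplified;
      the real definitions carry more methods, a compute context, and
      options):

      ```go
      package engine

      // Tensor abstracts a tensor and its operations (one representative
      // op shown; the real interface has many more).
      type Tensor interface {
      	Shape() []int
      	Mulmat(other Tensor) Tensor
      }

      // Backend abstracts the tensor library (here, ggml via cgo). It is
      // responsible for loading a pretrained model onto hardware and
      // handing out its weights as Tensors.
      type Backend interface {
      	Get(name string) Tensor
      }

      // Model is implemented once per architecture (llama, mllama, ...).
      // Forward runs forward propagation to produce next-token logits;
      // the real signature is richer than this sketch.
      type Model interface {
      	Forward(b Backend, inputs []int32) (Tensor, error)
      }
      ```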
      
      This is the first implementation of the new engine. Follow-up PRs will implement more features:
      
      - non-greedy sampling (#8410)
      - integration with Ollama and KV caching (#8301)
      - more model support (#9080) with more coming soon
      Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
  8. 21 Jan, 2025 1 commit
  9. 16 Jan, 2025 1 commit
  10. 11 Jan, 2025 1 commit
  11. 01 Jan, 2025 1 commit
  12. 10 Dec, 2024 1 commit
  13. 14 Nov, 2024 1 commit
  14. 06 Nov, 2024 1 commit
  15. 02 Aug, 2024 1 commit
  16. 27 Jul, 2024 1 commit
  17. 01 Jul, 2024 2 commits
  18. 27 Jun, 2024 2 commits
  19. 13 Jun, 2024 1 commit
  20. 04 Jun, 2024 1 commit
  21. 20 May, 2024 1 commit
  22. 07 May, 2024 1 commit
  23. 01 May, 2024 6 commits
  24. 25 Jan, 2024 1 commit
  25. 05 Jan, 2024 1 commit