- 16 Dec, 2025 3 commits
Parth Sareen authored

Parth Sareen authored
--------- Co-authored-by: Daniel Hiltgen <daniel@ollama.com>

Grace authored
- 15 Dec, 2025 2 commits
Grace authored

Parth Sareen authored
- 09 Dec, 2025 2 commits
Parth Sareen authored

Parth Sareen authored
- 19 Nov, 2025 1 commit
Grace authored
- 16 Oct, 2025 2 commits
Jeffrey Morgan authored
Adds a temporary global flag to renderers that causes them to always render images as [img]. In a follow-up change, we will consider making this the default, and this flag could eventually be removed.
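A minimal sketch of what such a temporary global flag could look like in a Go renderer package; the names here (forceImgTag, renderImage) are illustrative assumptions, not Ollama's actual identifiers:

```go
package renderer

// forceImgTag is a hypothetical stand-in for the temporary global flag
// described above: when set, every renderer emits images as the literal
// placeholder "[img]" instead of its model-specific image token.
var forceImgTag = true

// renderImage returns the placeholder used for an image in the prompt.
func renderImage(modelSpecificTag string) string {
	if forceImgTag {
		return "[img]"
	}
	return modelSpecificTag
}
```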
Grace authored
* changing initial status to take into consideration prefill
* Add separate strings for content and thinking builder
* thinking tests
* remove white space from string before closing think tag
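As a rough illustration of the separate builders and the whitespace trimming mentioned in the bullets above (names and structure are assumptions, not the actual parser code):

```go
package main

import (
	"fmt"
	"strings"
)

// parser keeps thinking and content in separate builders instead of one
// shared string, as described above.
type parser struct {
	thinkingSB strings.Builder // text streamed inside <think>...</think>
	contentSB  strings.Builder // text streamed after the closing think tag
}

// closeThinking drops the whitespace that models often emit just before
// the closing think tag.
func (p *parser) closeThinking() string {
	return strings.TrimRight(p.thinkingSB.String(), " \t\r\n")
}

func main() {
	p := &parser{}
	p.thinkingSB.WriteString("step-by-step reasoning   \n")
	p.contentSB.WriteString("final answer")
	fmt.Printf("thinking=%q content=%q\n", p.closeThinking(), p.contentSB.String())
}
```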
- 14 Oct, 2025 1 commit
Devon Rifkin authored
- 13 Oct, 2025 1 commit
Grace authored
* working (other than tool calls being in the incorrect order) for tool calls and tools
* Tests work, other than image tags (tests do not go through server) and tools (not in the correct order, but contents are the same)
* testing for qwen3vl parser - tool parser is working
* made changes to JSON tool parser, wraps the ToolCallFunction with a ToolCall object
* Working parser for thinking models - assumes state of thinking, emits unambiguous content in thinking, does not call tool call in thinking
* changed the parser to start with collecting content
* thinking prefill
* add hasThinkingSupport parameter to parser
* qwen3-vl -> qwen3-vl-instruct for renderer/parser
* Add hasThinkingSupport=false to QwenVLParser

--------- Co-authored-by: Devon Rifkin <drifkin@drifkin.net>
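A hedged sketch of the "wraps the ToolCallFunction with a ToolCall object" change from the list above; the field names and JSON shape are assumptions for illustration, not the repository's exact types:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// ToolCallFunction holds the parsed function name and arguments.
type ToolCallFunction struct {
	Name      string         `json:"name"`
	Arguments map[string]any `json:"arguments"`
}

// ToolCall wraps the function so callers receive a tool-call object rather
// than a bare function payload.
type ToolCall struct {
	Function ToolCallFunction `json:"function"`
}

func main() {
	raw := `{"name": "get_weather", "arguments": {"city": "Toronto"}}`

	var fn ToolCallFunction
	if err := json.Unmarshal([]byte(raw), &fn); err != nil {
		panic(err)
	}

	out, _ := json.Marshal(ToolCall{Function: fn})
	fmt.Println(string(out)) // {"function":{"name":"get_weather","arguments":{"city":"Toronto"}}}
}
```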
- 30 Sep, 2025 1 commit
Devon Rifkin authored
- 15 Sep, 2025 2 commits
Devon Rifkin authored

Devon Rifkin authored
The format qwen3-coder uses is relatively unique, both in rendering and in parsing. To implement parsing, I wrote a custom parser in a similar style to harmony. For the rendering, I found that the logic would be much more difficult to follow in a template, so I introduced the concept of a built-in renderer that uses Go code, rather than a template, to generate prompts.

I set us up for future built-in parsers and renderers by making it so they can be specified in a Modelfile like so:

```
RENDERER "qwen3-coder"
PARSER "qwen3-coder"
```

These need to be provided explicitly because the architecture alone is not enough to understand what format the model expects to receive, and what format we expect it to output (e.g., qwen3-coder is `qwen3moe`, which includes other qwen3-family models as well).

I haven't converted harmony to be one of these "built-ins" yet, since some of it is in flux with the changes @ParthSareen has been making to move harmony to the runner. It is likely that many other built-ins will need to move to the runner as well, but I'm able to slightly defer that decision since qwen3-coder doesn't have thinking (and therefore doesn't need to be in the runner to make structured outputs work). I expect to unify harmony with this approach very soon.

Whether a particular model supports tools or thinking was previously inferred from templates, but without a template we now also use the parser itself to declare what it supports. If we have future models that re-use the same parsing format but have different capabilities, we'll want to parameterize them and give them different names to be specified as a `PARSER`.

Misc changes:
- I worked on the renderer by diffing outputs from the reference implementation and ours. To make it easier to do this, I extended <https://github.com/ollama/ollama/pull/11875> to also support returning the prompt via the OpenAI compat layer.
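To make the name-based lookup concrete, here is a small sketch of how a built-in renderer registry keyed by the RENDERER value might look; the interface and registry names are illustrative assumptions, not the actual Ollama code:

```go
package main

import "fmt"

// Renderer turns a conversation into a model-specific prompt using Go code
// instead of a template.
type Renderer interface {
	Render(messages []string) (string, error)
}

type qwen3CoderRenderer struct{}

func (qwen3CoderRenderer) Render(messages []string) (string, error) {
	// The real renderer would build the qwen3-coder prompt format here.
	return fmt.Sprintf("<qwen3-coder prompt for %d messages>", len(messages)), nil
}

// builtinRenderers maps the name given in a Modelfile RENDERER directive to
// an implementation; a PARSER registry would mirror this.
var builtinRenderers = map[string]Renderer{
	"qwen3-coder": qwen3CoderRenderer{},
}

func main() {
	r, ok := builtinRenderers["qwen3-coder"]
	if !ok {
		panic("unknown renderer")
	}
	prompt, _ := r.Render([]string{"write a hello world program"})
	fmt.Println(prompt)
}
```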