- 11 Dec, 2025 1 commit
  - Jeffrey Morgan authored
- 09 Dec, 2025 4 commits
  - nicole pardal authored
  - Parth Sareen authored
  - Parth Sareen authored
  - Jeffrey Morgan authored
- 08 Dec, 2025 1 commit
  - Michael Yang authored
    Change to a flatter directory structure and group the options with the function; update models to call rope in one place.
- 02 Dec, 2025 1 commit
  - Patrick Devine authored
    This change:
    * fixes rope scaling in the mistral converter
    * updates ministral to include llama4 scaling
    * includes a new ministral parser for parsing reasoning and tool calling
    Co-authored-by: jmorganca <jmorganca@gmail.com>
- 20 Nov, 2025 2 commits
  - Grace authored
  - Michael Yang authored
    The check for MLA omits v3 and r1, which should not return unsupported. Instead, check the tokenizer for compatibility.
- 19 Nov, 2025 4 commits
  - Patrick Devine authored
  - Grace authored
  - nicole pardal authored
  - Michael Yang authored
- 18 Nov, 2025 2 commits
  - Michael Yang authored
  - Grace authored
    * Add mla for flash attention
    * Revert to using chunks
- 13 Nov, 2025 1 commit
  - Michael Yang authored
    * use slice/chunks
    * bert
    * llama4
    * gemma3n
    * gptoss
    * mistral3
    * qwen3vl
    * qwen25vl
    * deepseek2
    * remove unused ops
- 06 Nov, 2025 1 commit
  - Daniel Hiltgen authored
- 03 Nov, 2025 1 commit
  - Michael Yang authored
- 30 Oct, 2025 2 commits
  - Michael Yang authored
    * ml(ggml): mrope
    * interleave mrope
  - Michael Yang authored
    This change fixes images with an alpha channel by overlaying the image onto a white background.
- 29 Oct, 2025 2 commits
  - Grace authored
    Trims extra whitespace at the beginning and end of content.
  - Michael Yang authored
- 28 Oct, 2025 2 commits
  - Michael Yang authored
  - Michael Yang authored
- 20 Oct, 2025 1 commit
  - Jeffrey Morgan authored
- 18 Oct, 2025 1 commit
  - Daniel Hiltgen authored
    Co-authored-by: Michael Yang <git@mxy.ng>
- 16 Oct, 2025 2 commits
  - Jeffrey Morgan authored
    Adds a temporary global flag to renderers that causes them to always render images as [img]. In a follow-up change, we will consider making this the default, at which point this flag could eventually be removed.
  - Grace authored
    * changing initial status to take prefill into consideration
    * add separate strings for content and thinking builder
    * thinking tests
    * remove whitespace from string before closing think tag
- 14 Oct, 2025 2 commits
  - Devon Rifkin authored
  - Devon Rifkin authored
- 13 Oct, 2025 2 commits
  - Grace authored
    * working (other than tool calls being in the incorrect order) for tool calls and tools
    * tests work, other than image tags (tests do not go through the server) and tools (not in the correct order, but contents are the same)
    * testing for qwen3vl parser - tool parser is working
    * made changes to the JSON tool parser, wrapping the ToolCallFunction with a ToolCall object
    * working parser for thinking models - assumes a state of thinking, emits unambiguous content in thinking, does not call tools while thinking
    * changed the parser to start by collecting content
    * thinking prefill
    * add hasThinkingSupport parameter to parser
    * qwen3-vl -> qwen3-vl-instruct for renderer/parser
    * add hasThinkingSupport=false to QwenVLParser
    Co-authored-by: Devon Rifkin <drifkin@drifkin.net>
  - Michael Yang authored
    DeepSeek's qwen3 distill uses a different rope scheme, so support both.
- 10 Oct, 2025 1 commit
  - yajianggroup authored
    Signed-off-by: yajianggroup <yajianggroup@outlook.com>
- 09 Oct, 2025 2 commits
  - shengxinjing authored
  - shengxinjing authored
- 03 Oct, 2025 1 commit
  - Grace authored
- 30 Sep, 2025 1 commit
  - Devon Rifkin authored
- 25 Sep, 2025 1 commit
  - Devon Rifkin authored
    When trimming whitespace at the end of every chunk, we were iterating backwards over the string byte-by-byte instead of rune-by-rune. As an example of how this can cause corruption, consider the multi-byte character ✅ (`"\u2705"`), which is represented in UTF-8 as the three bytes `0xE2 0x9C 0x85`. It happens that `0x85` is NEL, which passes `unicode.IsSpace()`. Because we were iterating byte-by-byte, we could mistakenly slice in the middle of the rune, removing `0x85` and leaving `0xE2 0x9C` - beyond being the incorrect place to slice, this is not even valid UTF-8. `trailingWhitespaceLen()` was modified to count from the end in a rune-aware way. Tests with various multibyte unicode characters were also added.
    Fixes: #12414
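The rune-aware fix can be sketched as follows. This is an illustrative reimplementation of a `trailingWhitespaceLen`-style helper, not the repository's exact code:

```go
package main

import (
	"fmt"
	"unicode"
	"unicode/utf8"
)

// trailingWhitespaceLen counts the bytes of trailing whitespace in s by
// decoding whole runes from the end, so a multi-byte character such as
// ✅ (0xE2 0x9C 0x85) is never split, even though its final byte 0x85 is
// NEL, which passes unicode.IsSpace when examined on its own.
func trailingWhitespaceLen(s string) int {
	n := 0
	for len(s) > 0 {
		r, size := utf8.DecodeLastRuneInString(s)
		if !unicode.IsSpace(r) {
			break
		}
		n += size
		s = s[:len(s)-size]
	}
	return n
}

func main() {
	fmt.Println(trailingWhitespaceLen("ok✅"))    // 0: the rune is not whitespace
	fmt.Println(trailingWhitespaceLen("ok \t\n")) // 3: three single-byte space characters
}
```

Using `utf8.DecodeLastRuneInString` is what makes the loop rune-aware: it steps back by one whole encoded character per iteration, so the slice boundary always falls between runes.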
- 24 Sep, 2025 2 commits
  - Grace authored
    * init deepseek model file
    * temp removal of flash attention implementation
    * shapes are proper, can make a pass
    * query, key, value have good cosine similarity, but the max diff is a bit high
    * attention block is working (with eager for now; have not added the mask line)
    * working MoE at around 0.95 cosine sim
    * added cosine similarity function
    * starting end-to-end structure
    * trying (and failing) to get rope to work; going to test the full thing on tater
    * running on tater36... just not the right outputs
    * we have the right values for rope... but it's still not working?
    * change Extrapolation Factor to 1
    * removed adding residuals twice, removed normalization from the shared expert, refactored norms (Attention, MLP) to be outside the (Attention, MLP) blocks and in the Transformer block instead, added cache setLayer
    * temporary modelfiles for cpu
    * change kpass intermediate step to kv; two layer outputs [0,1] look fine
    * this calls for 16 chicken nuggets
    * whoops
    * cleaning up code
    * delete stuff we don't need
    * getting rid of debug statements for llama cpp
    * working with long contexts
    * fix long context view error
    * reverting some changes I made to files that are not a part of this pr
    * added proper tokenizer for deepseek3
    * clean up model and go test
    * remove Modelfile
    * not passing the tests
    * how to pass the ci tests
    * resolving some of the comments
    * rename
    * linted and renamed deepseek3 -> deepseek2
    * remove name go
    * addressed changes - the main change was adopting the qwen3 naming scheme
    * I cannot with linters
    * clean up logs
    Co-authored-by: Grace Guo <graceguo@Graces-MBP.localdomain>
    Co-authored-by: Grace Guo <graceguo@Graces-MacBook-Pro.local>
    Co-authored-by: graceguo <graceguo@tater36.localdomain>
  - Michael Yang authored
    A leaf node with an alternative name gets all of its alternative names added into the same branch, rather than those names creating branches themselves.