1. 06 Jan, 2026 1 commit
    • preserve tool definition and call JSON ordering (#13525) · e51dead6
      Devon Rifkin authored
      * preserve tool definition and call JSON ordering
      
      This is another iteration of
      <https://github.com/ollama/ollama/pull/12518>, but this time we've
      simplified things by relaxing the competing requirements of being
      compatible AND order-preserving with templates (vs. renderers). We
      maintain backwards compatibility at the cost of not guaranteeing order
      for templates. We plan to move more and more models to renderers,
      which have been updated to use these new data types. Additionally,
      we could add an opt-in way for templates to receive an order-preserved
      list (e.g., via sibling template vars)
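
      Go's built-in map[string]any does not preserve key order when
      marshaled, which is the core problem this commit addresses. Below is
      a minimal, hypothetical sketch of the general technique — a
      slice-backed ordered map with a custom MarshalJSON — not ollama's
      actual orderedmap implementation:

      ```go
      // Hypothetical sketch of an insertion-order-preserving JSON map.
      // Names and structure here are illustrative, not ollama's real code.
      package main

      import (
      	"bytes"
      	"encoding/json"
      	"fmt"
      )

      type pair struct {
      	Key   string
      	Value any
      }

      // OrderedMap remembers the order in which keys were set.
      type OrderedMap struct {
      	pairs []pair
      }

      func (m *OrderedMap) Set(k string, v any) {
      	m.pairs = append(m.pairs, pair{Key: k, Value: v})
      }

      // MarshalJSON emits keys in insertion order, unlike map[string]any.
      func (m *OrderedMap) MarshalJSON() ([]byte, error) {
      	var buf bytes.Buffer
      	buf.WriteByte('{')
      	for i, p := range m.pairs {
      		if i > 0 {
      			buf.WriteByte(',')
      		}
      		k, err := json.Marshal(p.Key)
      		if err != nil {
      			return nil, err
      		}
      		buf.Write(k)
      		buf.WriteByte(':')
      		v, err := json.Marshal(p.Value)
      		if err != nil {
      			return nil, err
      		}
      		buf.Write(v)
      	}
      	buf.WriteByte('}')
      	return buf.Bytes(), nil
      }

      func main() {
      	m := &OrderedMap{}
      	m.Set("location", "Paris")
      	m.Set("unit", "celsius")
      	out, _ := json.Marshal(m)
      	fmt.Println(string(out)) // keys stay in insertion order
      }
      ```

      For tool calls, this matters because some models are sensitive to the
      argument order they were trained on, and a plain Go map would
      re-serialize arguments in an arbitrary order.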
      
      * orderedmap_test: remove testify
  2. 22 Oct, 2025 1 commit
  3. 25 Sep, 2025 1 commit
  4. 22 Aug, 2025 1 commit
  5. 05 Aug, 2025 1 commit
    • gpt-oss (#11672) · fa7776fd
      Michael Yang authored
      
      
      * bf16
      
      * tests
      
      * gpt-oss
      
      * enable gptoss for engine
      
      * rough estimate
      
      * convert to mxfp4
      
      * handle safetensors U8
      
      * clamp glu/linear
      
      * update tokenizer
      
      * MXFP4 support
      
      This implements the Open Compute Microscaling (MX) FP4 format
      as a tensor type, with backend implementations focusing
      on mul_mat and mul_mat_id on CPU, CUDA, and Metal.
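
      For reference, the MX spec packs FP4 in 32-element blocks: one shared
      E8M0 scale byte plus 16 bytes of E2M1 nibbles. The sketch below
      decodes one such block; it is an illustration of the format, not the
      ggml kernel, and the low-nibble-first ordering is an assumption:

      ```go
      // Illustrative MXFP4 block decode (not ollama/ggml's actual kernel).
      // A block is 1 E8M0 scale byte + 16 bytes holding 32 E2M1 nibbles.
      package main

      import (
      	"fmt"
      	"math"
      )

      // e2m1 maps the 3 magnitude bits of an FP4 E2M1 code to its value.
      var e2m1 = [8]float32{0, 0.5, 1, 1.5, 2, 3, 4, 6}

      // decodeBlock expands one MXFP4 block into 32 float32 values.
      func decodeBlock(scale byte, packed [16]byte) [32]float32 {
      	// E8M0 shared scale: a biased exponent, value = 2^(scale-127).
      	s := float32(math.Exp2(float64(int(scale) - 127)))
      	var out [32]float32
      	for i, b := range packed {
      		// Assumption: low nibble first within each byte.
      		for j, nib := range [2]byte{b & 0x0f, b >> 4} {
      			v := e2m1[nib&0x07]
      			if nib&0x08 != 0 { // top bit of the nibble is the sign
      				v = -v
      			}
      			out[2*i+j] = v * s
      		}
      	}
      	return out
      }

      func main() {
      	var packed [16]byte
      	packed[0] = 0x1f // low nibble 0xf (-6), high nibble 0x1 (+0.5)
      	vals := decodeBlock(127, packed) // scale byte 127 -> 2^0 = 1
      	fmt.Println(vals[0], vals[1])
      }
      ```

      The shared per-block scale is what lets MXFP4 cover a wide dynamic
      range while spending only 4 bits per element.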
      
      * Unit tests for MXFP4 support
      
      This exercises various operations and shapes on both CPU and GPU
      (if one is detected on the system).
      
      * cuda graph
      
      * unit test adjustments
      
      * cuda: optimize memory access
      
      Read 4 bytes at a time (8 elements) when performing mul_mat_vec_mxfp4
      
      * mac: fix crash on old macos versions
      
      cblas_sgemm is only supported on macOS 13.3 and up; however, bf16 is
      only supported on macOS 14+, so we were falling back to ggml-blas and
      crashing on bf16 tensors. Checking whether the function is null
      seems to be the simplest way to conditionally avoid registering the
      backend.
      
      * server: Minimum context length for gptoss
      
      This model requires a minimum context length of 8192 to function
      effectively. Users can set higher values through all normal mechanisms
      but lower values will be silently reset.
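
      A minimal sketch of the silent-reset behavior described above —
      the constant and function names are illustrative, not ollama's
      actual API:

      ```go
      // Sketch: clamp a requested context length up to a model minimum.
      // Names here are assumptions for illustration only.
      package main

      import "fmt"

      const gptossMinContext = 8192

      // effectiveContext silently raises too-small values to the minimum;
      // larger user-requested values pass through unchanged.
      func effectiveContext(requested int) int {
      	if requested < gptossMinContext {
      		return gptossMinContext
      	}
      	return requested
      }

      func main() {
      	fmt.Println(effectiveContext(2048))  // below minimum: reset
      	fmt.Println(effectiveContext(16384)) // above minimum: kept
      }
      ```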
      
      * ggml: Multiply by numParallel for gptoss sliding window
      
      When computing the graph size estimate, the context size is already
      multiplied by numParallel so estimates reflect that. However, since
      sliding window models use a smaller, fixed context size, they need
      to manually take numParallel into account.
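
      The fix can be sketched as follows, assuming hypothetical names: the
      full context arrives already multiplied by numParallel, so the
      sliding window must be scaled the same way before taking the
      minimum:

      ```go
      // Illustrative sketch of the estimate fix (not ollama's real code):
      // scale the fixed sliding window by numParallel so it is comparable
      // to the already-scaled full context size.
      package main

      import "fmt"

      func kvCacheEntries(contextLen, slidingWindow, numParallel int) int {
      	total := contextLen * numParallel // already scaled upstream
      	if slidingWindow > 0 {
      		windowed := slidingWindow * numParallel // the fix: scale too
      		if windowed < total {
      			return windowed
      		}
      	}
      	return total
      }

      func main() {
      	// 8192-token context, 128-token window, 4 parallel sequences.
      	fmt.Println(kvCacheEntries(8192, 128, 4))
      }
      ```

      Without the numParallel factor on the window, the estimate would
      undercount KV cache memory by a factor of numParallel for
      sliding-window models.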
      
      * gpt-oss integration
      
      includes harmony parser and thinking levels, etc.
      
      * fix sync
      
      * fix tests
      
      * fix lint
      
      ---------
      Co-authored-by: Daniel Hiltgen <daniel@ollama.com>
      Co-authored-by: Jesse Gross <jesse@ollama.com>
      Co-authored-by: Devon Rifkin <drifkin@drifkin.net>
  6. 24 Jul, 2025 1 commit
  7. 20 Jul, 2025 1 commit
  8. 30 Jun, 2025 1 commit
  9. 18 Jun, 2025 1 commit
  10. 17 Jun, 2025 1 commit
  11. 12 Jun, 2025 1 commit
  12. 27 May, 2025 2 commits
  13. 23 May, 2025 1 commit