- 07 Jul, 2025 1 commit
-
-
Parth Sareen authored
-
- 07 Jun, 2025 1 commit
-
-
Jeffrey Morgan authored
This reverts commit 09430011.
-
- 04 Jun, 2025 1 commit
-
-
JasonHonKL authored
-
- 29 May, 2025 1 commit
-
-
Devon Rifkin authored
- Both `/api/generate` and `/api/chat` now accept a `"think"` option that allows specifying whether thinking mode should be on or not - Templates get passed this new option so, e.g., qwen3's template can put `/think` or `/no_think` in the system prompt depending on the value of the setting - Models' thinking support is inferred by inspecting model templates. The prefix and suffix the parser uses to identify thinking support is also automatically inferred from templates - Thinking control & parsing is opt-in via the API to prevent breaking existing API consumers. If the `"think"` option is not specified, the behavior is unchanged from previous versions of ollama - Add parsing for thinking blocks in both streaming/non-streaming mode in both `/generate` and `/chat` - Update the CLI to make use of these changes. Users can pass `--think` or `--think=false` to control thinking, or during an interactive session they can use the commands `/set think` or `/set nothink` - A `--hidethinking` option has also been added to the CLI. This makes it easy to use thinking in scripting scenarios like `ollama run qwen3 --think --hidethinking "my question here"` where you just want to see the answer but still want the benefits of thinking models
-
- 12 May, 2025 1 commit
-
-
Daniel Hiltgen authored
The quantization PR didn't block all unsupported file types, which this PR fixes. It also updates the API docs to reflect the now reduced set of supported types.
-
- 08 May, 2025 1 commit
-
-
Jeffrey Morgan authored
-
- 05 May, 2025 1 commit
-
-
Jeffrey Morgan authored
Some options listed in api/types.go are not supported in newer models, or have been deprecated in the past. This is the first of a series of PRs to clean up the API options
-
- 15 Apr, 2025 1 commit
-
-
Devon Rifkin authored
This is to prevent rendering bright red comments indicating invalid JSON when the comments are just supposed to be explanatory
-
- 01 Apr, 2025 1 commit
-
-
Bruce MacDonald authored
With support for multimodal models becoming more varied and common it is important for clients to be able to easily see what capabilities a model has. Retuning these from the show endpoint will allow clients to easily see what a model can do.
-
- 21 Mar, 2025 1 commit
-
-
Parth Sareen authored
-
- 07 Feb, 2025 1 commit
-
-
Azis Alvriyanto authored
-
- 02 Feb, 2025 1 commit
-
-
Davide Bertoni authored
-
- 29 Jan, 2025 1 commit
-
-
Parth Sareen authored
-
- 14 Jan, 2025 1 commit
-
-
Patrick Devine authored
-
- 29 Dec, 2024 1 commit
-
-
Anas Khan authored
Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
- 11 Dec, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 06 Dec, 2024 1 commit
-
-
Parth Sareen authored
-
- 30 Nov, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 19 Nov, 2024 1 commit
-
-
Patrick Devine authored
-
- 06 Nov, 2024 1 commit
-
-
Jesse Gross authored
Now that server.cpp is gone, we don't need to keep passing arguments that were only ignored and only kept for compatibility.
-
- 25 Sep, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 18 Sep, 2024 1 commit
-
-
Patrick Devine authored
-
- 10 Sep, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 07 Aug, 2024 2 commits
-
-
royjhan authored
-
Kyle Kelley authored
* Use llama3.1 in tools example * Update api.md
-
- 29 Jul, 2024 1 commit
-
-
Veit Heller authored
-
- 27 Jul, 2024 1 commit
-
-
Tibor Schmidt authored
-
- 26 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 22 Jul, 2024 2 commits
-
-
Michael Yang authored
-
royjhan authored
-
- 29 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 19 Jun, 2024 1 commit
-
-
royjhan authored
* API Show Extended * Initial Draft of Information Co-Authored-By:
Patrick Devine <pdevine@sonic.net> * Clean Up * Descriptive arg error messages and other fixes * Second Draft of Show with Projectors Included * Remove Chat Template * Touches * Prevent wrapping from files * Verbose functionality * Docs * Address Feedback * Lint * Resolve Conflicts * Function Name * Tests for api/show model info * Show Test File * Add Projector Test * Clean routes * Projector Check * Move Show Test * Touches * Doc update --------- Co-authored-by:
Patrick Devine <pdevine@sonic.net>
-
- 11 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 09 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 05 Jun, 2024 1 commit
-
-
royjhan authored
* API PS Documentation
-
- 13 May, 2024 1 commit
-
-
睡觉型学渣 authored
* Correct typos. * Correct typos.
-
- 09 May, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 06 May, 2024 1 commit
-
-
Darinka authored
* Update api.md Changed the calculation of tps (token/s) in the documentation * Update docs/api.md --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
- 03 May, 2024 1 commit
-
-
Dr Nic Williams authored
* Update 'llama2' -> 'llama3' in most places --------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
- 20 Apr, 2024 1 commit
-
-
Sri Siddhaarth authored
-