1. 07 Jul, 2025 1 commit
  2. 07 Jun, 2025 1 commit
  3. 04 Jun, 2025 1 commit
  4. 29 May, 2025 1 commit
    • Devon Rifkin's avatar
      add thinking support to the api and cli (#10584) · 5f57b0ef
      Devon Rifkin authored
      - Both `/api/generate` and `/api/chat` now accept a `"think"`
        option that allows specifying whether thinking mode should be on or
        not
      - Templates get passed this new option so, e.g., qwen3's template can
        put `/think` or `/no_think` in the system prompt depending on the
        value of the setting
      - Models' thinking support is inferred by inspecting model templates.
        The prefix and suffix the parser uses to identify thinking support is
        also automatically inferred from templates
      - Thinking control & parsing is opt-in via the API to prevent breaking
        existing API consumers. If the `"think"` option is not specified, the
        behavior is unchanged from previous versions of ollama
      - Add parsing for thinking blocks in both streaming/non-streaming mode
        in both `/generate` and `/chat`
      - Update the CLI to make use of these changes. Users can pass `--think`
        or `--think=false` to control thinking, or during an interactive
        session they can use the commands `/set think` or `/set nothink`
      - A `--hidethinking` option has also been added to the CLI. This makes
        it easy to use thinking in scripting scenarios like
        `ollama run qwen3 --think --hidethinking "my question here"` where you
        just want to see the answer but still want the benefits of thinking
        models
      5f57b0ef
  5. 12 May, 2025 1 commit
    • Daniel Hiltgen's avatar
      Follow up to #10363 (#10647) · 9d6df908
      Daniel Hiltgen authored
      The quantization PR didn't block all unsupported file types,
      which this PR fixes.  It also updates the API docs to reflect
      the now reduced set of supported types.
      9d6df908
  6. 08 May, 2025 1 commit
  7. 05 May, 2025 1 commit
  8. 15 Apr, 2025 1 commit
  9. 01 Apr, 2025 1 commit
  10. 21 Mar, 2025 1 commit
  11. 07 Feb, 2025 1 commit
  12. 02 Feb, 2025 1 commit
  13. 29 Jan, 2025 1 commit
  14. 14 Jan, 2025 1 commit
  15. 29 Dec, 2024 1 commit
  16. 11 Dec, 2024 1 commit
  17. 06 Dec, 2024 1 commit
  18. 30 Nov, 2024 1 commit
  19. 19 Nov, 2024 1 commit
  20. 06 Nov, 2024 1 commit
  21. 25 Sep, 2024 1 commit
  22. 18 Sep, 2024 1 commit
  23. 10 Sep, 2024 1 commit
  24. 07 Aug, 2024 2 commits
  25. 29 Jul, 2024 1 commit
  26. 27 Jul, 2024 1 commit
  27. 26 Jul, 2024 1 commit
  28. 22 Jul, 2024 2 commits
  29. 29 Jun, 2024 1 commit
  30. 19 Jun, 2024 1 commit
    • royjhan's avatar
      Extend api/show and ollama show to return more model info (#4881) · fedf7163
      royjhan authored
      
      
      * API Show Extended
      
      * Initial Draft of Information
      Co-Authored-By: default avatarPatrick Devine <pdevine@sonic.net>
      
      * Clean Up
      
      * Descriptive arg error messages and other fixes
      
      * Second Draft of Show with Projectors Included
      
      * Remove Chat Template
      
      * Touches
      
      * Prevent wrapping from files
      
      * Verbose functionality
      
      * Docs
      
      * Address Feedback
      
      * Lint
      
      * Resolve Conflicts
      
      * Function Name
      
      * Tests for api/show model info
      
      * Show Test File
      
      * Add Projector Test
      
      * Clean routes
      
      * Projector Check
      
      * Move Show Test
      
      * Touches
      
      * Doc update
      
      ---------
      Co-authored-by: default avatarPatrick Devine <pdevine@sonic.net>
      fedf7163
  31. 11 Jun, 2024 1 commit
  32. 09 Jun, 2024 1 commit
  33. 05 Jun, 2024 1 commit
  34. 13 May, 2024 1 commit
  35. 09 May, 2024 1 commit
  36. 06 May, 2024 1 commit
  37. 03 May, 2024 1 commit
  38. 20 Apr, 2024 1 commit