1. 07 Jul, 2025 1 commit
  2. 07 Jun, 2025 1 commit
  3. 04 Jun, 2025 1 commit
  4. 29 May, 2025 1 commit
    • Devon Rifkin's avatar
      add thinking support to the api and cli (#10584) · 5f57b0ef
      Devon Rifkin authored
      - Both `/api/generate` and `/api/chat` now accept a `"think"`
        option that allows specifying whether thinking mode should be on or
        not
      - Templates get passed this new option so, e.g., qwen3's template can
        put `/think` or `/no_think` in the system prompt depending on the
        value of the setting
      - Models' thinking support is inferred by inspecting model templates.
        The prefix and suffix the parser uses to identify thinking support is
        also automatically inferred from templates
      - Thinking control & parsing is opt-in via the API to prevent breaking
        existing API consumers. If the `"think"` option is not specified, the
        behavior is unchanged from previous versions of ollama
      - Add parsing for thinking blocks in both streaming/non-streaming mode
        in both `/generate` and `/chat`
      - Update the CLI to make use of these changes. Users can pass `--think`
        or `--think=false` to control thinking, or during an interactive
        session they can use the commands `/se...
      5f57b0ef
  5. 27 May, 2025 1 commit
  6. 08 May, 2025 2 commits
  7. 07 May, 2025 1 commit
  8. 05 May, 2025 1 commit
  9. 24 Apr, 2025 1 commit
  10. 10 Apr, 2025 1 commit
  11. 08 Apr, 2025 1 commit
  12. 07 Apr, 2025 1 commit
  13. 02 Apr, 2025 1 commit
  14. 01 Apr, 2025 1 commit
  15. 13 Mar, 2025 1 commit
  16. 05 Mar, 2025 1 commit
    • Blake Mizerany's avatar
      server/internal/registry: take over pulls from server package (#9485) · e2252d0f
      Blake Mizerany authored
      This commit replaces the old pull implementation in the server package
      with the new, faster, more robust pull implementation in the registry
      package.
      
      The new endpoint, and now the remove endpoint too, are behind the
      feature gate "client2" enabled only by setting the OLLAMA_EXPERIMENT
      environment variable include "client2".
      
      Currently, the progress indication is wired to perform the same as the
      previous implementation to avoid making changes to the CLI, and because
      the status reports happen at the start of the download, and the end of
      the write to disk, the progress indication is not as smooth as it could
      be. This is a known issue and will be addressed in a future change.
      
      This implementation may be ~0.5-1.0% slower in rare cases, depending on
      network and disk speed, but is generally MUCH faster and more robust
      than the its predecessor in all other cases.
      e2252d0f
  17. 27 Feb, 2025 1 commit
  18. 24 Feb, 2025 1 commit
  19. 20 Feb, 2025 1 commit
    • Bruce MacDonald's avatar
      api: document client stream behavior with a test (#8996) · 14b5a9a1
      Bruce MacDonald authored
      Added unit tests to verify error handling behavior in the Client.stream and Client.do methods.
      Tests cover various error scenarios including:
      - Error responses with status codes >= 400
      - Error messages with successful status codes
      - Empty error messages
      - Successful responses
      14b5a9a1
  20. 07 Feb, 2025 1 commit
  21. 13 Jan, 2025 1 commit
  22. 08 Jan, 2025 1 commit
  23. 03 Jan, 2025 1 commit
    • Bruce MacDonald's avatar
      api: remove unused create fields · 29a8975c
      Bruce MacDonald authored
      These fields are deprecated, but specifying them will not do anything. Removing them as the other deprecated fields will still work, but these do not, so they dont match our existing pattern.
      29a8975c
  24. 01 Jan, 2025 1 commit
  25. 11 Dec, 2024 1 commit
  26. 05 Dec, 2024 2 commits
  27. 30 Nov, 2024 1 commit
  28. 12 Nov, 2024 1 commit
  29. 11 Nov, 2024 1 commit
  30. 06 Nov, 2024 1 commit
  31. 28 Aug, 2024 1 commit
  32. 14 Aug, 2024 1 commit
  33. 06 Aug, 2024 1 commit
  34. 05 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Implement linux NUMA detection · f457d634
      Daniel Hiltgen authored
      If the system has multiple numa nodes, enable numa support in llama.cpp
      If we detect numactl in the path, use that, else use the basic "distribute" mode.
      f457d634
  35. 02 Aug, 2024 1 commit
  36. 30 Jul, 2024 1 commit
  37. 29 Jul, 2024 1 commit
  38. 27 Jul, 2024 1 commit