1. 23 Sep, 2025 1 commit
      auth: fix problems with the ollama keypairs (#12373) · 64883e3c
      Patrick Devine authored
      * auth: fix problems with the ollama keypairs
      
      This change adds several fixes including:
        - reading in the pubkey files correctly
        - fixing the push unit test to create a keypair file in a temp directory
        - not returning 500 errors for normal status errors
      64883e3c
  2. 17 Sep, 2025 1 commit
  3. 16 Jul, 2025 1 commit
      api: fix unreachable status err (#11423) · 92c2e8a5
      Bruce MacDonald authored
      StatusError was unreachable: the client always checked for error messages in the response body first, and the server always includes error messages with HTTP error status codes.
      92c2e8a5
  4. 27 May, 2025 1 commit
  5. 27 Feb, 2025 1 commit
  6. 20 Feb, 2025 1 commit
      api: document client stream behavior with a test (#8996) · 14b5a9a1
      Bruce MacDonald authored
      Added unit tests to verify error handling behavior in the Client.stream and Client.do methods.
      Tests cover various error scenarios including:
      - Error responses with status codes >= 400
      - Error messages with successful status codes
      - Empty error messages
      - Successful responses
      14b5a9a1
  7. 11 Nov, 2024 1 commit
  8. 14 Aug, 2024 1 commit
  9. 02 Aug, 2024 1 commit
  10. 22 Jul, 2024 1 commit
  11. 15 Jul, 2024 1 commit
      Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c
      royjhan authored
      * Initial Batch Embedding
      
      * Revert "Initial Batch Embedding"
      
      This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.
      
      * Initial Draft
      
      * mock up notes
      
      * api/embed draft
      
      * add server function
      
      * check normalization
      
      * clean up
      
      * normalization
      
      * playing around with truncate stuff
      
      * Truncation
      
      * Truncation
      
      * move normalization to go
      
      * Integration Test Template
      
      * Truncation Integration Tests
      
      * Clean up
      
      * use float32
      
      * move normalize
      
      * move normalize test
      
      * refactoring
      
      * integration float32
      
      * input handling and handler testing
      
      * Refactoring of legacy and new
      
      * clear comments
      
      * merge conflicts
      
      * touches
      
      * embedding type 64
      
      * merge conflicts
      
      * fix hanging on single string
      
      * refactoring
      
      * test values
      
      * set context length
      
      * clean up
      
      * testing clean up
      
      * testing clean up
      
      * remove function closure
      
      * Revert "remove function closure"
      
      This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.
      
      * remove function closure
      
      * remove redundant error check
      
      * clean up
      
      * more clean up
      
      * clean up
      b9f5e16c
  12. 12 Jun, 2024 1 commit
  13. 06 Jun, 2024 1 commit
  14. 14 May, 2024 1 commit
  15. 07 May, 2024 1 commit
  16. 29 Apr, 2024 1 commit
  17. 23 Apr, 2024 1 commit
      Request and model concurrency · 34b9db5a
      Daniel Hiltgen authored
      This change adds support for multiple concurrent requests, as well as
      loading multiple models by spawning multiple runners. The default
      settings are currently set at 1 concurrent request per model and only 1
      loaded model at a time, but these can be adjusted by setting
      OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
      34b9db5a
  18. 10 Apr, 2024 1 commit
  19. 08 Apr, 2024 1 commit
  20. 26 Mar, 2024 1 commit
  21. 20 Feb, 2024 1 commit
  22. 04 Jan, 2024 1 commit
  23. 05 Dec, 2023 3 commits
  24. 04 Dec, 2023 1 commit
      chat api (#991) · 7a0899d6
      Bruce MacDonald authored
      - update chat docs
      - add messages chat endpoint
      - remove deprecated context and template generate parameters from docs
      - context and template are still supported for the time being and will continue to work as expected
      - add partial response to chat history
      7a0899d6
  25. 15 Nov, 2023 2 commits
  26. 02 Nov, 2023 1 commit
  27. 27 Oct, 2023 1 commit
  28. 26 Oct, 2023 1 commit
  29. 20 Oct, 2023 1 commit
  30. 13 Oct, 2023 1 commit
  31. 11 Oct, 2023 2 commits
  32. 09 Oct, 2023 1 commit
  33. 04 Oct, 2023 1 commit
  34. 06 Sep, 2023 1 commit
  35. 28 Aug, 2023 1 commit
  36. 26 Aug, 2023 1 commit