1. 02 Apr, 2025 1 commit
  2. 01 Apr, 2025 1 commit
  3. 13 Mar, 2025 1 commit
  4. 05 Mar, 2025 1 commit
      server/internal/registry: take over pulls from server package (#9485) · e2252d0f
      Blake Mizerany authored
      This commit replaces the old pull implementation in the server package
      with the new, faster, more robust pull implementation in the registry
      package.
      
      The new endpoint, and now the remove endpoint too, are behind the
      feature gate "client2", which is enabled only by setting the
      OLLAMA_EXPERIMENT environment variable to include "client2".
      
      Currently, the progress indication is wired to perform the same as the
      previous implementation to avoid making changes to the CLI, and because
      the status reports happen at the start of the download, and the end of
      the write to disk, the progress indication is not as smooth as it could
      be. This is a known issue and will be addressed in a future change.
      
      This implementation may be ~0.5-1.0% slower in rare cases, depending on
      network and disk speed, but is generally MUCH faster and more robust
      than its predecessor in all other cases.
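
The feature gate described above could be checked roughly as follows. This is a minimal sketch, not the code from this commit: the helper name `experimentEnabled` is hypothetical, and it assumes the variable holds a comma-separated list of experiment names, which the commit message does not state.

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// experimentEnabled reports whether name appears in value, which is
// assumed (not confirmed by the commit) to be a comma-separated list.
// Whole-entry matching avoids treating "client22" as "client2".
func experimentEnabled(value, name string) bool {
	for _, field := range strings.Split(value, ",") {
		if strings.TrimSpace(field) == name {
			return true
		}
	}
	return false
}

func main() {
	// OLLAMA_EXPERIMENT and "client2" are the names given in the commit message.
	v := os.Getenv("OLLAMA_EXPERIMENT")
	fmt.Println("client2 enabled:", experimentEnabled(v, "client2"))
}
```

Under these assumptions, running the server with `OLLAMA_EXPERIMENT=client2` would turn the gate on, and leaving the variable unset would keep the old behavior.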
  5. 27 Feb, 2025 1 commit
  6. 24 Feb, 2025 1 commit
  7. 20 Feb, 2025 1 commit
      api: document client stream behavior with a test (#8996) · 14b5a9a1
      Bruce MacDonald authored
      Added unit tests to verify error handling behavior in the Client.stream and Client.do methods.
      Tests cover various error scenarios including:
      - Error responses with status codes >= 400
      - Error messages with successful status codes
      - Empty error messages
      - Successful responses
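
The scenarios listed above can be illustrated with a small classifier. This is only a sketch: `checkChunk` is a hypothetical helper, not the actual `Client.stream` or `Client.do` code, and it assumes the server reports failures in a JSON `error` field.

```go
package main

import (
	"encoding/json"
	"errors"
	"fmt"
)

// checkChunk classifies one response chunk (hypothetical helper).
// It returns an error when the status code is >= 400 or when the body
// carries a non-empty "error" field, even with a successful status.
func checkChunk(status int, body []byte) error {
	var payload struct {
		Error string `json:"error"`
	}
	// A body that is not valid JSON simply yields an empty error message.
	_ = json.Unmarshal(body, &payload)

	switch {
	case payload.Error != "":
		return errors.New(payload.Error)
	case status >= 400:
		// Empty error message: fall back to the status code.
		return fmt.Errorf("request failed with status %d", status)
	}
	return nil
}

func main() {
	fmt.Println(checkChunk(200, []byte(`{"response":"hi"}`)))
	fmt.Println(checkChunk(200, []byte(`{"error":"boom"}`)))
	fmt.Println(checkChunk(500, []byte(`{"error":""}`)))
}
```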
  8. 07 Feb, 2025 1 commit
  9. 13 Jan, 2025 1 commit
  10. 08 Jan, 2025 1 commit
  11. 03 Jan, 2025 1 commit
      api: remove unused create fields · 29a8975c
      Bruce MacDonald authored
      These fields are deprecated, and specifying them has no effect. Remove them: the other deprecated fields still work, but these do not, so they don't match our existing pattern.
  12. 01 Jan, 2025 1 commit
  13. 11 Dec, 2024 1 commit
  14. 05 Dec, 2024 2 commits
  15. 30 Nov, 2024 1 commit
  16. 12 Nov, 2024 1 commit
  17. 11 Nov, 2024 1 commit
  18. 06 Nov, 2024 1 commit
  19. 28 Aug, 2024 1 commit
  20. 14 Aug, 2024 1 commit
  21. 06 Aug, 2024 1 commit
  22. 05 Aug, 2024 1 commit
      Implement linux NUMA detection · f457d634
      Daniel Hiltgen authored
      If the system has multiple NUMA nodes, enable NUMA support in llama.cpp.
      If we detect numactl in the path, use that; otherwise use the basic "distribute" mode.
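
The selection logic described above might look like the sketch below. It is not the code from this commit: the function names are hypothetical, and counting `node<N>` entries under `/sys/devices/system/node` is an assumed detection method.

```go
package main

import (
	"fmt"
	"os/exec"
	"path/filepath"
)

// countNUMANodes counts entries named node<N> under dir.
// Assumption: on Linux, dir is /sys/devices/system/node.
func countNUMANodes(dir string) int {
	matches, err := filepath.Glob(filepath.Join(dir, "node[0-9]*"))
	if err != nil {
		return 0
	}
	return len(matches)
}

// chooseNUMAMode returns the mode to request, or "" when NUMA support
// should stay off because the system has fewer than two nodes.
func chooseNUMAMode(nodes int, haveNumactl bool) string {
	if nodes < 2 {
		return ""
	}
	if haveNumactl {
		return "numactl"
	}
	return "distribute"
}

func main() {
	nodes := countNUMANodes("/sys/devices/system/node")
	_, err := exec.LookPath("numactl") // is numactl on the PATH?
	fmt.Printf("nodes=%d mode=%q\n", nodes, chooseNUMAMode(nodes, err == nil))
}
```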
  23. 02 Aug, 2024 1 commit
  24. 30 Jul, 2024 1 commit
  25. 29 Jul, 2024 1 commit
  26. 27 Jul, 2024 1 commit
  27. 22 Jul, 2024 2 commits
  28. 18 Jul, 2024 1 commit
  29. 17 Jul, 2024 1 commit
  30. 16 Jul, 2024 4 commits
  31. 15 Jul, 2024 3 commits
      tools · d02bbebb
      Michael Yang authored
      Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c
      royjhan authored
      * Initial Batch Embedding
      
      * Revert "Initial Batch Embedding"
      
      This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.
      
      * Initial Draft
      
      * mock up notes
      
      * api/embed draft
      
      * add server function
      
      * check normalization
      
      * clean up
      
      * normalization
      
      * playing around with truncate stuff
      
      * Truncation
      
      * Truncation
      
      * move normalization to go
      
      * Integration Test Template
      
      * Truncation Integration Tests
      
      * Clean up
      
      * use float32
      
      * move normalize
      
      * move normalize test
      
      * refactoring
      
      * integration float32
      
      * input handling and handler testing
      
      * Refactoring of legacy and new
      
      * clear comments
      
      * merge conflicts
      
      * touches
      
      * embedding type 64
      
      * merge conflicts
      
      * fix hanging on single string
      
      * refactoring
      
      * test values
      
      * set context length
      
      * clean up
      
      * testing clean up
      
      * testing clean up
      
      * remove function closure
      
      * Revert "remove function closure"
      
      This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.
      
      * remove function closure
      
      * remove redundant error check
      
      * clean up
      
      * more clean up
      
      * clean up
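
Several steps above mention moving normalization into Go and using float32. The sketch below shows one way that could look, assuming the normalization is L2 (unit-length) scaling, which the commit message does not confirm. The name `normalize` is illustrative.

```go
package main

import (
	"fmt"
	"math"
)

// normalize returns a copy of vec scaled to unit L2 length.
// The sum is accumulated in float64 to limit rounding error; an
// all-zero vector yields an all-zero result to avoid dividing by zero.
func normalize(vec []float32) []float32 {
	var sum float64
	for _, v := range vec {
		sum += float64(v) * float64(v)
	}
	norm := math.Sqrt(sum)

	out := make([]float32, len(vec))
	if norm == 0 {
		return out
	}
	for i, v := range vec {
		out[i] = float32(float64(v) / norm)
	}
	return out
}

func main() {
	fmt.Println(normalize([]float32{3, 4}))
}
```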
  32. 14 Jul, 2024 1 commit
  33. 02 Jul, 2024 1 commit