1. 01 Jan, 2025 1 commit
  2. 11 Dec, 2024 1 commit
  3. 05 Dec, 2024 2 commits
  4. 30 Nov, 2024 1 commit
  5. 12 Nov, 2024 1 commit
  6. 06 Nov, 2024 1 commit
  7. 28 Aug, 2024 1 commit
  8. 06 Aug, 2024 1 commit
  9. 05 Aug, 2024 1 commit
    • Daniel Hiltgen's avatar
      Implement linux NUMA detection · f457d634
      Daniel Hiltgen authored
      If the system has multiple numa nodes, enable numa support in llama.cpp
      If we detect numactl in the path, use that, else use the basic "distribute" mode.
      f457d634
  10. 30 Jul, 2024 1 commit
  11. 29 Jul, 2024 1 commit
  12. 27 Jul, 2024 1 commit
  13. 18 Jul, 2024 1 commit
  14. 17 Jul, 2024 1 commit
  15. 16 Jul, 2024 4 commits
  16. 15 Jul, 2024 3 commits
    • Michael Yang's avatar
      tools · d02bbebb
      Michael Yang authored
      d02bbebb
    • Jeffrey Morgan's avatar
    • royjhan's avatar
      Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c
      royjhan authored
      * Initial Batch Embedding
      
      * Revert "Initial Batch Embedding"
      
      This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.
      
      * Initial Draft
      
      * mock up notes
      
      * api/embed draft
      
      * add server function
      
      * check normalization
      
      * clean up
      
      * normalization
      
      * playing around with truncate stuff
      
      * Truncation
      
      * Truncation
      
      * move normalization to go
      
      * Integration Test Template
      
      * Truncation Integration Tests
      
      * Clean up
      
      * use float32
      
      * move normalize
      
      * move normalize test
      
      * refactoring
      
      * integration float32
      
      * input handling and handler testing
      
      * Refactoring of legacy and new
      
      * clear comments
      
      * merge conflicts
      
      * touches
      
      * embedding type 64
      
      * merge conflicts
      
      * fix hanging on single string
      
      * refactoring
      
      * test values
      
      * set context length
      
      * clean up
      
      * testing clean up
      
      * testing clean up
      
      * remove function closure
      
      * Revert "remove function closure"
      
      This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.
      
      * remove function closure
      
      * remove redundant error check
      
      * clean up
      
      * more clean up
      
      * clean up
      b9f5e16c
  17. 14 Jul, 2024 1 commit
  18. 02 Jul, 2024 1 commit
  19. 01 Jul, 2024 1 commit
  20. 21 Jun, 2024 1 commit
  21. 19 Jun, 2024 1 commit
    • royjhan's avatar
      Extend api/show and ollama show to return more model info (#4881) · fedf7163
      royjhan authored
      
      
      * API Show Extended
      
      * Initial Draft of Information
      Co-Authored-By: default avatarPatrick Devine <pdevine@sonic.net>
      
      * Clean Up
      
      * Descriptive arg error messages and other fixes
      
      * Second Draft of Show with Projectors Included
      
      * Remove Chat Template
      
      * Touches
      
      * Prevent wrapping from files
      
      * Verbose functionality
      
      * Docs
      
      * Address Feedback
      
      * Lint
      
      * Resolve Conflicts
      
      * Function Name
      
      * Tests for api/show model info
      
      * Show Test File
      
      * Add Projector Test
      
      * Clean routes
      
      * Projector Check
      
      * Move Show Test
      
      * Touches
      
      * Doc update
      
      ---------
      Co-authored-by: default avatarPatrick Devine <pdevine@sonic.net>
      fedf7163
  22. 17 Jun, 2024 1 commit
  23. 16 Jun, 2024 1 commit
  24. 12 Jun, 2024 1 commit
  25. 06 Jun, 2024 1 commit
  26. 04 Jun, 2024 1 commit
  27. 14 May, 2024 1 commit
  28. 10 May, 2024 1 commit
  29. 09 May, 2024 3 commits
  30. 07 May, 2024 1 commit
  31. 06 May, 2024 1 commit
  32. 29 Apr, 2024 1 commit