1. 25 Jul, 2024 1 commit
  2. 24 Jul, 2024 1 commit
  3. 22 Jul, 2024 3 commits
    • Daniel Hiltgen's avatar
      Enable windows error dialog for subprocess startup · e12fff88
      Daniel Hiltgen authored
      Make sure if something goes wrong spawning the process, the user gets
      enough info to be able to try to self correct, or at least file a bug
      with details so we can fix it.  Once the process starts, we immediately
      change back to the recommended setting to prevent the blocking dialog.
      This ensures if the model fails to load (OOM, unsupported model type,
      etc.) the process will exit quickly and we can scan the stdout/stderr
      of the subprocess for the reason to report via API.
      e12fff88
    • Jeffrey Morgan's avatar
      f8fedbda
    • Daniel Hiltgen's avatar
      Refine error reporting for subprocess crash · a3c20e3f
      Daniel Hiltgen authored
      On windows, the exit status winds up being the search term many
      users search for and end up piling in on issues that are unrelated.
      This refines the reporting so that if we have a more detailed message
      we'll suppress the exit status portion of the message.
      a3c20e3f
  4. 21 Jul, 2024 1 commit
  5. 20 Jul, 2024 2 commits
    • Daniel Hiltgen's avatar
      Adjust windows ROCm discovery · 283948c8
      Daniel Hiltgen authored
      The v5 hip library returns unsupported GPUs which wont enumerate at
      inference time in the runner so this makes sure we align discovery.  The
      gfx906 cards are no longer supported so we shouldn't compile with that
      GPU type as it wont enumerate at runtime.
      283948c8
    • Jeffrey Morgan's avatar
      add patch for tekken (#5807) · 1475eab9
      Jeffrey Morgan authored
      1475eab9
  6. 16 Jul, 2024 1 commit
  7. 15 Jul, 2024 1 commit
    • royjhan's avatar
      Introduce `/api/embed` endpoint supporting batch embedding (#5127) · b9f5e16c
      royjhan authored
      * Initial Batch Embedding
      
      * Revert "Initial Batch Embedding"
      
      This reverts commit c22d54895a280b54c727279d85a5fc94defb5a29.
      
      * Initial Draft
      
      * mock up notes
      
      * api/embed draft
      
      * add server function
      
      * check normalization
      
      * clean up
      
      * normalization
      
      * playing around with truncate stuff
      
      * Truncation
      
      * Truncation
      
      * move normalization to go
      
      * Integration Test Template
      
      * Truncation Integration Tests
      
      * Clean up
      
      * use float32
      
      * move normalize
      
      * move normalize test
      
      * refactoring
      
      * integration float32
      
      * input handling and handler testing
      
      * Refactoring of legacy and new
      
      * clear comments
      
      * merge conflicts
      
      * touches
      
      * embedding type 64
      
      * merge conflicts
      
      * fix hanging on single string
      
      * refactoring
      
      * test values
      
      * set context length
      
      * clean up
      
      * testing clean up
      
      * testing clean up
      
      * remove function closure
      
      * Revert "remove function closure"
      
      This reverts commit 55d48c6ed17abe42e7a122e69d603ef0c1506787.
      
      * remove function closure
      
      * remove redundant error check
      
      * clean up
      
      * more clean up
      
      * clean up
      b9f5e16c
  8. 13 Jul, 2024 1 commit
  9. 12 Jul, 2024 1 commit
  10. 11 Jul, 2024 3 commits
  11. 10 Jul, 2024 4 commits
  12. 09 Jul, 2024 1 commit
  13. 08 Jul, 2024 1 commit
  14. 07 Jul, 2024 4 commits
  15. 06 Jul, 2024 8 commits
  16. 05 Jul, 2024 7 commits