1. 19 Aug, 2024 1 commit
  2. 13 Aug, 2024 2 commits
  3. 12 Aug, 2024 1 commit
  4. 07 Aug, 2024 2 commits
  5. 05 Aug, 2024 2 commits
  6. 02 Aug, 2024 2 commits
  7. 01 Aug, 2024 4 commits
  8. 29 Jul, 2024 2 commits
  9. 27 Jul, 2024 1 commit
  10. 26 Jul, 2024 2 commits
  11. 25 Jul, 2024 3 commits
  12. 24 Jul, 2024 1 commit
  13. 23 Jul, 2024 1 commit
  14. 22 Jul, 2024 3 commits
  15. 20 Jul, 2024 1 commit
    • Daniel Hiltgen's avatar
      Adjust windows ROCm discovery · 283948c8
      Daniel Hiltgen authored
      The v5 hip library returns unsupported GPUs which wont enumerate at
      inference time in the runner so this makes sure we align discovery.  The
      gfx906 cards are no longer supported so we shouldn't compile with that
      GPU type as it wont enumerate at runtime.
      283948c8
  16. 17 Jul, 2024 1 commit
  17. 10 Jul, 2024 1 commit
    • Daniel Hiltgen's avatar
      Bump ROCm on windows to 6.1.2 · 1f50356e
      Daniel Hiltgen authored
      This also adjusts our algorithm to favor our bundled ROCm.
      I've confirmed VRAM reporting still doesn't work properly so we
      can't yet enable concurrency by default.
      1f50356e
  18. 05 Jul, 2024 1 commit
  19. 04 Jul, 2024 1 commit
  20. 03 Jul, 2024 1 commit
    • Daniel Hiltgen's avatar
      Better nvidia GPU discovery logging · ef757da2
      Daniel Hiltgen authored
      Refine the way we log GPU discovery to improve the non-debug
      output, and report more actionable log messages when possible
      to help users troubleshoot on their own.
      ef757da2
  21. 02 Jul, 2024 2 commits
  22. 01 Jul, 2024 1 commit
  23. 29 Jun, 2024 1 commit
  24. 28 Jun, 2024 2 commits
  25. 19 Jun, 2024 1 commit
    • royjhan's avatar
      Extend api/show and ollama show to return more model info (#4881) · fedf7163
      royjhan authored
      
      
      * API Show Extended
      
      * Initial Draft of Information
      Co-Authored-By: default avatarPatrick Devine <pdevine@sonic.net>
      
      * Clean Up
      
      * Descriptive arg error messages and other fixes
      
      * Second Draft of Show with Projectors Included
      
      * Remove Chat Template
      
      * Touches
      
      * Prevent wrapping from files
      
      * Verbose functionality
      
      * Docs
      
      * Address Feedback
      
      * Lint
      
      * Resolve Conflicts
      
      * Function Name
      
      * Tests for api/show model info
      
      * Show Test File
      
      * Add Projector Test
      
      * Clean routes
      
      * Projector Check
      
      * Move Show Test
      
      * Touches
      
      * Doc update
      
      ---------
      Co-authored-by: default avatarPatrick Devine <pdevine@sonic.net>
      fedf7163