1. 17 Nov, 2025 1 commit
    • Daniel Hiltgen's avatar
      bring back sysfs based VRAM information for AMD (#12871) · 2f36d769
      Daniel Hiltgen authored
      * build: optimize dockerfile context for iterating
      
      This moves the copy of the source into the layer AFTER
      doing software installs so we don't have to go through
      the RPM install for cuda, etc. every time you touch a
      source file.
      
      * amd: implement linux sysfs based VRAM lookup
      
      This adds a C++ implementation of sysfs DRM VRAM discovery
      for more accurate free VRAM data on linux for AMD GPUs.
      2f36d769
  2. 06 Nov, 2025 2 commits
  3. 04 Nov, 2025 1 commit
    • Daniel Hiltgen's avatar
      discovery: only retry AMD GPUs (#12894) · 27f1fde4
      Daniel Hiltgen authored
      * discovery: only retry AMD GPUs
      
      CUDA and Vulkan don't crash on unsupported devices, so retry isn't necessary.
      This also refactors the code to shift the Library specific logic into the ml
      package.
      
      * review comments
      27f1fde4
  4. 28 Oct, 2025 1 commit
    • Daniel Hiltgen's avatar
      Fix vulkan PCI ID and ID handling (#12775) · 14977a93
      Daniel Hiltgen authored
      * Fix vulkan PCI ID and ID handling
      
      Intel GPUs may not report PCI IDs which was leading to incorrect overlap
      detection.  Switch to using the existing PCI IDs, however AMD GPUs claim not to
      report PCI IDs, but actually do, so try anyway, as this is required for ADLX to
      find the GPUs on Windows. Numeric IDs lead to scheduling problems, so this also
      switches Vulkan to use UUID based IDs. The GPU discovery patches have been
      squashed into a single patch to simplify future rebases.
      
      * review comments
      14977a93
  5. 20 Oct, 2025 1 commit
  6. 16 Oct, 2025 1 commit
  7. 13 Oct, 2025 1 commit
  8. 10 Oct, 2025 1 commit
  9. 02 Oct, 2025 1 commit
    • Daniel Hiltgen's avatar
      Update GGML to b6646 (#12245) · c68f367e
      Daniel Hiltgen authored
      Notable EOLs with this change:
      - MacOS v12 and v13 are no longer supported (v14+ required)
      - AMD gfx900 and gfx906 are no longer supported
      c68f367e
  10. 01 Oct, 2025 1 commit
    • Daniel Hiltgen's avatar
      Use runners for GPU discovery (#12090) · bc8909fb
      Daniel Hiltgen authored
      This revamps how we discover GPUs in the system by leveraging the Ollama
      runner.  This should eliminate inconsistency between our GPU discovery and the
      runners capabilities at runtime, particularly for cases where we try to filter
      out unsupported GPUs.  Now the runner does that implicitly based on the actual
      device list.  In some cases free VRAM reporting can be unreliable which can
      leaad to scheduling mistakes, so this also includes a patch to leverage more
      reliable VRAM reporting libraries if available.
      
      Automatic workarounds have been removed as only one GPU leveraged this, which
      is now documented. This GPU will soon fall off the support matrix with the next
      ROCm bump.
      
      Additional cleanup of the scheduler and discovery packages can be done in the
      future once we have switched on the new memory management code, and removed
      support for the llama runner.
      bc8909fb