1. 06 Nov, 2025 1 commit
  2. 28 Oct, 2025 1 commit
    • Daniel Hiltgen's avatar
      Fix vulkan PCI ID and ID handling (#12775) · 14977a93
      Daniel Hiltgen authored
      * Fix vulkan PCI ID and ID handling
      
      Intel GPUs may not report PCI IDs which was leading to incorrect overlap
      detection.  Switch to using the existing PCI IDs, however AMD GPUs claim not to
      report PCI IDs, but actually do, so try anyway, as this is required for ADLX to
      find the GPUs on Windows. Numeric IDs lead to scheduling problems, so this also
      switches Vulkan to use UUID based IDs. The GPU discovery patches have been
      squashed into a single patch to simplify future rebases.
      
      * review comments
      14977a93
  3. 15 Oct, 2025 1 commit
    • Santosh Bhavani's avatar
      ml/backend/ggml: NVML fallback for unified memory GPUs (#12619) · 8fafc8af
      Santosh Bhavani authored
      * Simplify NVML fallback for unified memory GPUs
      
      Remove device-specific checks and environment variable dependency for
      NVML_ERROR_NOT_SUPPORTED fallback. When NVML doesn't support memory
      queries, unconditionally use /proc/meminfo instead of checking device
      names or OLLAMA_UNIFIED_MEMORY environment variable.
      
      This provides better memory reporting by using MemAvailable which
      accounts for reclaimable memory, avoiding the underreporting issue
      described in NVIDIA support article a_id/5728.
      
      Tested on NVIDIA GB10 unified memory iGPU with consistent and accurate
      memory reporting across multiple model load/unload cycles.
      
      * Add NVML fallback patch for unified memory GPUs
      8fafc8af