1. 03 Dec, 2025 1 commit
    • Daniel Hiltgen's avatar
      CUDA: filter devices on secondary discovery (#13317) · 3f308367
      Daniel Hiltgen authored
      We now do a deeper probe of CUDA devices to verify the library version has
      the correct compute capability coverage for the device.  Due to ROCm also
      interpreting the CUDA env var to filter AMD devices, we try to avoid setting
      it which leads to problems in mixed vendor systems.  However without setting
      it for this deeper probe, each CUDA library subprocess discovers all CUDA GPUs
      and on systems with lots of GPUs, this can lead to hitting timeouts.  The fix is
      to turn on the CUDA visibility env var just for this deeper probe use-case.
      3f308367
  2. 02 Dec, 2025 7 commits
  3. 01 Dec, 2025 3 commits
    • Daniel Hiltgen's avatar
      win: warn if ggml-base detected in PATH (#13289) · 55417275
      Daniel Hiltgen authored
      If the user has somehow installed another GGML based app which places a
      ggml-base lib somewhere in their PATH, we can experience runtime problems
      due to incompatibilities.  This change adds a warning message if we detect
      a ggml-base outside of our install location to aid in troubleshooting.
      55417275
    • Bruce MacDonald's avatar
      api/client: handle non-json streaming errors (#13007) · 5b6a8e60
      Bruce MacDonald authored
      While processing the response stream during a chat or generation if an error is occurred it is parsed and returned to the user. The issue with the existing code is that this assumed the response would be valid JSON, which is not a safe assumption and caused cryptic error messages to be displayed due to parsing failures:
      `invalid character 'i' looking for beginning of value`
      
      This change updates the stream function to return the raw error string if it cant be parsed as JSON. This should help with debugging issues by making sure the actual error reaches the user.
      5b6a8e60
    • Daniel Hiltgen's avatar
      jetpack: require exact match or skip cuda_jetpack* (#13288) · 467bbc0d
      Daniel Hiltgen authored
      The cuda_jetpack libs will enumerate discrete GPUs on SBSA systems
      which leads to runtime failures of missing kernels.  This fix
      requires an exact match to enable jetpacks instead of relying on
      enumeration to filter out supported libraries.
      467bbc0d
  4. 30 Nov, 2025 1 commit
  5. 29 Nov, 2025 1 commit
  6. 26 Nov, 2025 1 commit
  7. 20 Nov, 2025 6 commits
  8. 19 Nov, 2025 10 commits
  9. 18 Nov, 2025 7 commits
  10. 17 Nov, 2025 3 commits