Commits · d4e0da08907f7611e1a2d9bda319bb30cd4ff029 · OpenDAS / ollama

06 Nov, 2025 1 commit

Remove unnecessary macOS 13 and lower patches (#12656) · d4e0da08

Thomas Stocker authored Nov 07, 2025

* Remove unnecessary macOS 13 patch

* Remove unnecessary macOS version guard patch

* rename patches

* remove macOS 13 patch again

* rename files


28 Oct, 2025 1 commit

Fix vulkan PCI ID and ID handling (#12775) · 14977a93

Daniel Hiltgen authored Oct 28, 2025

* Fix vulkan PCI ID and ID handling

Intel GPUs may not report PCI IDs, which was leading to incorrect overlap
detection. Switch to using the existing PCI IDs. AMD GPUs claim not to report
PCI IDs but actually do, so try anyway, as this is required for ADLX to find
the GPUs on Windows. Numeric IDs lead to scheduling problems, so this also
switches Vulkan to use UUID-based IDs. The GPU discovery patches have been
squashed into a single patch to simplify future rebases.

* review comments
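
One way a missing PCI ID could produce incorrect overlap detection is sketched below. This is a hypothetical illustration, not the code from the patch: the `Device` record, its field names, and both comparison functions are assumptions made for the example. It only shows that comparing empty PCI IDs for equality makes two unrelated devices look like the same GPU, and that a guard on presence avoids this.

```python
from dataclasses import dataclass


@dataclass
class Device:
    # Hypothetical record; field names are illustrative, not from the patch.
    backend: str  # e.g. "vulkan" or "cuda"
    uuid: str     # stable identifier, independent of enumeration order
    pci_id: str   # empty string when the driver reports no PCI ID


def naive_overlap(a: Device, b: Device) -> bool:
    # Plain equality: two devices that both lack a PCI ID compare equal.
    return a.pci_id == b.pci_id


def guarded_overlap(a: Device, b: Device) -> bool:
    # Only trust the PCI ID when both devices actually report one.
    if a.pci_id and b.pci_id:
        return a.pci_id == b.pci_id
    return False


intel = Device(backend="vulkan", uuid="uuid-intel", pci_id="")
other = Device(backend="vulkan", uuid="uuid-other", pci_id="")
nvidia_vk = Device(backend="vulkan", uuid="uuid-nv", pci_id="0000:01:00.0")
nvidia_cuda = Device(backend="cuda", uuid="uuid-nv", pci_id="0000:01:00.0")

print(naive_overlap(intel, other))            # distinct devices wrongly matched
print(guarded_overlap(intel, other))          # no match without PCI IDs
print(guarded_overlap(nvidia_vk, nvidia_cuda))  # same PCI ID, same GPU
```

In this sketch the UUID is carried alongside the PCI ID because an identifier that does not depend on enumeration order stays stable across runs, which is the general motivation for preferring UUIDs over numeric indices.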


15 Oct, 2025 1 commit

ml/backend/ggml: NVML fallback for unified memory GPUs (#12619) · 8fafc8af

Santosh Bhavani authored Oct 15, 2025

* Simplify NVML fallback for unified memory GPUs

Remove the device-specific checks and the environment variable dependency for
the NVML_ERROR_NOT_SUPPORTED fallback. When NVML doesn't support memory
queries, unconditionally use /proc/meminfo instead of checking device
names or the OLLAMA_UNIFIED_MEMORY environment variable.

This provides better memory reporting by using MemAvailable, which
accounts for reclaimable memory, avoiding the underreporting issue
described in NVIDIA support article a_id/5728.

Tested on an NVIDIA GB10 unified memory iGPU with consistent and accurate
memory reporting across multiple model load/unload cycles.

* Add NVML fallback patch for unified memory GPUs
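
The /proc/meminfo fallback described above can be sketched roughly as follows. This is a minimal illustration in Python, not the patch itself. The function names (`parse_meminfo`, `fallback_memory`, `read_meminfo`) and the sample values are assumptions made for the example. It reports MemAvailable rather than MemFree as the free figure, since MemAvailable accounts for reclaimable memory.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of byte counts."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        if not sep:
            continue  # not a "Key: value" line
        fields = rest.split()
        if not fields:
            continue
        try:
            amount = int(fields[0])
        except ValueError:
            continue
        # Sizes carry a "kB" unit, which the kernel uses to mean 1024 bytes.
        # Unitless entries (plain counters) are stored unchanged.
        if len(fields) > 1 and fields[1] == "kB":
            amount *= 1024
        values[key.strip()] = amount
    return values


def fallback_memory(text):
    """Return (total, free) in bytes, using MemAvailable as the free figure."""
    info = parse_meminfo(text)
    return info["MemTotal"], info["MemAvailable"]


def read_meminfo(path="/proc/meminfo"):
    """Read the real file on Linux; not called in the demo below."""
    with open(path, "r", encoding="ascii") as f:
        return f.read()


# Made-up sample values so the sketch runs on any platform.
SAMPLE = """MemTotal:       16384000 kB
MemFree:         1024000 kB
MemAvailable:    8192000 kB
HugePages_Total:       0
"""

total, free = fallback_memory(SAMPLE)
print(total, free)
```

With the sample above, MemFree alone would report about 1 GB free, while MemAvailable reports about 8 GB, which illustrates the kind of underreporting the commit message refers to.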
