- 23 Dec, 2025 1 commit
Vallabh Mahajan authored
- 12 Nov, 2025 1 commit
Daniel Hiltgen authored
* docs: vulkan information
* Revert "CI: Set up temporary opt-out Vulkan support (#12614)"
  This reverts commit 8b6e5bae.
* vulkan: temporary opt-in for Vulkan support
  Revert this once we're ready to enable by default.
* win: add Vulkan CI build
- 11 Nov, 2025 1 commit
Sheikh authored
- 07 Nov, 2025 1 commit
Daniel Hiltgen authored
* doc: re-add login autostart FAQ
  This appears to have been accidentally dropped during the doc migration.
* docs: restore GPU updates lost during the doc update
* review comments: improve Windows login disable instructions
- 28 Oct, 2025 2 commits
Parth Sareen authored
Parth Sareen authored
- 16 Oct, 2025 1 commit
Daniel Hiltgen authored
8.7 is JetPack only, so there is no need for it on x86 builds; 10.3 covers [G]B300.
- 11 Oct, 2025 1 commit
Daniel Hiltgen authored
- 02 Oct, 2025 1 commit
Daniel Hiltgen authored
Notable EOLs with this change:
- macOS v12 and v13 are no longer supported (v14+ required)
- AMD gfx900 and gfx906 are no longer supported
- 01 Oct, 2025 1 commit
Daniel Hiltgen authored
This revamps how we discover GPUs in the system by leveraging the Ollama runner. This should eliminate inconsistency between our GPU discovery and the runner's capabilities at runtime, particularly for cases where we try to filter out unsupported GPUs. Now the runner does that implicitly based on the actual device list.

In some cases free VRAM reporting can be unreliable, which can lead to scheduling mistakes, so this also includes a patch to leverage more reliable VRAM reporting libraries if available.

Automatic workarounds have been removed, as only one GPU leveraged this, which is now documented. This GPU will soon fall off the support matrix with the next ROCm bump. Additional cleanup of the scheduler and discovery packages can be done in the future, once we have switched on the new memory management code and removed support for the llama runner.
- 05 Jul, 2025 1 commit
Daniel Hiltgen authored
- 23 Jun, 2025 1 commit
Daniel Hiltgen authored
* Re-remove CUDA v11
  Revert the revert: drop v11 support, requiring drivers newer than Feb 2023.
  This reverts commit c6bcdc42.
* Simplify layout
  With only one version of the GPU libraries, we can simplify things down somewhat. (Jetsons still require special handling.)
* distinct sbsa variant for linux arm64
  This avoids accidentally trying to load the sbsa CUDA libraries on a Jetson system, which results in crashes.
* temporarily prevent rocm+cuda mixed loading
- 13 May, 2025 1 commit
Daniel Hiltgen authored
Bring back v11 until we can better warn users that their driver is too old. This reverts commit fa393554.
- 07 May, 2025 1 commit
Daniel Hiltgen authored
This reduces the size of our Windows installer payloads by ~256M by dropping support for NVIDIA drivers older than Feb 2023. Hardware support is unchanged. Linux default bundle sizes are reduced by ~600M to 1G.
- 13 Feb, 2025 1 commit
frob authored
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
- 20 Jan, 2025 1 commit
EndoTheDev authored
- 10 Dec, 2024 1 commit
Daniel Hiltgen authored
* llama: wire up builtin runner
  This adds a new entrypoint into the ollama CLI to run the cgo-built runner. On Mac arm64, this will have GPU support, but on all other platforms it will be the lowest common denominator CPU build. After we fully transition to the new Go runners, more tech debt can be removed and we can stop building the "default" runner via make and rely on the builtin always.
* build: Make target improvements
  Add a few new targets and help for building locally. This also adjusts the runner lookup to favor local builds, then runners relative to the executable, and finally payloads.
* Support customized CPU flags for runners
  This implements a simplified custom CPU flags pattern for the runners. When built without overrides, the runner name contains the vector flag we check for (AVX) to ensure we don't try to run on unsupported systems and crash. If the user builds a customized set, we omit the naming scheme and don't check for compatibility. This avoids checking requirements at runtime, so that logic has been removed as well. This can be used to build GPU runners with no vector flags, or CPU/GPU runners with additional flags (e.g. AVX512) enabled.
* Use relative paths
  If the user checks out the repo in a path that contains spaces, make gets really confused, so use relative paths for everything in-repo to avoid breakage.
* Remove payloads from main binary
* install: clean up prior libraries
  This removes support for v0.3.6 and older versions (before the tar bundle) and ensures we clean up prior libraries before extracting the bundle(s). Without this change, runners and dependent libraries could leak when we update and lead to subtle runtime errors.
- 26 Oct, 2024 1 commit
Daniel Hiltgen authored
* Better support for AMD multi-GPU
  This resolves a number of problems related to AMD multi-GPU setups on Linux.
  The numeric IDs used by ROCm are not the same as the numeric IDs exposed in sysfs, although the ordering is consistent. We have to count up from the first valid gfx (major/minor/patch with non-zero values) we find, starting at zero.
  There are 3 different env vars for selecting GPUs, and only ROCR_VISIBLE_DEVICES supports UUID-based identification, so we should favor that one, and try to use UUIDs if detected to avoid potential ordering bugs with numeric IDs.
* ROCR_VISIBLE_DEVICES only works on Linux
  Use the numeric-ID-only HIP_VISIBLE_DEVICES on Windows.
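The UUID-based selection favored above can be sketched as follows; this is a minimal example under stated assumptions, not taken from the commit itself, and the UUID shown is a hypothetical placeholder (real values can be listed with ROCm tools such as `rocminfo`):

```shell
# Linux: prefer ROCR_VISIBLE_DEVICES with a UUID, which is stable even if
# numeric device ordering changes between the runtime and sysfs.
# "GPU-0123456789abcdef" is a placeholder, not a real device UUID.
ROCR_VISIBLE_DEVICES="GPU-0123456789abcdef" ollama serve

# Windows: HIP_VISIBLE_DEVICES accepts numeric IDs only, e.g.
#   set HIP_VISIBLE_DEVICES=0,1
```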
- 05 Sep, 2024 1 commit
Michael authored
* Update gpu.md
  Seems strange that the laptop versions of the 3050 and 3050 Ti would be supported but not the non-notebook versions, but this is what the page (https://developer.nvidia.com/cuda-gpus) says.
  Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>
* Update gpu.md
  Remove notebook reference
  Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>
- 20 Jul, 2024 1 commit
Daniel Hiltgen authored
The v5 HIP library returns unsupported GPUs which won't enumerate at inference time in the runner, so this makes sure we align discovery. The gfx906 cards are no longer supported, so we shouldn't compile with that GPU type, as it won't enumerate at runtime.
- 01 Jul, 2024 1 commit
Eduard authored
Runs fine on an NVIDIA GeForce GTX 1050 Ti.
- 14 Jun, 2024 1 commit
Patrick Devine authored
- 21 Mar, 2024 2 commits
Daniel Hiltgen authored
Bruce MacDonald authored