Commits · 3a4449e2f1b19bec5b2a2d0a0b1ea2bd53d1b4cc · OpenDAS / ollama

13 Feb, 2025 1 commit
- docs: add H200 as supported device. (#9076) · 3a4449e2
  frob authored Feb 13, 2025
```
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
```
  3a4449e2
20 Jan, 2025 1 commit
- docs: update suspend header in gpu.md (#8487) · 7bb356c6
  EndoTheDev authored Jan 20, 2025
  
  7bb356c6
10 Dec, 2024 1 commit

build: Make target improvements (#7499) · 4879a234

Daniel Hiltgen authored Dec 10, 2024

* llama: wire up builtin runner

This adds a new entrypoint into the ollama CLI to run the cgo built runner.
On Mac arm64, this will have GPU support, but on all other platforms it will
be the lowest common denominator CPU build.  After we fully transition
to the new Go runners more tech-debt can be removed and we can stop building
the "default" runner via make and rely on the builtin always.

* build: Make target improvements

Add a few new targets and help for building locally.
This also adjusts the runner lookup to favor local builds, then
runners relative to the executable, and finally payloads.

* Support customized CPU flags for runners

This implements a simplified custom CPU flags pattern for the runners.
When built without overrides, the runner name contains the vector flag
we check for (AVX) to ensure we don't try to run on unsupported systems
and crash.  If the user builds a customized set, we omit the naming
scheme and don't check for compatibility.  This avoids checking
requirements at runtime, so that logic has been removed as well.  This
can be used to build GPU runners with no vector flags, or CPU/GPU
runners with additional flags (e.g. AVX512) enabled.

* Use relative paths

If the user checks out the repo in a path that contains spaces, make gets
really confused so use relative paths for everything in-repo to avoid breakage.

* Remove payloads from main binary

* install: clean up prior libraries

This removes support for v0.3.6 and older versions (before the tar bundle)
and ensures we clean up prior libraries before extracting the bundle(s).
Without this change, runners and dependent libraries could leak when we
update and lead to subtle runtime errors.

4879a234

26 Oct, 2024 1 commit

Better support for AMD multi-GPU on linux (#7212) · d7c94e0c

Daniel Hiltgen authored Oct 26, 2024

* Better support for AMD multi-GPU

This resolves a number of problems related to AMD multi-GPU setups on linux.

The numeric IDs used by rocm are not the same as the numeric IDs exposed in
sysfs although the ordering is consistent.  We have to count up from the first
valid gfx (major/minor/patch with non-zero values) we find starting at zero.

There are 3 different env vars for selecting GPUs, and only ROCR_VISIBLE_DEVICES
supports UUID based identification, so we should favor that one, and try
to use UUIDs if detected to avoid potential ordering bugs with numeric IDs

* ROCR_VISIBLE_DEVICES only works on linux

Use the numeric ID only HIP_VISIBLE_DEVICES on windows

d7c94e0c

05 Sep, 2024 1 commit

Update gpu.md: Add RTX 3050 Ti and RTX 3050 Ti (#5888) · 5f944baa

Michael authored Sep 05, 2024

* Update gpu.md

    Seems strange that the laptop versions of 3050 and 3050 Ti would be supported but not the non-notebook, but this is what the page (https://developer.nvidia.com/cuda-gpus

) says.
Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>

* Update gpu.md

Remove notebook reference

---------
Signed-off-by: bean5 <2052646+bean5@users.noreply.github.com>

5f944baa

20 Jul, 2024 1 commit

Adjust windows ROCm discovery · 283948c8

Daniel Hiltgen authored Jul 19, 2024

The v5 hip library returns unsupported GPUs which wont enumerate at
inference time in the runner so this makes sure we align discovery. The
gfx906 cards are no longer supported so we shouldn't compile with that
GPU type as it wont enumerate at runtime.

283948c8

01 Jul, 2024 1 commit
- Update gpu.md (#5382) · 27402cb7
  Eduard authored Jul 01, 2024
```
Runs fine on a NVIDIA GeForce GTX 1050 Ti
```
  27402cb7
14 Jun, 2024 1 commit
- update 40xx gpu compat matrix (#5036) · 4dc7fb95
  Patrick Devine authored Jun 13, 2024
  
  4dc7fb95
21 Mar, 2024 2 commits
- Add docs for GPU selection and nvidia uvm workaround · d8fdbfd8
  Daniel Hiltgen authored Mar 21, 2024
  
  d8fdbfd8
- doc: faq gpu compatibility (#3142) · a5ba0fcf
  Bruce MacDonald authored Mar 21, 2024
  
  a5ba0fcf