Commits · 1c6669e64cc8a482fbf1e35c0249f17b35a4e87a · OpenDAS / ollama

23 Jun, 2025 1 commit

Daniel Hiltgen authored Jun 23, 2025

* Re-remove cuda v11

Revert the revert - drop v11 support requiring drivers newer than Feb 23

This reverts commit c6bcdc42.

* Simplify layout

With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)

* distinct sbsa variant for linux arm64

This avoids accidentally trying to load the sbsa cuda libraries on
a jetson system which results in crashes.

* temporary prevent rocm+cuda mixed loading

1c6669e6

13 May, 2025 1 commit

Revert "remove cuda v11 (#10569)" (#10692) · c6bcdc42

Daniel Hiltgen authored May 13, 2025

Bring back v11 until we can better warn users that their driver
is too old.

This reverts commit fa393554.

c6bcdc42

07 May, 2025 1 commit

remove cuda v11 (#10569) · fa393554

Daniel Hiltgen authored May 06, 2025

This reduces the size of our Windows installer payloads by ~256M by dropping
support for nvidia drivers older than Feb 2023. Hardware support is unchanged.

Linux default bundle sizes are reduced by ~600M to 1G.

fa393554

08 Apr, 2025 1 commit

cleanup: remove OLLAMA_TMPDIR and references to temporary executables (#10182) · ccc8c677

frob authored Apr 09, 2025



* cleanup: remove OLLAMA_TMPDIR
* cleanup: ollama doesn't use temporary executables anymore

---------
Co-authored-by: Richard Lyons <frob@cloudstaff.com>

ccc8c677

25 Mar, 2025 1 commit
- docs: add flags to example linux log output command (#9852) · 5e0b904e
  copeland3300 authored Mar 25, 2025
  
  5e0b904e
25 Feb, 2025 1 commit
- Move cgroups fix out of AMD section. (#9072) · 4df98f3e
  frob authored Feb 25, 2025
```
Co-authored-by: Richard Lyons <frob@cloudstaff.com>
```
  4df98f3e
07 Feb, 2025 1 commit
- docs: improve syntax highlighting in code blocks (#8854) · b901a712
  Azis Alvriyanto authored Feb 08, 2025
  
  b901a712
10 Dec, 2024 1 commit
- all: fix typos in documentation, code, and comments (#7021) · abfdc471
  Stefan Weil authored Dec 10, 2024
  
  abfdc471
21 Nov, 2024 1 commit
- docs: Link to AMD guide on multi-GPU guidance (#7744) · d8632982
  Daniel Hiltgen authored Nov 20, 2024
  
  d8632982
12 Nov, 2024 2 commits

doc: capture numeric group requirement (#6941) · ac07160c

Daniel Hiltgen authored Nov 12, 2024

Docker uses the container filesystem for name resolution, so we can't guide users
to use the name of the host group.  Instead they must specify the numeric ID.

ac07160c

docs: Capture docker cgroup workaround (#7519) · 6606e424

Daniel Hiltgen authored Nov 12, 2024

GPU support can break on some systems after a while.  This captures a
known workaround to solve the problem.

6606e424

11 Sep, 2024 1 commit

Verify permissions for AMD GPU (#6736) · 9246e6dd

Daniel Hiltgen authored Sep 11, 2024

This adds back a check which was lost many releases back to verify /dev/kfd permissions
which when lacking, can lead to confusing failure modes of:
"rocBLAS error: Could not initialize Tensile host: No devices found"

This implementation does not hard fail the serve command but instead will fall back to CPU
with an error log. In the future we can include this in the GPU discovery UX to show
detected but unsupported devices we discovered.

9246e6dd

05 Aug, 2024 1 commit

Disable paging for journalctl (#6154) · b73b0940

frob authored Aug 05, 2024

Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.

b73b0940

04 Jul, 2024 1 commit
- Document older win10 terminal problems · 52abc8ac
  Daniel Hiltgen authored May 13, 2024
```
We haven't found a workaround, so for now recommend updating.
```
  52abc8ac
03 Jul, 2024 1 commit

Better nvidia GPU discovery logging · ef757da2

Daniel Hiltgen authored Jul 03, 2024

Refine the way we log GPU discovery to improve the non-debug
output, and report more actionable log messages when possible
to help users troubleshoot on their own.

ef757da2

19 Jun, 2024 1 commit
- Implement log rotation for tray app · 9d8a4988
  Daniel Hiltgen authored Jun 15, 2024
  
  9d8a4988
23 May, 2024 1 commit
- Add isolated gpu test to troubleshooting · f77713bf
  Daniel Hiltgen authored May 23, 2024
  
  f77713bf
21 May, 2024 1 commit
- doc updates for the faq/troubleshooting (#4565) · 3bade04e
  Patrick Devine authored May 21, 2024
  
  3bade04e
20 May, 2024 1 commit
- chore: fix typo in docs (#4536) · 8800c8a5
  alwqx authored May 21, 2024
  
  8800c8a5
09 May, 2024 1 commit
- Doc container usage and workaround for nvidia errors · 8cc0ee2e
  Daniel Hiltgen authored May 09, 2024
  
  8cc0ee2e
01 Apr, 2024 1 commit

Safeguard for noexec · 0a74cb31

Daniel Hiltgen authored Mar 28, 2024

We may have users that run into problems with our current
payload model, so this gives us an escape valve.

0a74cb31

21 Mar, 2024 1 commit
- doc: faq gpu compatibility (#3142) · a5ba0fcf
  Bruce MacDonald authored Mar 21, 2024
  
  a5ba0fcf
15 Mar, 2024 1 commit
- Add ROCm support to linux install script (#2966) · 6459377a
  Daniel Hiltgen authored Mar 14, 2024
  
  6459377a
11 Mar, 2024 1 commit
- Update troubleshooting.md · 6d3adfbe
  Jeffrey Morgan authored Mar 11, 2024
  
  6d3adfbe
07 Mar, 2024 2 commits

Refined ROCm troubleshooting docs · 69f02278
Daniel Hiltgen authored Mar 07, 2024

69f02278

Revamp ROCm support · 6c5ccb11

Daniel Hiltgen authored Feb 15, 2024

This refines where we extract the LLM libraries to by adding a new
OLLAMA_HOME env var, that defaults to `~/.ollama` The logic was already
idempotenent, so this should speed up startups after the first time a
new release is deployed. It also cleans up after itself.

We now build only a single ROCm version (latest major) on both windows
and linux. Given the large size of ROCms tensor files, we split the
dependency out. It's bundled into the installer on windows, and a
separate download on windows. The linux install script is now smart and
detects the presence of AMD GPUs and looks to see if rocm v6 is already
present, and if not, then downloads our dependency tar file.

For Linux discovery, we now use sysfs and check each GPU against what
ROCm supports so we can degrade to CPU gracefully instead of having
llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows
dynamic library loading logic to access the amdhip64.dll APIs to query
the GPU information.

6c5ccb11

15 Feb, 2024 1 commit

Implement new Go based Desktop app · 29e90cc1

Daniel Hiltgen authored Dec 26, 2023

This focuses on Windows first, but coudl be used for Mac
and possibly linux in the future.

29e90cc1

29 Jan, 2024 1 commit
- Add container hints for troubleshooting · e7dbb003
  Daniel Hiltgen authored Jan 29, 2024
```
Some users are new to containers and unsure where the server logs go
```
  e7dbb003
11 Jan, 2024 1 commit

Build multiple CPU variants and pick the best · d88c527b

Daniel Hiltgen authored Jan 07, 2024

This reduces the built-in linux version to not use any vector extensions
which enables the resulting builds to run under Rosetta on MacOS in
Docker. Then at runtime it checks for the actual CPU vector
extensions and loads the best CPU library available

d88c527b

22 Dec, 2023 1 commit

Clean up documentation (#1506) · 291700c9

Matt Williams authored Dec 22, 2023



* Clean up documentation

Will probably need to update with PRs for new release.
Signed-off-by: Matt Williams <m@technovangelist.com>

* Correcting to fit in 0.1.15 changes
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* addressing comments
Signed-off-by: Matt Williams <m@technovangelist.com>

* more api cleanup
Signed-off-by: Matt Williams <m@technovangelist.com>

* its llava not llama
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update docs/troubleshooting.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Updated hosting to server and documented all env vars
Signed-off-by: Matt Williams <m@technovangelist.com>

* remove last of the cli descriptions
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* update further per conversation with jeff earlier today
Signed-off-by: Matt Williams <m@technovangelist.com>

* cleanup the doc readme
Signed-off-by: Matt Williams <m@technovangelist.com>

* move upgrade to faq
Signed-off-by: Matt Williams <m@technovangelist.com>

* first change
Signed-off-by: Matt Williams <m@technovangelist.com>

* updated
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update docs/faq.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* examples in parent
Signed-off-by: Matt Williams <m@technovangelist.com>

* add exapmle for create model.
Signed-off-by: Matt Williams <m@technovangelist.com>

* update faq
Signed-off-by: Matt Williams <m@technovangelist.com>

* update create model api
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update docs/api.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/faq.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/troubleshooting.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* update the readme in docs
Signed-off-by: Matt Williams <m@technovangelist.com>

* update a few more things
Signed-off-by: Matt Williams <m@technovangelist.com>

* Update docs/troubleshooting.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/faq.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/modelfile.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

* Update docs/troubleshooting.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

---------
Signed-off-by: Matt Williams <m@technovangelist.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>

291700c9