Commits · f6c29409dc0823fcb0b42e8138ea3e208d6b5edf · OpenDAS / ollama

29 Oct, 2025 1 commit
- docs: add new cloud model + fix openai redirect (#12812) · f6c29409
  Jeffrey Morgan authored Oct 28, 2025
  
  f6c29409
28 Oct, 2025 5 commits
- docs: update readme and links (#12809) · d828517e
  Parth Sareen authored Oct 28, 2025
  
  d828517e
- docs: add docs for docs.ollama.com (#12805) · 3d99d977
  Parth Sareen authored Oct 28, 2025
  
  3d99d977
- docs: rename to mdx to setup docs site (#12804) · 6d02a43a
  Parth Sareen authored Oct 28, 2025
  
  6d02a43a
- Revert "docs: add reference to docs.ollama.com (#12800)" (#12803) · 5483497d
  Parth Sareen authored Oct 28, 2025
```
This reverts commit 934dd9e1.
```
  5483497d
- docs: add reference to docs.ollama.com (#12800) · 934dd9e1
  Parth Sareen authored Oct 28, 2025
  
  934dd9e1
16 Oct, 2025 1 commit
- cuda: tidy up CC settings (#12668) · 27067993
  Daniel Hiltgen authored Oct 16, 2025
```
8.7 is Jetpack only, so no need on x86 builds
10.3 covers [G]B300
```
  27067993
11 Oct, 2025 1 commit
- doc: remove AMD EOL GPUs (#12567) · 70d9e363
  Daniel Hiltgen authored Oct 10, 2025
  
  70d9e363
07 Oct, 2025 2 commits

docs: improve accuracy of LLM library docs (#12530) · 303be930
Daniel Hiltgen authored Oct 07, 2025

303be930

Bring back escape valve for llm libraries and fix Jetpack6 crash (#12529) · bd15eba4

Daniel Hiltgen authored Oct 07, 2025

* Bring back escape valve for llm libraries

If the new discovery logic picks the wrong library, this gives users the
ability to force a specific one using the same pattern as before. This
can also potentially speed up bootstrap discovery if one of the libraries
takes a long time to load and ultimately bind to no devices.  For example
unsupported AMD iGPUS can sometimes take a while to discover and rule out.

* Bypass extra discovery on jetpack systems

On at least Jetpack6, cuda_v12 appears to expose the iGPU, but crashes later on in
cublasInit so if we detect a Jetpack, short-circuit and use that variant.

bd15eba4

02 Oct, 2025 1 commit

Update GGML to b6646 (#12245) · c68f367e

Daniel Hiltgen authored Oct 02, 2025

Notable EOLs with this change:
- MacOS v12 and v13 are no longer supported (v14+ required)
- AMD gfx900 and gfx906 are no longer supported

c68f367e

01 Oct, 2025 1 commit

Use runners for GPU discovery (#12090) · bc8909fb

Daniel Hiltgen authored Oct 01, 2025

This revamps how we discover GPUs in the system by leveraging the Ollama
runner. This should eliminate inconsistency between our GPU discovery and the
runners capabilities at runtime, particularly for cases where we try to filter
out unsupported GPUs. Now the runner does that implicitly based on the actual
device list. In some cases free VRAM reporting can be unreliable which can
leaad to scheduling mistakes, so this also includes a patch to leverage more
reliable VRAM reporting libraries if available.

Automatic workarounds have been removed as only one GPU leveraged this, which
is now documented. This GPU will soon fall off the support matrix with the next
ROCm bump.

Additional cleanup of the scheduler and discovery packages can be done in the
future once we have switched on the new memory management code, and removed
support for the llama runner.

bc8909fb

22 Sep, 2025 2 commits
- docs: update cloud.md for cloud models · af060eb2
  jmorganca authored Sep 19, 2025
  
  af060eb2
- docs: move turbo.md to cloud.md · ae5c3300
  jmorganca authored Sep 19, 2025
  
  ae5c3300
15 Sep, 2025 1 commit
- doc: show how to clear the cgo cache (#12298) · 93c64ea1
  Daniel Hiltgen authored Sep 15, 2025
  
  93c64ea1
11 Sep, 2025 1 commit
- feat: add dimensions field to embed requests (#12242) · feb18cd7
  Michael Yang authored Sep 11, 2025
```
* feat: add field to truncate embeddings

* add openai embeddings for dimensions
```
  feb18cd7
10 Sep, 2025 1 commit

Add v12 + v13 cuda support (#12000) · 17a023f3

Daniel Hiltgen authored Sep 10, 2025

* Add support for upcoming NVIDIA Jetsons

The latest Jetsons with JetPack 7 are moving to an SBSA compatible model and
will not require building a JetPack specific variant.

* cuda: bring back dual versions

This adds back dual CUDA versions for our releases,
with v11 and v13 to cover a broad set of GPUs and
driver versions.

* win: break up native builds in build_windows.ps1

* v11 build working on windows and linux

* switch to cuda v12.8 not JIT

* Set CUDA compression to size

* enhance manual install linux docs

17a023f3

08 Sep, 2025 1 commit
- docs: show how to debug nvidia init failures (#12216) · 950d33aa
  Daniel Hiltgen authored Sep 08, 2025
```
This debug setting can help troubleshoot obscure initialization failures.
```
  950d33aa
15 Aug, 2025 1 commit
- docs: added missing comma in 'Ollama's Javascript library'' (#11915) · 883d0312
  Thomas Pelster authored Aug 15, 2025
  
  883d0312
14 Aug, 2025 1 commit
- doc: clarify both rocm and main bundle necessary (#11900) · 7ccfd97a
  Daniel Hiltgen authored Aug 14, 2025
```
Some users expect the rocm bundles to be self-sufficient, but are designed to be additive.
```
  7ccfd97a
06 Aug, 2025 3 commits
- docs: update the faq (#11760) · 44bc36d0
  Patrick Devine authored Aug 06, 2025
  
  44bc36d0
- Update downloading to pulling in api.md (#11170) · 8a75e9ee
  Gao feng authored Aug 07, 2025
```
update api.md to make it consist with code.
https://github.com/ollama/ollama/blob/main/server/download.go#L447
```
  8a75e9ee
- docs: update turbo model name (#11707) · 4742e12c
  Parth Sareen authored Aug 05, 2025
  
  4742e12c
05 Aug, 2025 1 commit
- docs: add docs for Ollama Turbo (#11687) · ee92ca3e
  Jeffrey Morgan authored Aug 05, 2025
  
  ee92ca3e
28 Jul, 2025 1 commit
- docs: fix typos and remove trailing whitespaces (#11554) · 3515cc37
  Yoshi authored Jul 28, 2025
  
  3515cc37
22 Jul, 2025 1 commit
- Update linux.md (#11462) · 4151ef8c
  ycomiti authored Jul 22, 2025
  
  4151ef8c
17 Jul, 2025 1 commit
- docs: add the no-Modelfile function of `ollama create` (#9077) · 802ad16c
  frob authored Jul 17, 2025
  
  802ad16c
16 Jul, 2025 1 commit
- docs: fix typo in macos.md (#11425) · 2e3fd86d
  Marcelo Fornet authored Jul 16, 2025
  
  2e3fd86d
11 Jul, 2025 1 commit
- docs: update modelfile.md to reflect current default num_ctx (#11189) · 4261a3b0
  先知 authored Jul 11, 2025
```
As in the commit 44b466ee, the default context length has been increased to 4096.
```
  4261a3b0
08 Jul, 2025 2 commits

doc: add MacOS docs (#11334) · 66fb8575
Daniel Hiltgen authored Jul 08, 2025
```
also removes stale model dir instructions for windows
```
66fb8575

Reduce default parallelism to 1 (#11330) · 20c3266e

Daniel Hiltgen authored Jul 08, 2025

The current scheduler algorithm of picking the paralellism based on available
VRAM complicates the upcoming dynamic layer memory allocation algorithm. This
changes the default to 1, with the intent going forward that parallelism is
explicit and will no longer be dynamically determined. Removal of the dynamic
logic will come in a follow up.

20c3266e

07 Jul, 2025 2 commits
- add `tool_name` to api.md (#11326) · 43107b15
  Parth Sareen authored Jul 07, 2025
  
  43107b15
- template: add tool result compatibility (#11294) · 1f91cb0c
  Parth Sareen authored Jul 07, 2025
  
  1f91cb0c
05 Jul, 2025 1 commit
- doc: add NVIDIA blackwell to supported list (#11307) · 9d60bb44
  Daniel Hiltgen authored Jul 05, 2025
  
  9d60bb44
23 Jun, 2025 1 commit

Re-remove cuda v11 (#10694) · 1c6669e6

Daniel Hiltgen authored Jun 23, 2025

* Re-remove cuda v11

Revert the revert - drop v11 support requiring drivers newer than Feb 23

This reverts commit c6bcdc42.

* Simplify layout

With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)

* distinct sbsa variant for linux arm64

This avoids accidentally trying to load the sbsa cuda libraries on
a jetson system which results in crashes.

* temporary prevent rocm+cuda mixed loading

1c6669e6

18 Jun, 2025 1 commit
- benchmark: remove unused benchmark test (#11120) · 8bcb3125
  Jeffrey Morgan authored Jun 18, 2025
```
Removes a test under benchmark/ that is unused
```
  8bcb3125
07 Jun, 2025 2 commits
- docs: update link to AMD drivers in linux.md (#10973) · fc030961
  Krzysztof Jeziorny authored Jun 07, 2025
  
  fc030961
- Revert "server: add model capabilities to the list endpoint (#10174)" (#11004) · 09d308d6
  Jeffrey Morgan authored Jun 06, 2025
```
This reverts commit 09430011.
```
  09d308d6
06 Jun, 2025 1 commit
- docs: fix typo in development.md (#10998) · c6a6d729
  Hunter Wittenborn authored Jun 06, 2025
  
  c6a6d729
04 Jun, 2025 1 commit
- server: add model capabilities to the list endpoint (#10174) · 09430011
  JasonHonKL authored Jun 05, 2025
  
  09430011