1. 23 Dec, 2025 1 commit
  2. 18 Dec, 2025 1 commit
  3. 12 Dec, 2025 2 commits
    • docs: add docs for v1/responses and rework openai compat section (#13416) · 9f782285
      Devon Rifkin authored
      
      
      * docs: add docs for v1/responses and rework openai compat section
      
      I reworked the examples to be separated by topic and to be fully
      runnable (i.e., they now log output instead of just suggesting how a
      call might be made).
      
      We now use `<CodeGroup>`s so that each example gets a dropdown on the
      docs site for choosing a language, which makes the examples a lot more
      digestible (you only see about a third of the code you used to).
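
      For context, the endpoint these docs cover speaks the OpenAI chat-completions wire format. As a minimal sketch (the model name is illustrative, and the `send` helper assumes a local `ollama serve` on the default port, so it is defined but not called here):

      ```python
      import json
      import urllib.request

      def build_chat_request(model, prompt):
          # OpenAI-style chat payload understood by Ollama's /v1 endpoints.
          return {
              "model": model,
              "messages": [{"role": "user", "content": prompt}],
          }

      def send(payload, base_url="http://localhost:11434/v1"):
          # Requires a running `ollama serve`; not invoked in this sketch.
          req = urllib.request.Request(
              base_url + "/chat/completions",
              data=json.dumps(payload).encode(),
              headers={"Content-Type": "application/json"},
          )
          with urllib.request.urlopen(req) as resp:
              return json.load(resp)

      payload = build_chat_request("llama3.2", "Why is the sky blue?")
      print(json.dumps(payload))
      ```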
      
      I also added a new tool to extract code examples into files so that it's
      easier to actually run them and check that they work.
      
      ## Example
      
      ```shell
      go run docs/tools/extract-examples/main.go docs/api/openai-compatibility.mdx
      ```
      
      Output:
      
      ```
      Extracting code examples to: /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368
      
        - 01_basic.py
        - 01_basic.js
        - 01_basic.sh
        - 02_responses.py
        - 02_responses.js
        - 02_responses.sh
        - 03_vision.py
        - 03_vision.js
        - 03_vision.sh
      
      Extracted 9 file(s) to /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368
      
      To run examples:
      
        cd /var/folders/vq/wfm2g6k917d3ldzpjdxc8ph00000gn/T/mdx-examples-3271754368
        npm install   # for JS examples
      
      then run individual files with `node file.js`, `python file.py`, `bash file.sh`
      ```
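
      The actual extractor lives in `docs/tools/extract-examples/main.go`; purely as an illustration of the idea (scan an .mdx file for fenced code blocks and write each to a numbered file), a stripped-down version might look like:

      ```python
      import re
      from pathlib import Path

      # Map fence languages to file extensions, as in the output above.
      EXT = {"python": "py", "javascript": "js", "shell": "sh"}

      def extract_examples(mdx_text, out_dir):
          """Write each fenced code block in mdx_text to a numbered file."""
          out = Path(out_dir)
          out.mkdir(parents=True, exist_ok=True)
          written = []
          for i, (lang, body) in enumerate(
              re.findall(r"```(\w+)\n(.*?)```", mdx_text, re.DOTALL), start=1
          ):
              name = f"{i:02d}_example.{EXT.get(lang, lang)}"
              (out / name).write_text(body)
              written.append(name)
          return written
      ```

      This is only a sketch of the same pattern, not the Go tool itself; the real tool also derives names like `01_basic.py` from the surrounding docs structure.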
      
      In the future we should consider actually running the examples in CI,
      with some sort of acceptance test, so we can automatically detect when
      our examples break; this is just a start in that direction.
      
      * Update docs/api/openai-compatibility.mdx
      Co-authored-by: Parth Sareen <parth.sareen@ollama.com>
      
      * Update docs/api/openai-compatibility.mdx
      Co-authored-by: Parth Sareen <parth.sareen@ollama.com>
      
      ---------
      Co-authored-by: Parth Sareen <parth.sareen@ollama.com>
    • docs: fix link to modelfile.mdx (#13220) · 93d45d7a
      Alexander Gusak authored
  4. 02 Dec, 2025 2 commits
  5. 29 Nov, 2025 1 commit
  6. 26 Nov, 2025 1 commit
  7. 18 Nov, 2025 1 commit
  8. 17 Nov, 2025 1 commit
  9. 14 Nov, 2025 1 commit
  10. 13 Nov, 2025 2 commits
  11. 12 Nov, 2025 1 commit
  12. 11 Nov, 2025 4 commits
  13. 08 Nov, 2025 1 commit
  14. 07 Nov, 2025 2 commits
  15. 05 Nov, 2025 1 commit
    • embeddings: added embedding command for cli (#12795) · 1ca608bc
      nicole pardal authored
      
      Co-authored-by: A-Akhil <akhilrahul70@gmail.com>
      
      This PR introduces a new `ollama embed` command that allows users to generate embeddings directly from the command line:
      
      - Added `ollama embed MODEL [TEXT...]` for generating text embeddings
      - Supports both direct text arguments and stdin piping for scripted workflows
      - Outputs embeddings as JSON arrays (one per line)
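
      Because the command emits one JSON array per line, its output is easy to consume from a script. A hedged sketch (the `embed` wrapper assumes `ollama` is installed and on PATH and is not run here; the parsing half works standalone):

      ```python
      import json
      import subprocess

      def parse_embedding_lines(output):
          """Each line of `ollama embed` output is one JSON array (a vector)."""
          return [json.loads(line) for line in output.splitlines() if line.strip()]

      def embed(model, texts):
          # Assumes a local ollama install; the model name is illustrative.
          out = subprocess.run(
              ["ollama", "embed", model, *texts],
              capture_output=True, text=True, check=True,
          ).stdout
          return parse_embedding_lines(out)

      # The parsing half works on any conforming output:
      vectors = parse_embedding_lines("[0.1, 0.2]\n[0.3, 0.4]\n")
      ```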
  16. 29 Oct, 2025 3 commits
  17. 28 Oct, 2025 5 commits
  18. 16 Oct, 2025 1 commit
  19. 11 Oct, 2025 1 commit
  20. 07 Oct, 2025 2 commits
    • 303be930
      Daniel Hiltgen authored
    • Bring back escape valve for llm libraries and fix Jetpack6 crash (#12529) · bd15eba4
      Daniel Hiltgen authored
      * Bring back escape valve for llm libraries
      
      If the new discovery logic picks the wrong library, this gives users the
      ability to force a specific one using the same pattern as before. This
      can also potentially speed up bootstrap discovery if one of the libraries
      takes a long time to load and ultimately binds to no devices. For example,
      unsupported AMD iGPUs can sometimes take a while to discover and rule out.
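
      As a hypothetical illustration of the escape-valve pattern (the library names are assumptions, not the actual build list):

      ```python
      import os

      # Illustrative library names; the real list depends on the build.
      AVAILABLE = ["cuda_v12", "cuda_v13", "rocm", "cpu"]

      def pick_library(discover):
          """Honor a user override before running (possibly slow) auto-discovery."""
          forced = os.environ.get("OLLAMA_LLM_LIBRARY")
          if forced:
              if forced not in AVAILABLE:
                  raise ValueError(f"unknown library: {forced}")
              return forced  # skip discovery entirely
          return discover()
      ```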
      
      * Bypass extra discovery on jetpack systems
      
      On at least JetPack 6, cuda_v12 appears to expose the iGPU but crashes
      later in cublasInit, so if we detect a JetPack system we short-circuit
      and use that variant.
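
      Purely as an illustration, a short-circuit check like the one described might detect a Jetson system via its L4T release file (the path, variant name, and logic here are assumptions, not the actual implementation):

      ```python
      from pathlib import Path

      def is_jetpack(release_file="/etc/nv_tegra_release"):
          """Jetson boards ship an L4T release file; its presence is a cheap
          signal to skip the full CUDA discovery pass."""
          return Path(release_file).exists()

      def choose_variant(discover):
          # On Jetson, jump straight to the known-good variant instead of
          # probing cuda_v12, which can misreport the iGPU and later crash.
          return "jetpack6" if is_jetpack() else discover()
      ```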
  21. 02 Oct, 2025 1 commit
    • Update GGML to b6646 (#12245) · c68f367e
      Daniel Hiltgen authored
      Notable EOLs with this change:
      - macOS v12 and v13 are no longer supported (v14+ required)
      - AMD gfx900 and gfx906 are no longer supported
  22. 01 Oct, 2025 1 commit
    • Use runners for GPU discovery (#12090) · bc8909fb
      Daniel Hiltgen authored
      This revamps how we discover GPUs in the system by leveraging the Ollama
      runner. This should eliminate inconsistency between our GPU discovery and the
      runners' capabilities at runtime, particularly for cases where we try to filter
      out unsupported GPUs; now the runner does that implicitly based on the actual
      device list. In some cases free VRAM reporting can be unreliable, which can
      lead to scheduling mistakes, so this also includes a patch to leverage more
      reliable VRAM-reporting libraries if available.
      
      Automatic workarounds have been removed, as only one GPU relied on them;
      that workaround is now documented. This GPU will soon fall off the support
      matrix with the next ROCm bump.
      
      Additional cleanup of the scheduler and discovery packages can be done in the
      future once we have switched on the new memory management code, and removed
      support for the llama runner.
  23. 22 Sep, 2025 2 commits
  24. 15 Sep, 2025 1 commit
  25. 11 Sep, 2025 1 commit