- 11 Nov, 2025 1 commit
Eva Ho authored
- 10 Nov, 2025 1 commit
Eva H authored
- 08 Nov, 2025 3 commits
Bruce MacDonald authored
Patrick Devine authored
Parth Sareen authored
- 07 Nov, 2025 2 commits
Daniel Hiltgen authored
* doc: re-add login autostart FAQ. This appears to have been accidentally dropped during the doc migration.
* docs: GPU updates lost on the doc update
* review comments: improve Windows login disable instructions
Tomoya Fujita authored
- 06 Nov, 2025 15 commits
Thomas Stocker authored
* Remove unnecessary macOS 13 patch
* Remove unnecessary macOS version guard patch
* Rename patches
* Remove macOS 13 patch again
* Rename files
Jeffrey Morgan authored
Saifeddine ALOUI authored
breatn authored
Jeffrey Morgan authored
Eva Ho authored
Eva H authored
feat: add support for WebP images in Ollama's app
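A minimal sketch of how WebP decoding can work in Go, assuming the app relies on Go's standard image registry; golang.org/x/image/webp and everything around it here is illustrative, not necessarily how the app actually implements the feature:

```go
package main

import (
	"fmt"
	"image"
	"log"
	"os"

	// Blank-importing the WebP decoder registers the format, so
	// image.Decode recognizes .webp files automatically.
	_ "golang.org/x/image/webp"
)

func main() {
	f, err := os.Open("input.webp") // hypothetical input file
	if err != nil {
		log.Fatal(err)
	}
	defer f.Close()

	img, format, err := image.Decode(f) // format will be "webp"
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(format, img.Bounds())
}
```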
Eva Ho authored
Daniel Hiltgen authored
Daniel Alejandro Coll Tejeda authored
7394112478 authored
mags0ft authored
Saifeddine ALOUI authored
Vincent Koc authored
Eva Ho authored
- 05 Nov, 2025 13 commits
Eva Ho authored
Eva Ho authored
Eva Ho authored
Daniel Hiltgen authored
Daniel Alejandro Coll Tejeda authored
nicole pardal authored
Co-authored-by: A-Akhil <akhilrahul70@gmail.com>

This PR introduces a new ollama embed command that allows users to generate embeddings directly from the command line.

* Added ollama embed MODEL [TEXT...] command for generating text embeddings
* Supports both direct text arguments and stdin piping for scripted workflows
* Outputs embeddings as JSON arrays (one per line)
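A rough sketch of the command shape described above; the embed helper and all names here are placeholders for illustration, not Ollama's actual code:

```go
package main

import (
	"bufio"
	"encoding/json"
	"fmt"
	"os"
)

// embed is a placeholder for the real embedding call; the actual
// command asks the named model for one vector per input text.
func embed(model, text string) []float32 {
	_ = model // ignored in this sketch
	return []float32{0.1, 0.2, 0.3}
}

func main() {
	if len(os.Args) < 2 {
		fmt.Fprintln(os.Stderr, "usage: embed MODEL [TEXT...]")
		os.Exit(1)
	}
	model, inputs := os.Args[1], os.Args[2:]

	// With no TEXT arguments, read lines from stdin instead, which
	// is what enables the piped, scripted workflows mentioned above.
	if len(inputs) == 0 {
		sc := bufio.NewScanner(os.Stdin)
		for sc.Scan() {
			inputs = append(inputs, sc.Text())
		}
	}

	// One JSON array per line, as the commit message describes.
	for _, text := range inputs {
		out, _ := json.Marshal(embed(model, text))
		fmt.Println(string(out))
	}
}
```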
Daniel Hiltgen authored
The scheduler updates free VRAM based on the currently loaded models. This was mutating the persisted list of GPUs, which, coupled with the non-refreshing logic for Metal, led to stale low-VRAM reporting after unload. The fix is to make sure GPU discovery always returns a copy, so the scheduler's GPU list is in fact ephemeral and doesn't leak any temporary adjustments back into the persistent list.
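The aliasing bug and its fix can be illustrated in a few lines of Go; the GpuInfo type and function names below are illustrative stand-ins, not the actual Ollama code:

```go
package main

import "fmt"

type GpuInfo struct {
	ID       string
	FreeVRAM uint64
}

var cached []GpuInfo // persisted discovery results

// Buggy variant: callers receive the backing array of the
// persisted list, so any adjustment they make leaks back.
func gpusAliased() []GpuInfo { return cached }

// Fixed variant: always hand out a copy, so the scheduler's
// working list stays ephemeral.
func gpusCopied() []GpuInfo {
	out := make([]GpuInfo, len(cached))
	copy(out, cached)
	return out
}

func main() {
	cached = []GpuInfo{{ID: "gpu0", FreeVRAM: 24 << 30}}

	g := gpusAliased()
	g[0].FreeVRAM -= 8 << 30        // scheduler subtracts a model's usage
	fmt.Println(cached[0].FreeVRAM) // persisted value was mutated

	cached[0].FreeVRAM = 24 << 30
	g = gpusCopied()
	g[0].FreeVRAM -= 8 << 30
	fmt.Println(cached[0].FreeVRAM) // persisted value intact
}
```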
Patrick Devine authored
Daniel Hiltgen authored
The behavior change in 0.12.4 is most likely the root cause of hangs some users are seeing. This reverts to the 0.12.3 code, with some added trace logging.
Youdon authored
Daniel Hiltgen authored
Grace authored
* routes/types: add tool call id

---------

Co-authored-by: ParthSareen <parth.sareen@ollama.com>
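A hedged sketch of what such a field can look like; the type and field names below are assumptions based on the commit title, not Ollama's actual api types:

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Illustrative shapes only; the real types live in Ollama's codebase.
type ToolCallFunction struct {
	Name      string         `json:"name"`
	Arguments map[string]any `json:"arguments"`
}

type ToolCall struct {
	// An ID lets a later tool-result message reference the exact
	// call it answers, which matters when a model emits several
	// tool calls in one turn.
	ID       string           `json:"id,omitempty"`
	Function ToolCallFunction `json:"function"`
}

func main() {
	tc := ToolCall{
		ID: "call_0",
		Function: ToolCallFunction{
			Name:      "get_weather",
			Arguments: map[string]any{"city": "Toronto"},
		},
	}
	b, _ := json.Marshal(tc)
	fmt.Println(string(b))
}
```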
Daniel Hiltgen authored
- 04 Nov, 2025 5 commits
Daniel Hiltgen authored
* discovery: only retry AMD GPUs

CUDA and Vulkan don't crash on unsupported devices, so retry isn't necessary. This also refactors the code to shift the library-specific logic into the ml package.

* review comments
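A minimal sketch of that retry policy, assuming the discovery loop knows which library backs each device; all names here, including the "rocm" library tag, are illustrative rather than taken from the actual code:

```go
package main

import (
	"errors"
	"fmt"
)

type device struct {
	name    string
	library string // e.g. "rocm", "cuda", "vulkan"
}

var rocmAttempts int

// probe stands in for real device initialization; here it simulates
// an AMD device that fails once before succeeding.
func probe(d device) error {
	if d.library == "rocm" {
		rocmAttempts++
		if rocmAttempts == 1 {
			return errors.New("transient init failure")
		}
	}
	return nil
}

func discover(devs []device) {
	for _, d := range devs {
		err := probe(d)
		// Only AMD (ROCm) devices warrant a retry: CUDA and Vulkan
		// report unsupported devices without crashing, so a second
		// attempt would just repeat the same answer.
		if err != nil && d.library == "rocm" {
			err = probe(d)
		}
		fmt.Println(d.name, err)
	}
}

func main() {
	discover([]device{
		{name: "gfx1100", library: "rocm"},
		{name: "rtx4090", library: "cuda"},
	})
}
```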
virajwad authored
* PDH free memory skeleton
* Add PDH printing
* Add LUID support for Vulkan
* Wire LUID from ggml-vulkan to the mem-dxgi-pdh file
* Fix to ggml-impl
* Continue skeleton
* Implement ggml_dxgi_pdh_get_device_memory
* Fix comments
* Fix: change value from GB to bytes
* Add ifdefs to support only Windows, not Linux
* Modify error codes
* Finish ggml_dxgi_pdh_init() function
* Complete ggml_dxgi_pdh_release()
* Formatting changes; make functions static
* Fix build errors
* Fix Go build error
* Fix LUID so it now matches between DXGI and Vulkan
* Fix the free memory reporting (was copying by value; changed to reference)
* Keep only dxgi1_2.h
* Modifications based on PR feedback
* Fix merge conflicts (2) and fix desc1.description printout
* Move the DXGI + PDH API calls before the vendor-specific library calls
* Change from 3 samples to 1 sample for PDH
* Modify when old_mode is set
* Add fix for building on macOS
* Fix release and returns for other vendors
* Add patch file
Daniel Hiltgen authored
* app: add code for macOS and Windows apps under 'app'
* app: add readme
* app: windows and linux only for now
* ci: fix ui CI validation

---------

Co-authored-by: jmorganca <jmorganca@gmail.com>
Daniel Hiltgen authored
Also adjusts the Vulkan Windows build pattern to match recent changes in other backends so incremental builds are faster.
Jesse Gross authored
The initial implementation of qwen3-vl:235b exceeded the maximum graph size, which is based on the number of tensors. Although this was later fixed through the use of the mrope operation, we are close to the limit in some cases. This change updates the limit to track current llama.cpp usage of GGML.