Commits · 1c6669e64cc8a482fbf1e35c0249f17b35a4e87a · OpenDAS / ollama

23 Jun, 2025 2 commits

Daniel Hiltgen authored Jun 23, 2025

* Re-remove cuda v11

Revert the revert - drop v11 support requiring drivers newer than Feb 23

This reverts commit c6bcdc42.

* Simplify layout

With only one version of the GPU libraries, we can simplify things down somewhat.  (Jetsons still require special handling)

* distinct sbsa variant for linux arm64

This avoids accidentally trying to load the sbsa cuda libraries on
a jetson system which results in crashes.

* temporary prevent rocm+cuda mixed loading

1c6669e6

readme: add ai-hub to community integrations (#11169) · 2bb69b40
AJ authored Jun 23, 2025

2bb69b40

20 Jun, 2025 4 commits

build speedups (#11142) · 65bff664
Daniel Hiltgen authored Jun 20, 2025
```
Enable parallel building of the GPU architectures.
```
65bff664
convert: utility for merging tensors (#11069) · c088ac0e
Michael Yang authored Jun 20, 2025

c088ac0e
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) · 0a066cfd
Michael Yang authored Jun 20, 2025
```
* Reapply "feat: incremental gguf parser (#10822)" (#11114)

This reverts commit a6e64fbd.

* fix older ggufs
```
0a066cfd

ggml: Check return status for computation. · 87b7af6c

Jesse Gross authored Jun 19, 2025

We don't check the return status after computing the graph, which
can silently lead to bad outputs if we try to keep going and future
computation succeeds. This appears to happens in certain cases on
Apple M2 devices.

Fixes #11070

87b7af6c

19 Jun, 2025 1 commit
- int: add coverage for older models (#11137) · f2527b08
  Daniel Hiltgen authored Jun 19, 2025
```
Verified these fail on 0.9.1 and pass on HEAD.
```
  f2527b08
18 Jun, 2025 6 commits
- benchmark: remove unused benchmark test (#11120) · 8bcb3125
  Jeffrey Morgan authored Jun 18, 2025
```
Removes a test under benchmark/ that is unused
```
  8bcb3125
- Revert "Revert "ggml: Export GPU UUIDs" (#11115)" (#11117) · 6baf1e31
  Jeffrey Morgan authored Jun 18, 2025
```
Reverts PR #11115. The original change was mistakingly reverted instead of #10822
```
  6baf1e31
- Revert "ggml: Export GPU UUIDs" (#11115) · ed567ef4
  Jeffrey Morgan authored Jun 18, 2025
```
This reverts commit aaa78180.
```
  ed567ef4
- Revert "feat: incremental gguf parser (#10822)" (#11114) · a6e64fbd
  Jeffrey Morgan authored Jun 18, 2025
```
This reverts commit 6b04cad7.
```
  a6e64fbd
- cache: fix comment function name in cache.go (#11110) · 60cfa2a2
  曹家巧 authored Jun 18, 2025
  
  60cfa2a2
- tools: return empty arguments object instead of null (#11113) · 55bbf3b4
  Jeffrey Morgan authored Jun 18, 2025
  
  55bbf3b4
17 Jun, 2025 1 commit

tools: fix parsing tool calls without any parameters (#11101) · 6bda1d24

Jeffrey Morgan authored Jun 17, 2025

Fixes issue where tool calls that don't expect any parameters were
not being parsed. This also fixes two additional issues: one where
2+ tool calls would not be correctly parsed, and cases where tool calls
with invalid parameters would still get parsed

6bda1d24

16 Jun, 2025 3 commits
- model: treat 'user defined' tokens as special tokens (#11077) · 9e125d88
  Jeffrey Morgan authored Jun 16, 2025
  
  9e125d88
- gguf: fix write order (#11068) · a6fbfc88
  Michael Yang authored Jun 16, 2025
```
* ggml: test write gguf order
* ggml: fix write tensor order
```
  a6fbfc88
- readme: add ollama-launcher to community integrations (#11080) · 50202896
  NGC13009 authored Jun 16, 2025
  
  50202896
14 Jun, 2025 1 commit
- readme: add GPTranslate to community integrations (#11071) · 5a8eb0e1
  Phil authored Jun 14, 2025
  
  5a8eb0e1
12 Jun, 2025 2 commits

tools: loosen tool parsing to allow for more formats (#11030) · 9f8a18ec
Jeffrey Morgan authored Jun 12, 2025

9f8a18ec

feat: incremental gguf parser (#10822) · 6b04cad7

Michael Yang authored Jun 12, 2025



* incremental gguf parser
* gguf: update test to not rely on gguf on disc
* re-use existing create gguf
* read capabilities from gguf kv
* kv exists
* update tests
* s/doneFunc/successFunc/g
* new buffered reader

---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>

6b04cad7

11 Jun, 2025 3 commits

feat: uneven splits (#11048) · 45f56355

Michael Yang authored Jun 11, 2025

The current splitDim function only operates on tensors that are split evenly which isn't always the case, e.g. a QKV tensor. This change allows the function to be used for arbitrary splits

45f56355

skip tokenizer.model if possible (#11050) · 0dabb4ef
Michael Yang authored Jun 11, 2025
```
if tokenizer.json is already copied, skip tokenizer.model
```
0dabb4ef

use nn.Linear in place of ml.Tensor (#11049) · 2e77aa1a

Michael Yang authored Jun 11, 2025

while nn.Linear.Forward isn't applicable for sparse MLP, it's still
a nice container for the tensors

2e77aa1a

10 Jun, 2025 3 commits
- readme: add ollama-multirun to community integrations (#11038) · deaabe29
  Attogram Project authored Jun 10, 2025
  
  deaabe29
- readme: update quickstart link text to Gemma 3 · af21a5ac
  Jeffrey Morgan authored Jun 10, 2025
  
  af21a5ac
- readme: update quickstart example to Gemma 3 · f63d7f68
  Jeffrey Morgan authored Jun 10, 2025
  
  f63d7f68
09 Jun, 2025 1 commit

mac: handle "keep" named apps (#11031) · 82ad1dbc

Daniel Hiltgen authored Jun 09, 2025

When a user elects to keep the existing app, the
new Ollama is named `Ollama 2.app`
This fixes the app startup flow to handle this naming pattern.

82ad1dbc

08 Jun, 2025 1 commit
- spawn desktop quickly (#11011) · feeabdad
  Daniel Hiltgen authored Jun 08, 2025
```
Give the desktop app a hint to start fast.
```
  feeabdad
07 Jun, 2025 2 commits
- docs: update link to AMD drivers in linux.md (#10973) · fc030961
  Krzysztof Jeziorny authored Jun 07, 2025
  
  fc030961
- Revert "server: add model capabilities to the list endpoint (#10174)" (#11004) · 09d308d6
  Jeffrey Morgan authored Jun 06, 2025
```
This reverts commit 09430011.
```
  09d308d6
06 Jun, 2025 4 commits
- launch app hidden (#10962) · a8ed68bd
  Daniel Hiltgen authored Jun 06, 2025
```
When starting the app in the background, start it hidden.
```
  a8ed68bd
- win: handle more than 2048 processes (#10997) · 2ae65ae4
  Daniel Hiltgen authored Jun 06, 2025
```
Fix an array out of bounds crash
```
  2ae65ae4
- move thinking logic into its own package (#10990) · a3b6886b
  Devon Rifkin authored Jun 06, 2025
```
move thinking logic into its own package
```
  a3b6886b
- docs: fix typo in development.md (#10998) · c6a6d729
  Hunter Wittenborn authored Jun 06, 2025
  
  c6a6d729
05 Jun, 2025 2 commits
- Merge pull request #10987 from ollama/drifkin/export-thinking-parser · 2cf007c9
  Devon Rifkin authored Jun 05, 2025
```
export ThinkingParser
```
  2cf007c9
- export ThinkingParser · 0683efa6
  Devon Rifkin authored Jun 05, 2025
  
  0683efa6
04 Jun, 2025 1 commit
- server: add model capabilities to the list endpoint (#10174) · 09430011
  JasonHonKL authored Jun 05, 2025
  
  09430011
31 May, 2025 1 commit
- readme: add SimpleOllamaUnity to community integrations (#10817) · 5c42800f
  HardCodeDev authored May 31, 2025
  
  5c42800f
30 May, 2025 1 commit
- tools: resiliency upgrade to name and arg extraction from template (#10917) · 65f10c28
  Parth Sareen authored May 30, 2025
  
  65f10c28
29 May, 2025 1 commit

ggml: Export GPU UUIDs · aaa78180

Jesse Gross authored Apr 24, 2025

This enables matching up devices and information reported by the backend
with system management libraries such as nvml to get accurate free
memory reporting.

aaa78180