- 08 Dec, 2025 1 commit
Michael Yang authored
change to a flatter directory structure and group the options with the function
update models to call rope in one place
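A rough sketch of the idea, using invented names (rope.Options, rope.Apply, the Tensor placeholder) rather than ollama's actual API: the options live next to the function that uses them, and every model rotates its tensors through one call site.

    // Illustrative only: a hypothetical rope package, not ollama's real one.
    package rope

    // Tensor is a placeholder for the backend tensor type.
    type Tensor interface {
        // RoPE stands in for the backend rotation op.
        RoPE(positions Tensor, dim int, base, scale float32) Tensor
    }

    // Options groups every RoPE parameter so models pass one value
    // instead of repeating loose arguments at each call site.
    type Options struct {
        Dim   int     // rotary dimension
        Base  float32 // frequency base (theta)
        Scale float32 // context-extension scale factor
    }

    // Apply is the single place RoPE is invoked; models call this for q and k.
    func Apply(x, positions Tensor, opts Options) Tensor {
        return x.RoPE(positions, opts.Dim, opts.Base, opts.Scale)
    }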
- 29 Oct, 2025 1 commit
Michael Yang authored
- 28 Oct, 2025 1 commit
Michael Yang authored
- 13 Oct, 2025 1 commit
Michael Yang authored
deepseek's qwen3 distill uses a different rope scheme, so support both
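A loose sketch of what supporting both schemes could look like; the type, constant, and method names here are made up for illustration, and the actual layouts and dispatch in the repository may differ.

    // Standalone illustration; not ollama's real types or method names.
    package rope

    // Tensor is a placeholder for the backend tensor type.
    type Tensor interface {
        RotateInterleaved(pos Tensor, dim int, base float32) Tensor // one possible layout
        RotateSplitHalf(pos Tensor, dim int, base float32) Tensor   // another layout
    }

    // RopeType selects which rotation layout a checkpoint expects.
    type RopeType int

    const (
        RopeTypeDefault   RopeType = iota // layout used by the base weights
        RopeTypeAlternate                 // layout used by the distill checkpoint
    )

    // applyRoPE keeps a single call site while handling both layouts.
    func applyRoPE(x, pos Tensor, typ RopeType, dim int, base float32) Tensor {
        if typ == RopeTypeAlternate {
            return x.RotateSplitHalf(pos, dim, base)
        }
        return x.RotateInterleaved(pos, dim, base)
    }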
- 23 Sep, 2025 1 commit
Michael Yang authored
- 18 Sep, 2025 1 commit
Michael Yang authored
* cleanup
* use pooling.TypeNone
* pooling test
* qwen3 embed
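A standalone illustration of pooling modes for an embeddings model; only pooling.TypeNone is named above, and the other variants and all signatures here are assumptions rather than the real pooling package.

    // Sketch only: not ollama's pooling package.
    package pooling

    type Type int

    const (
        TypeNone Type = iota // keep per-token hidden states as-is
        TypeMean             // average token states into one vector (assumed variant)
        TypeLast             // take the final token's state (assumed variant)
    )

    // Pool reduces per-token hidden states according to the pooling type.
    func Pool(hidden [][]float32, t Type) [][]float32 {
        if len(hidden) == 0 {
            return hidden
        }
        switch t {
        case TypeMean:
            out := make([]float32, len(hidden[0]))
            for _, tok := range hidden {
                for i, v := range tok {
                    out[i] += v / float32(len(hidden))
                }
            }
            return [][]float32{out}
        case TypeLast:
            return [][]float32{hidden[len(hidden)-1]}
        default: // TypeNone
            return hidden
        }
    }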
- 17 Sep, 2025 1 commit
Michael Yang authored
* fix(llama): rope scale
* spm llama
* skip moe models
* cleanup
- 16 Sep, 2025 1 commit
Michael Yang authored
* use ggml_*_split activations when possible
* forward qkv
- 15 Sep, 2025 1 commit
Michael Yang authored
this cleans up the model interface slightly without too much impact in other areas
- 11 Jun, 2025 1 commit
Michael Yang authored
while nn.Linear.Forward isn't applicable for sparse MLP, it's still a nice container for the tensors
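A self-contained sketch of the point, with illustrative names rather than the real nn package: the Linear type acts purely as a named home for weight tensors in a sparse MLP, and its Forward is never called because the routed matmul happens elsewhere.

    // Illustration only; not ollama's nn or model packages.
    package sparse

    // Tensor is a placeholder for the backend tensor type.
    type Tensor interface{}

    // Linear mirrors a typical nn.Linear container: a weight and optional bias.
    type Linear struct {
        Weight Tensor
        Bias   Tensor
    }

    // SparseMLP keeps its projections in Linear values purely as named
    // containers; only the experts selected by the router run for each token,
    // so Linear.Forward is never invoked on them.
    type SparseMLP struct {
        Router Linear // produces expert scores per token
        Gate   Linear // stacked per-expert gate weights
        Up     Linear // stacked per-expert up weights
        Down   Linear // stacked per-expert down weights
    }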
- 22 May, 2025 1 commit
Jesse Gross authored
FromFloatSlice and FromIntSlice return an error if the shape doesn't match the passed data or if memory can't be allocated. Since these are inputs, the memory being allocated is system memory rather than VRAM. In many cases, the caller can't really handle the error and panics. Empty and Zeros directly panic if they can't allocate memory. This makes things consistent by panicking for the first two cases, removing a fair amount of error handling code. This is also consistent with how Go typically handles these situations.
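A minimal sketch of the behavior described, with assumed signatures rather than the actual ml package API: the input constructor panics on a shape mismatch, matching Empty and Zeros, so call sites can drop error handling they could not act on anyway.

    // Sketch only; not the real ml package.
    package ml

    import "fmt"

    type Tensor struct {
        data  []float32
        shape []int
    }

    // FromFloatSlice builds an input tensor in system memory. A shape that
    // does not match the data length is a programming error, so it panics
    // instead of returning an error.
    func FromFloatSlice(data []float32, shape ...int) *Tensor {
        n := 1
        for _, d := range shape {
            n *= d
        }
        if n != len(data) {
            panic(fmt.Sprintf("ml: shape %v does not match %d elements", shape, len(data)))
        }
        return &Tensor{data: data, shape: shape}
    }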
- 21 May, 2025 1 commit
Michael Yang authored
* feat: qwen3 dense
* feat: qwen3moe
* fix llama4 moe