Commits · a6fbfc880c3de9b57e341db374907e2fedda9fa6 · OpenDAS / ollama

"official/projects/triviaqa/dataset.py" did not exist on "d7e9ece3b84b855896df7023f6881269ae417b30"

16 Jun, 2025 1 commit
- gguf: fix write order (#11068) · a6fbfc88
  Michael Yang authored Jun 16, 2025
```
* ggml: test write gguf order
* ggml: fix write tensor order
```
  a6fbfc88
19 May, 2025 1 commit

ggml: Seperate tensor load from backend creation · 94ab428e

Jesse Gross authored Apr 17, 2025

Currently, when the backend is created, the tensors are loaded at the
same time, which is a slow operation. This separates them to be two
steps:
 - Create backend, including enumerating tensors and memory allocation
 - Loading tensor data

This allows more flexibility in managing model loading.

94ab428e

06 May, 2025 1 commit

Move quantization to new backend (#10363) · 42481045

Daniel Hiltgen authored May 06, 2025

* Move quantization logic to GGML via new backend

This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.

42481045

01 May, 2025 1 commit

fix: write gguf padding (#10510) · a7835c67

Michael Yang authored Apr 30, 2025

* add gguf_test

* fix padding

padding was being added to offset but not to the running count

a7835c67