Commits · 68e04c7ff88dca128016f75dc5bbd2f794bd2028 · OpenDAS / ollama

17 Oct, 2025 1 commit

test: harden scheduler tests (#12662) · 68e04c7f

Daniel Hiltgen authored Oct 17, 2025

* test: harden scheduler tests

This removes reschedDelay, which was stale code, and adds
a configurable timeout for waitForVRAMRecovery so that
tests can set a very short timeout and avoid the scheduler
getting stuck and hitting a test timeout.

* test: tune tests for partial loads

Give stress tests more time when the model is split between CPU/GPU

08 Oct, 2025 1 commit

Integration test tuning (#12492) · 4e5d862e

Daniel Hiltgen authored Oct 08, 2025

Remove some flaky scenarios, and switch to chat for better reliability

22 Sep, 2025 1 commit

tests: add single threaded history test (#12295) · c23e6f4c

Daniel Hiltgen authored Sep 22, 2025

* tests: add single threaded history test

Also tidies up some existing tests to handle more model output variation

* test: add support for testing specific architectures

05 Jul, 2025 1 commit

int: add performance integration tests (#11173) · 4f473e22

Daniel Hiltgen authored Jul 05, 2025

19 Jun, 2025 1 commit

int: add coverage for older models (#11137) · f2527b08

Daniel Hiltgen authored Jun 19, 2025

Verified these fail on 0.9.1 and pass on HEAD.

06 May, 2025 1 commit

Move quantization to new backend (#10363) · 42481045

Daniel Hiltgen authored May 06, 2025

* Move quantization logic to GGML via new backend

This moves the model-aware logic to Go code and calls GGML's quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.

16 Apr, 2025 1 commit

Integration test improvements (#9654) · ed4e1393

Daniel Hiltgen authored Apr 16, 2025

Add some new test coverage for various model architectures,
and switch from orca-mini to the small llama model.
