Commits · 517807cdf29d2c8d22bc748a2cfde2b61bd67c98 · OpenDAS / ollama

29 Aug, 2025 1 commit

perf: build graph for next batch async to keep GPU busy (#11863) · 517807cd

Daniel Hiltgen authored Aug 29, 2025

* perf: build graph for next batch in parallel to keep GPU busy

This refactors the main run loop of the ollama runner to perform the main
GPU-intensive tasks (Compute+Floats) in a goroutine, so the next batch can be
prepared in parallel. This reduces the amount of time the GPU stalls waiting
for the next batch of work.

* tests: tune integration tests for ollama engine

This tunes the integration tests to focus more on models supported
by the new engine.

517807cd
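The overlap described in the perf change above can be illustrated with a minimal Go sketch. This is not the ollama runner code: `graph`, `buildGraph`, `compute`, and `runPipelined` are hypothetical names, and the "GPU" step is simulated with plain arithmetic. It only shows the general shape of the technique, assuming the heavy step runs in a goroutine while the caller prepares the next batch.

```go
package main

import "fmt"

// graph stands in for a prepared compute graph (hypothetical type).
type graph struct {
	batch int
}

// buildGraph simulates the CPU-side preparation of one batch (hypothetical).
func buildGraph(batch int) graph { return graph{batch: batch} }

// compute simulates the GPU-intensive step for one prepared graph (hypothetical).
func compute(g graph) int { return g.batch * 2 }

// runPipelined computes batch i in a goroutine while the calling goroutine
// builds the graph for batch i+1, then waits for the result of batch i.
func runPipelined(batches []int) []int {
	results := make([]int, 0, len(batches))
	if len(batches) == 0 {
		return results
	}
	next := buildGraph(batches[0])
	for i := range batches {
		cur := next // the goroutine only reads its own copy
		done := make(chan int, 1)
		go func() { done <- compute(cur) }()

		// Overlap: prepare the following batch while compute is running.
		if i+1 < len(batches) {
			next = buildGraph(batches[i+1])
		}
		results = append(results, <-done)
	}
	return results
}

func main() {
	fmt.Println(runPipelined([]int{1, 2, 3})) // prints [2 4 6]
}
```

In this sketch the channel receive is the only synchronization point, so results still arrive in batch order even though preparation and compute overlap.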

16 Apr, 2025 1 commit

Integration test improvements (#9654) · ed4e1393

Daniel Hiltgen authored Apr 16, 2025

Add some new test coverage for various model architectures,
and switch from orca-mini to the small llama model.

ed4e1393

08 Apr, 2025 1 commit

fix(integration): move waitgroup Add(1) outside goroutine to avoid potential issue (#10070) · e7019c94

CYJiang authored Apr 09, 2025

Signed-off-by: googs1025 <googs1025@gmail.com>

e7019c94

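The kind of issue this fix title refers to can be sketched in Go as follows. This is a generic illustration, not the code changed in #10070; `countDone` is a hypothetical helper. If `wg.Add(1)` is called inside the goroutine, `wg.Wait()` may run before the goroutine has been scheduled and return early, so the call belongs before the `go` statement.

```go
package main

import (
	"fmt"
	"sync"
)

// countDone starts n goroutines and returns how many finished before
// Wait returned. With Add(1) placed before each go statement the
// result is always n. (Hypothetical helper for illustration.)
func countDone(n int) int {
	var (
		wg   sync.WaitGroup
		mu   sync.Mutex
		done int
	)
	for i := 0; i < n; i++ {
		// Register the goroutine before it starts. Calling Add inside
		// the goroutine would race with wg.Wait() below.
		wg.Add(1)
		go func() {
			defer wg.Done()
			mu.Lock()
			done++
			mu.Unlock()
		}()
	}
	wg.Wait()
	return done
}

func main() {
	fmt.Println(countDone(8)) // prints 8
}
```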
02 Apr, 2025 1 commit

chore(all): replace instances of interface with any (#10067) · 9876c9fa

Bruce MacDonald authored Apr 02, 2025

Both interface{} and any (an alias for interface{}, introduced in Go 1.18) represent the empty interface, which every type satisfies.

9876c9fa
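As a small illustration of why this replacement does not change behavior, the Go sketch below uses both spellings interchangeably. `describeOld` and `describeNew` are hypothetical functions written for this example and do not come from the repository.

```go
package main

import "fmt"

// describeOld uses the pre-1.18 spelling of the empty interface.
func describeOld(v interface{}) string { return fmt.Sprintf("%T", v) }

// describeNew uses the any alias; the parameter type is identical.
func describeNew(v any) string { return fmt.Sprintf("%T", v) }

func main() {
	// Because any is an alias, func(interface{}) string and
	// func(any) string are the same type, so this assignment compiles.
	var f func(any) string = describeOld

	fmt.Println(f(42), describeNew("x")) // prints int string
}
```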

10 Dec, 2024 1 commit

all: fix typos in documentation, code, and comments (#7021) · abfdc471

Stefan Weil authored Dec 10, 2024

abfdc471

22 Nov, 2024 1 commit

tests: fix max queue integration test (#7782) · f0a35181

Daniel Hiltgen authored Nov 22, 2024

This had fallen out of sync with the envconfig behavior, where max queue default was not zero.

f0a35181

05 Aug, 2024 1 commit

fix concurrency test · 7ed36741

Michael Yang authored Aug 05, 2024

7ed36741

22 Jul, 2024 1 commit

int · 0f191012

Michael Yang authored Jul 03, 2024

0f191012

16 May, 2024 1 commit

Skip max queue test on remote · 7f2fbad7

Daniel Hiltgen authored May 16, 2024

This test needs to be able to adjust the queue size down from
our default setting for a reliable test, so it needs to skip on
remote test execution mode.

7f2fbad7

05 May, 2024 1 commit

Add integration test to push max queue limits · 45d61aaa

Daniel Hiltgen authored May 05, 2024

45d61aaa