ollamarunner: Provide mechanism for backends to report loading progress

This enables the runner to report progress back to the Ollama server, both for showing status to the user and also to prevent the server from killing the runner if it thinks things have stalled. Most of the infrastructure was already there, this extends it to be available to the backends.

ollamarunner: Provide mechanism for backends to report loading progress
This enables the runner to report progress back to the Ollama server, both for showing status to the user and also to prevent the server from killing the runner if it thinks things have stalled. Most of the infrastructure was already there, this extends it to be available to the backends.
0ff28758 · Jesse Gross · Jesse Gross · d3e9ca3e · 0ff28758 · 0ff28758
Commit 0ff28758 authored Mar 20, 2025 by Jesse Gross Committed by Jesse Gross Mar 21, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 7 additions and 0 deletions

ml/backend.go ml/backend.go +4 -0

runner/ollamarunner/runner.go runner/ollamarunner/runner.go +3 -0

No files found.
--- a/ml/backend.go
+++ b/ml/backend.go
@@ -60,6 +60,10 @@ type CacheConfig struct {

 // BackendParams controls how the backend loads and executes models
 type BackendParams struct {
+	// Progress is a callback function that allows reporting percentage completion
+	// of model loading
+	Progress func(float32)
+
 	// NumThreads sets the number of threads to use if running on the CPU
 	NumThreads int


--- a/runner/ollamarunner/runner.go
+++ b/runner/ollamarunner/runner.go
@@ -783,6 +783,9 @@ func Execute(args []string) error {
 	}

 	params := ml.BackendParams{
+		Progress: func(progress float32) {
+			server.progress = progress
+		},
 		NumThreads:     *threads,
 		NumGPULayers:   *numGPULayers,
 		MainGPU:        *mainGPU,