ollamarunner: Pass runner performance parameters to backends
Currently the following parameters are in the runner but not used: - numGPULayers - mainGPU - threads - tensorSplit This passes them through to the backend, which is where they would actually get used. However, the GGML backend does not yet do anything with them.
Showing
Please register or sign in to comment