"driver/include/tensor.hpp" did not exist on "05e046654c9a226444091806a418a77fe0e4a4c2"
- 01 Nov, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 14 Jun, 2024 1 commit
-
-
Daniel Hiltgen authored
adjust timing on some tests so they don't timeout on small/slow GPUs
-
- 23 Apr, 2024 1 commit
-
-
Daniel Hiltgen authored
This change adds support for multiple concurrent requests, as well as loading multiple models by spawning multiple runners. The default settings are currently set at 1 concurrent request per model and only 1 loaded model at a time, but these can be adjusted by setting OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.
-
- 26 Mar, 2024 1 commit
-
-
Patrick Devine authored
-
- 25 Mar, 2024 1 commit
-
-
Daniel Hiltgen authored
If images aren't present, pull them. Also fixes the expected responses
-
- 23 Mar, 2024 1 commit
-
-
Daniel Hiltgen authored
This uplevels the integration tests to run the server which can allow testing an existing server, or a remote server.
-
- 23 Dec, 2023 1 commit
-
-
Daniel Hiltgen authored
This should help CI avoid running the integration test logic in a container where it's not currently possible.
-
- 19 Dec, 2023 1 commit
-
-
Daniel Hiltgen authored
A simple test case that verifies llava:7b can read text in an image
-