- 06 May, 2024 15 commits
-
-
Daniel Hiltgen authored
Support Fedoras standard ROCm location
-
Daniel Hiltgen authored
Fix stale test logic
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
The model processing was recently changed to be deferred but this test scenario hadn't been adjusted for that change in behavior.
-
Adrien Brault authored
-
Nurgo authored
-
Daniel Hiltgen authored
Skip PhysX cudart library
-
Daniel Hiltgen authored
Add CUDA Driver API for GPU discovery
-
Tony Loehr authored
Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
Hyden Liu authored
-
Saif authored
-
Jeffrey Morgan authored
* fix llava models not working after first request * individual requests only for llava models
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Centralize server config handling
-
- 05 May, 2024 9 commits
-
-
Daniel Hiltgen authored
This moves all the env var reading into one central module and logs the loaded config once at startup which should help in troubleshooting user server logs
-
alwqx authored
-
Lord Basil - Automate EVERYTHING authored
-
Jeffrey Morgan authored
-
Bernardo de Oliveira Bruning authored
--------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
Patrick Devine authored
-
Daniel Hiltgen authored
Make maximum pending request configurable
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This also bumps up the default to be 50 queued requests instead of 10.
-
- 04 May, 2024 4 commits
-
-
Daniel Hiltgen authored
Explain the 2 different windows download options
-
Daniel Hiltgen authored
-
Michael Yang authored
omit prompt and generate settings from final response
-
Michael Yang authored
-
- 03 May, 2024 8 commits
-
-
Daniel Hiltgen authored
Fix lint warnings
-
Daniel Hiltgen authored
-
Michael Yang authored
rename parser to model/file
-
Dr Nic Williams authored
* Update 'llama2' -> 'llama3' in most places --------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
Daniel Hiltgen authored
For some reason this library gives incorrect GPU information, so skip it
-
Daniel Hiltgen authored
Soften timeouts on sched unit tests
-
Daniel Hiltgen authored
This gives us more headroom on the scheduler tests to tamp down some flakes.
-
Michael Yang authored
refactor modelfile parser
-
- 02 May, 2024 3 commits
-
-
Michael Yang authored
fix line ending
-
Michael Yang authored
replace CRLF with LF
-
Jeffrey Morgan authored
-
- 01 May, 2024 1 commit
-
-
Daniel Hiltgen authored
-