- 06 May, 2024 7 commits
-
-
Daniel Hiltgen authored
Add CUDA Driver API for GPU discovery
-
Tony Loehr authored
Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
Hyden Liu authored
-
Saif authored
-
Jeffrey Morgan authored
* fix llava models not working after first request * individual requests only for llava models
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Centralize server config handling
-
- 05 May, 2024 9 commits
-
-
Daniel Hiltgen authored
This moves all the env var reading into one central module and logs the loaded config once at startup which should help in troubleshooting user server logs
-
alwqx authored
-
Lord Basil - Automate EVERYTHING authored
-
Jeffrey Morgan authored
-
Bernardo de Oliveira Bruning authored
--------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
Patrick Devine authored
-
Daniel Hiltgen authored
Make maximum pending request configurable
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This also bumps up the default to be 50 queued requests instead of 10.
-
- 04 May, 2024 4 commits
-
-
Daniel Hiltgen authored
Explain the 2 different windows download options
-
Daniel Hiltgen authored
-
Michael Yang authored
omit prompt and generate settings from final response
-
Michael Yang authored
-
- 03 May, 2024 7 commits
-
-
Daniel Hiltgen authored
Fix lint warnings
-
Daniel Hiltgen authored
-
Michael Yang authored
rename parser to model/file
-
Dr Nic Williams authored
* Update 'llama2' -> 'llama3' in most places --------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
Daniel Hiltgen authored
Soften timeouts on sched unit tests
-
Daniel Hiltgen authored
This gives us more headroom on the scheduler tests to tamp down some flakes.
-
Michael Yang authored
refactor modelfile parser
-
- 02 May, 2024 3 commits
-
-
Michael Yang authored
fix line ending
-
Michael Yang authored
replace CRLF with LF
-
Jeffrey Morgan authored
-
- 01 May, 2024 10 commits
-
-
Bryce Reitano authored
* Add a /clear command * change help messages --------- Co-authored-by:Patrick Devine <patrick@infrahq.com>
-
Michael Yang authored
server: destination invalid
-
Michael Yang authored
types/model: fix name for hostport
-
Michael Yang authored
-
alwqx authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Daniel Hiltgen authored
Fix/issue 3736: When runners are closing or expiring. Scheduler is getting dirty VRAM size readings.
-
Jeffrey Morgan authored
-