- 06 May, 2024 4 commits
-
-
Saif authored
-
Jeffrey Morgan authored
* fix llava models not working after first request
* individual requests only for llava models
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Centralize server config handling
-
- 05 May, 2024 9 commits
-
-
Daniel Hiltgen authored
This moves all the env var reading into one central module and logs the loaded config once at startup, which should help when troubleshooting user server logs.
-
alwqx authored
-
Lord Basil - Automate EVERYTHING authored
-
Jeffrey Morgan authored
-
Bernardo de Oliveira Bruning authored
Co-authored-by: Patrick Devine <patrick@infrahq.com>
-
Patrick Devine authored
-
Daniel Hiltgen authored
Make maximum pending request configurable
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
This also bumps up the default to be 50 queued requests instead of 10.
-
- 04 May, 2024 4 commits
-
-
Daniel Hiltgen authored
Explain the 2 different Windows download options
-
Daniel Hiltgen authored
-
Michael Yang authored
omit prompt and generate settings from final response
-
Michael Yang authored
-
- 03 May, 2024 7 commits
-
-
Daniel Hiltgen authored
Fix lint warnings
-
Daniel Hiltgen authored
-
Michael Yang authored
rename parser to model/file
-
Dr Nic Williams authored
* Update 'llama2' -> 'llama3' in most places
Co-authored-by: Patrick Devine <patrick@infrahq.com>
-
Daniel Hiltgen authored
Soften timeouts on sched unit tests
-
Daniel Hiltgen authored
This gives us more headroom on the scheduler tests to tamp down some flakes.
-
Michael Yang authored
refactor modelfile parser
-
- 02 May, 2024 3 commits
-
-
Michael Yang authored
fix line ending
-
Michael Yang authored
replace CRLF with LF
-
Jeffrey Morgan authored
-
- 01 May, 2024 13 commits
-
-
Bryce Reitano authored
* Add a /clear command
* change help messages
Co-authored-by: Patrick Devine <patrick@infrahq.com>
-
Michael Yang authored
server: destination invalid
-
Michael Yang authored
types/model: fix name for hostport
-
Michael Yang authored
-
alwqx authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Daniel Hiltgen authored
Fix/issue 3736: When runners are closing or expiring, the scheduler gets dirty VRAM size readings.
-
Jeffrey Morgan authored
-
Mark Ward authored
-
Mark Ward authored
-
Mark Ward authored
Log while waiting for the process to stop, to help debug cases where other tasks execute during this wait. The expire timer clears its timer reference because it will not be reused. Close cleans up expireTimer if the calling code has not already done so.
-