- 27 Aug, 2024 1 commit
-
-
Patrick Devine authored
-
- 23 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 19 Aug, 2024 2 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
- 13 Aug, 2024 2 commits
-
-
Bruce MacDonald authored
-
Pamela Fox authored
-
- 12 Aug, 2024 1 commit
-
-
Michael Yang authored
-
- 07 Aug, 2024 2 commits
-
-
royjhan authored
-
Kyle Kelley authored
* Use llama3.1 in tools example * Update api.md
-
- 05 Aug, 2024 2 commits
-
-
frob authored
Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.
-
Michael Yang authored
-
- 02 Aug, 2024 2 commits
- 01 Aug, 2024 4 commits
-
-
Kim Hallberg authored
-
royjhan authored
* docs without usage * no usage * rm metric note
-
royjhan authored
-
royjhan authored
* OpenAI Docs * Update docs/openai.md Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Remove newline --------- Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
- 29 Jul, 2024 2 commits
-
-
Veit Heller authored
-
Jeffrey Morgan authored
-
- 27 Jul, 2024 1 commit
-
-
Tibor Schmidt authored
-
- 26 Jul, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 25 Jul, 2024 3 commits
-
-
Michael Yang authored
Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
royjhan authored
-
- 24 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 23 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 22 Jul, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
royjhan authored
-
- 20 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.
-
- 17 Jul, 2024 1 commit
-
-
royjhan authored
* add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test
-
- 10 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.
-
- 05 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 04 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
We haven't found a workaround, so for now recommend updating.
-
- 03 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.
-
- 02 Jul, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
* OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By:
Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by:
Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorgan...
-
- 01 Jul, 2024 1 commit
-
-
Eduard authored
Runs fine on a NVIDIA GeForce GTX 1050 Ti
-
- 29 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-