- 05 Aug, 2024 2 commits
-
-
frob authored
Users using `journalctl` to get logs for issue logging sometimes don't realize that paging is causing information to be missed.
-
Michael Yang authored
-
- 02 Aug, 2024 2 commits
- 01 Aug, 2024 4 commits
-
-
Kim Hallberg authored
-
royjhan authored
* docs without usage * no usage * rm metric note
-
royjhan authored
-
royjhan authored
* OpenAI Docs * Update docs/openai.md Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Remove newline --------- Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
- 29 Jul, 2024 2 commits
-
-
Veit Heller authored
-
Jeffrey Morgan authored
-
- 27 Jul, 2024 1 commit
-
-
Tibor Schmidt authored
-
- 26 Jul, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 25 Jul, 2024 3 commits
-
-
Michael Yang authored
Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
royjhan authored
-
- 24 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 23 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
-
- 22 Jul, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
royjhan authored
-
- 20 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.
-
- 17 Jul, 2024 1 commit
-
-
royjhan authored
* add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test
-
- 10 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.
-
- 05 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 04 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
We haven't found a workaround, so for now recommend updating.
-
- 03 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.
-
- 02 Jul, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
* OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By:
Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by:
Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorgan...
-
- 01 Jul, 2024 1 commit
-
-
Eduard authored
Runs fine on a NVIDIA GeForce GTX 1050 Ti
-
- 29 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 28 Jun, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
-
- 19 Jun, 2024 2 commits
-
-
royjhan authored
* API Show Extended * Initial Draft of Information Co-Authored-By:
Patrick Devine <pdevine@sonic.net> * Clean Up * Descriptive arg error messages and other fixes * Second Draft of Show with Projectors Included * Remove Chat Template * Touches * Prevent wrapping from files * Verbose functionality * Docs * Address Feedback * Lint * Resolve Conflicts * Function Name * Tests for api/show model info * Show Test File * Add Projector Test * Clean routes * Projector Check * Move Show Test * Touches * Doc update --------- Co-authored-by:
Patrick Devine <pdevine@sonic.net>
-
Daniel Hiltgen authored
-
- 17 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 16 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
* docs: add missing instruction for powershell build The powershell script for building Ollama on Windows now requires the `ThreadJob` module. Add this to the instructions and dependency list. * Update development.md
-
- 14 Jun, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Patrick Devine authored
-
- 11 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-