- 22 Jul, 2024 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 20 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.
-
- 17 Jul, 2024 1 commit
-
-
royjhan authored
* add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test
-
- 10 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.
-
- 05 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 04 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
We haven't found a workaround, so for now recommend updating.
-
- 03 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.
-
- 02 Jul, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
* OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By:
Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by:
Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorgan...
-
- 01 Jul, 2024 1 commit
-
-
Eduard authored
Runs fine on a NVIDIA GeForce GTX 1050 Ti
-
- 29 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 28 Jun, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
-
- 19 Jun, 2024 2 commits
-
-
royjhan authored
* API Show Extended * Initial Draft of Information Co-Authored-By:
Patrick Devine <pdevine@sonic.net> * Clean Up * Descriptive arg error messages and other fixes * Second Draft of Show with Projectors Included * Remove Chat Template * Touches * Prevent wrapping from files * Verbose functionality * Docs * Address Feedback * Lint * Resolve Conflicts * Function Name * Tests for api/show model info * Show Test File * Add Projector Test * Clean routes * Projector Check * Move Show Test * Touches * Doc update --------- Co-authored-by:
Patrick Devine <pdevine@sonic.net>
-
Daniel Hiltgen authored
-
- 17 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 16 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
* docs: add missing instruction for powershell build The powershell script for building Ollama on Windows now requires the `ThreadJob` module. Add this to the instructions and dependency list. * Update development.md
-
- 14 Jun, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Patrick Devine authored
-
- 11 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 09 Jun, 2024 2 commits
-
-
Napuh authored
* Added instructions to easily install specific versions on faq.md * Small typo * Moved instructions on how to install specific version to linux.md * Update docs/linux.md * Update docs/linux.md --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
- 07 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 05 Jun, 2024 1 commit
-
-
royjhan authored
* API PS Documentation
-
- 04 Jun, 2024 1 commit
-
-
Shubham authored
* add embed model command and fix question invoke * Update docs/tutorials/langchainpy.md Co-authored-by:
Kim Hallberg <hallberg.kim@gmail.com> * Update docs/tutorials/langchainpy.md --------- Co-authored-by:
Kim Hallberg <hallberg.kim@gmail.com> Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
- 23 May, 2024 2 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
- 21 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 20 May, 2024 1 commit
-
-
alwqx authored
-
- 14 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 13 May, 2024 1 commit
-
-
睡觉型学渣 authored
* Correct typos. * Correct typos.
-
- 09 May, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Carlos Gamez authored
Updated sample code as per warning notification from the package maintainers
-
- 07 May, 2024 1 commit
-
-
boessu authored
fixing pip code.
-
- 06 May, 2024 4 commits
-
-
CrispStrobe authored
* note on naming restrictions else push would fail with cryptic retrieving manifest Error: file does not exist ==> maybe change that in code too * Update docs/import.md --------- Co-authored-by:
C-4-5-3 <154636388+C-4-5-3@users.noreply.github.com> Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Chen authored
-
Mohamed A. Fouad authored
Add -e to viewing logs in order to show end of ollama logs
-
Darinka authored
* Update api.md Changed the calculation of tps (token/s) in the documentation * Update docs/api.md --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-