- 10 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.
-
- 05 Jul, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 04 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
We haven't found a workaround, so for now recommend updating.
-
- 03 Jul, 2024 1 commit
-
-
Daniel Hiltgen authored
Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.
-
- 02 Jul, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
* OpenAI v1 models
* Refactor Writers
* Add Test
  Co-Authored-By: Attila Kerekes
* Credit Co-Author
  Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com>
* Empty List Testing
* Use Namespace for Ownedby
* Update Test
* Add back envconfig
* v1/models docs
* Use ModelName Parser
* Test Names
* Remove Docs
* Clean Up
* Test name
  Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
* Add Middleware for Chat and List
* Testing Cleanup
* Test with Fatal
* Add functionality to chat test
* OpenAI: /v1/models/{model} compatibility (#5028)
* Retrieve Model
* OpenAI Delete Model
* Retrieve Middleware
* Remove Delete from Branch
* Update Test
* Middleware Test File
* Function name
* Cleanup
* Test Update
* Test Update
---------
Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
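The `/v1/models` work in this commit targets OpenAI-style clients. As a rough sketch, assuming the list response follows the OpenAI shape (an object with a `data` array whose entries carry `id` and `owned_by`), a client could extract model names as shown here. The helper name `model_ids` and the sample payload are illustrative assumptions, not taken from the commit.

```python
import json


def model_ids(body: str) -> list[str]:
    """Return the model ids from an OpenAI-style list-models response body."""
    payload = json.loads(body)
    # An empty "data" array is a valid response, so default to an empty list.
    return [entry["id"] for entry in payload.get("data", [])]


# Hypothetical response body; real values depend on the models installed locally.
sample = json.dumps({
    "object": "list",
    "data": [
        {"id": "llama3:latest", "object": "model", "created": 0, "owned_by": "library"},
    ],
})
print(model_ids(sample))
```

Keeping the parsing in a pure function makes the empty-list case easy to check without a running server.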
-
- 01 Jul, 2024 1 commit
-
-
Eduard authored
Runs fine on an NVIDIA GeForce GTX 1050 Ti
-
- 29 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 28 Jun, 2024 2 commits
-
-
Daniel Hiltgen authored
-
royjhan authored
-
- 19 Jun, 2024 2 commits
-
-
royjhan authored
* API Show Extended
* Initial Draft of Information
  Co-Authored-By: Patrick Devine <pdevine@sonic.net>
* Clean Up
* Descriptive arg error messages and other fixes
* Second Draft of Show with Projectors Included
* Remove Chat Template
* Touches
* Prevent wrapping from files
* Verbose functionality
* Docs
* Address Feedback
* Lint
* Resolve Conflicts
* Function Name
* Tests for api/show model info
* Show Test File
* Add Projector Test
* Clean routes
* Projector Check
* Move Show Test
* Touches
* Doc update
---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>
-
Daniel Hiltgen authored
-
- 17 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 16 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
* docs: add missing instruction for powershell build
  The powershell script for building Ollama on Windows now requires the `ThreadJob` module. Add this to the instructions and dependency list.
* Update development.md
-
- 14 Jun, 2024 2 commits
-
-
Jeffrey Morgan authored
-
Patrick Devine authored
-
- 11 Jun, 2024 1 commit
-
-
Jeffrey Morgan authored
-
- 09 Jun, 2024 2 commits
-
-
Napuh authored
* Added instructions to easily install specific versions on faq.md
* Small typo
* Moved instructions on how to install specific version to linux.md
* Update docs/linux.md
* Update docs/linux.md
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Morgan authored
-
- 07 Jun, 2024 1 commit
-
-
Michael Yang authored
-
- 05 Jun, 2024 1 commit
-
-
royjhan authored
* API PS Documentation
-
- 04 Jun, 2024 1 commit
-
-
Shubham authored
* add embed model command and fix question invoke
* Update docs/tutorials/langchainpy.md
  Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com>
* Update docs/tutorials/langchainpy.md
---------
Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
- 23 May, 2024 2 commits
-
-
Daniel Hiltgen authored
-
Daniel Hiltgen authored
-
- 21 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 20 May, 2024 1 commit
-
-
alwqx authored
-
- 14 May, 2024 1 commit
-
-
Patrick Devine authored
-
- 13 May, 2024 1 commit
-
-
睡觉型学渣 authored
* Correct typos.
* Correct typos.
-
- 09 May, 2024 3 commits
-
-
Daniel Hiltgen authored
-
Jeffrey Morgan authored
-
Carlos Gamez authored
Updated sample code as per warning notification from the package maintainers
-
- 07 May, 2024 1 commit
-
-
boessu authored
fixing pip code.
-
- 06 May, 2024 6 commits
-
-
CrispStrobe authored
* note on naming restrictions
  else push would fail with cryptic retrieving manifest Error: file does not exist ==> maybe change that in code too
* Update docs/import.md
---------
Co-authored-by: C-4-5-3 <154636388+C-4-5-3@users.noreply.github.com>
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Jeffrey Chen authored
-
Mohamed A. Fouad authored
Add -e to viewing logs in order to show end of ollama logs
-
Darinka authored
* Update api.md
  Changed the calculation of tps (token/s) in the documentation
* Update docs/api.md
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
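For context on the tps calculation this commit touches, here is a minimal sketch. It assumes the API response reports `eval_count` (tokens generated) and `eval_duration` in nanoseconds; the commit message itself does not spell out the formula, so treat the function name and the unit conversion as assumptions to check against docs/api.md.

```python
def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Estimate generation speed in tokens per second.

    Assumes eval_duration_ns is a duration in nanoseconds.
    """
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration_ns must be positive")
    # Convert nanoseconds to seconds, then divide tokens by seconds.
    return eval_count / (eval_duration_ns / 1e9)


# Hypothetical numbers: 100 tokens generated over 2 seconds.
print(tokens_per_second(100, 2_000_000_000))
```

Converting the duration to seconds first keeps the unit handling explicit, which is the part most easily gotten wrong when the raw field is in nanoseconds.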
-
Adrien Brault authored
-
Hyden Liu authored
-
- 05 May, 2024 1 commit
-
-
Daniel Hiltgen authored
This also bumps up the default to be 50 queued requests instead of 10.
-
- 04 May, 2024 1 commit
-
-
Daniel Hiltgen authored
-