- 28 May, 2024 4 commits
-
-
Tai authored
* Add OllamaSpring Project to Readme * Update README.md --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Orfeo Ciano authored
* Adds olpaka flutter client * Update README.md --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
Lei Jitang authored
Signed-off-by:Lei Jitang <leijitang@outlook.com>
-
Rayan Mostovoi authored
small fix on examples/python-simplechat/client.py to actually get a streamed response and get tokens printed as we receive it (#4671)
-
- 26 May, 2024 2 commits
-
-
Jeffrey Morgan authored
Ensure `nvidia` and `nvidia_uvm` kernel modules are loaded in `install.sh` script and at startup (#4652) * ensure kernel modules are loaded in `install.sh` script and at startup * indentation * use `SUDO` variable * restart if nouveau is detected * consistent success message for AMD
-
Jeffrey Morgan authored
-
- 25 May, 2024 3 commits
-
-
Daniel Hiltgen authored
Report better warning on client closed abort of load
-
Daniel Hiltgen authored
If the client closes the connection before we finish loading the model we abort, so lets make the log message clearer why to help users understand this failure mode
-
Michael Yang authored
Fix download retry issue
-
- 24 May, 2024 6 commits
-
-
Michael Yang authored
fix q5_0, q5_1
-
Michael Yang authored
Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
Michael Yang authored
-
Patrick Devine authored
-
Tim Scheuermann authored
-
Jeffrey Morgan authored
-
- 23 May, 2024 7 commits
-
-
Daniel Hiltgen authored
Tidy up developer guide a little
-
Daniel Hiltgen authored
-
Michael Yang authored
-
Daniel Hiltgen authored
Wire up load progress
-
Daniel Hiltgen authored
This doesn't expose a UX yet, but wires the initial server portion of progress reporting during load
-
Bruce MacDonald authored
Co-authored-by:ManniX-ITA <20623405+mann1x@users.noreply.github.com>
-
Jeffrey Morgan authored
* put flash attention behind flag for now * add test * remove print * up timeout for sheduler tests
-
- 22 May, 2024 3 commits
-
-
Michael authored
-
Ikko Eltociear Ashimine authored
PreTokenziers -> PreTokenizers
-
Josh authored
add Ctrl + W shortcut
-
- 21 May, 2024 8 commits
-
-
Josh Yan authored
-
Patrick Devine authored
-
Michael Yang authored
simplify safetensors reading
-
Michael Yang authored
Convert directly from llama3
-
Sang Park authored
The spelling of the term "request" has been corrected, which was previously mistakenly written as "requeset" in the error log message.
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 20 May, 2024 7 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Patrick Devine authored
-
Patrick Devine authored
-
Patrick Devine authored
-
Patrick Devine authored
-
Patrick Devine authored
-