- 28 May, 2024 8 commits
-
-
Josh authored
* integrated runewidth for display management
* fixed cursor movement for multi-width char
* updated input and deletion of multi-byte chars
* fixed line history with some exceptions
* improved insert and add
* fixed issues with moving across lines
* end of line extra space tracking
* saved changes
* fixed end of line issues with empty spaces
* worked some more
* worked on end of line
* fixed failed test
* fixed minor inserting bug
* fixed movement hotkeys
* adjusted hotkeys
* removed comments
* Update readline/buffer.go Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
* deleted comments and duplicate code
* removed duplicate code
* added comments, refactored add function to use addChar
* added helper to retrieve lineSpacing, renamed lineFlags for clarity
* fixed remove()
---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Jeffrey Morgan authored
-
Daniel Hiltgen authored
Give the final model loading more time
-
Daniel Hiltgen authored
On some systems, 1 minute isn't sufficient to finish the load after it hits 100%. This creates 2 distinct timers, although they're both set to the same value for now so we can refine the timeouts further.
-
Tai authored
* Add OllamaSpring Project to Readme
* Update README.md
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Orfeo Ciano authored
* Adds olpaka flutter client
* Update README.md
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
-
Lei Jitang authored
Signed-off-by: Lei Jitang <leijitang@outlook.com>
-
Rayan Mostovoi authored
small fix on examples/python-simplechat/client.py to actually get a streamed response and print tokens as we receive them (#4671)
-
- 26 May, 2024 2 commits
-
-
Jeffrey Morgan authored
Ensure `nvidia` and `nvidia_uvm` kernel modules are loaded in `install.sh` script and at startup (#4652)
* ensure kernel modules are loaded in `install.sh` script and at startup
* indentation
* use `SUDO` variable
* restart if nouveau is detected
* consistent success message for AMD
-
Jeffrey Morgan authored
-
- 25 May, 2024 3 commits
-
-
Daniel Hiltgen authored
Report better warning on client closed abort of load
-
Daniel Hiltgen authored
If the client closes the connection before we finish loading the model, we abort, so let's make the log message clearer about why, to help users understand this failure mode.
-
Michael Yang authored
Fix download retry issue
-
- 24 May, 2024 6 commits
-
-
Michael Yang authored
fix q5_0, q5_1
-
Michael Yang authored
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>
-
Michael Yang authored
-
Patrick Devine authored
-
Tim Scheuermann authored
-
Jeffrey Morgan authored
-
- 23 May, 2024 7 commits
-
-
Daniel Hiltgen authored
Tidy up developer guide a little
-
Daniel Hiltgen authored
-
Michael Yang authored
-
Daniel Hiltgen authored
Wire up load progress
-
Daniel Hiltgen authored
This doesn't expose a UX yet, but wires up the initial server portion of progress reporting during load.
-
Bruce MacDonald authored
Co-authored-by: ManniX-ITA <20623405+mann1x@users.noreply.github.com>
-
Jeffrey Morgan authored
* put flash attention behind flag for now
* add test
* remove print
* up timeout for scheduler tests
-
- 22 May, 2024 3 commits
-
-
Michael authored
-
Ikko Eltociear Ashimine authored
PreTokenziers -> PreTokenizers
-
Josh authored
add Ctrl + W shortcut
-
- 21 May, 2024 8 commits
-
-
Josh Yan authored
-
Patrick Devine authored
-
Michael Yang authored
simplify safetensors reading
-
Michael Yang authored
Convert directly from llama3
-
Sang Park authored
The spelling of the term "request" has been corrected, which was previously mistakenly written as "requeset" in the error log message.
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 20 May, 2024 3 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Patrick Devine authored
-