- 19 Dec, 2023 3 commits
-
-
Daniel Hiltgen authored
Run the server.cpp directly inside the Go runtime via cgo while retaining the LLM Go abstractions.
-
Bruce MacDonald authored
-
Bruce MacDonald authored
- remove ggml runner - automatically pull gguf models when ggml detected - tell users to update to gguf in the case automatic pull fails Co-Authored-By:Jeffrey Morgan <jmorganca@gmail.com>
-
- 18 Dec, 2023 2 commits
-
-
Bruce MacDonald authored
-
Patrick Devine authored
-
- 15 Dec, 2023 3 commits
-
-
Patrick Devine authored
-
Patrick Devine authored
-
Patrick Devine authored
-
- 14 Dec, 2023 1 commit
-
-
Bruce MacDonald authored
* restore model load duration on generate response - set model load duration on generate and chat done response - calculate createAt time when response created * remove checkpoints predict opts * Update routes.go
-
- 13 Dec, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 12 Dec, 2023 2 commits
-
-
Patrick Devine authored
-
Jeffrey Morgan authored
-
- 11 Dec, 2023 1 commit
-
-
Patrick Devine authored
--------- Co-authored-by:Matt Apperson <mattapperson@Matts-MacBook-Pro.local>
-
- 10 Dec, 2023 4 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 09 Dec, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 08 Dec, 2023 3 commits
-
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
- 05 Dec, 2023 10 commits
-
-
Bruce MacDonald authored
-
Michael Yang authored
-
Michael Yang authored
-
Bruce MacDonald authored
-
Michael Yang authored
-
Jeffrey Morgan authored
This reverts commit 7a0899d6.
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
previous layer creation was not ideal because: 1. it required reading the input file multiple times, once to calculate the sha256 checksum, another to write it to disk, and potentially one more to decode the underlying gguf 2. used io.ReadSeeker which is prone to user error. if the file isn't reset correctly or in the right place, it could end up reading an empty file there are also some brittleness when reading existing layers else writing the inherited layers will error reading an already closed file this commit aims to fix these issues by restructuring layer creation. 1. it will now write the layer to a temporary file as well as the hash function and move it to the final location on Commit 2. layers are read once once when copied to the destination. exception is raw model files which still requires a second read to decode the model metadata
-
Michael Yang authored
-
- 04 Dec, 2023 1 commit
-
-
Bruce MacDonald authored
- update chat docs - add messages chat endpoint - remove deprecated context and template generate parameters from docs - context and template are still supported for the time being and will continue to work as expected - add partial response to chat history
-
- 01 Dec, 2023 1 commit
-
-
Joshua Pham authored
-
- 30 Nov, 2023 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
- 29 Nov, 2023 5 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Bruce MacDonald authored
-
Timothy Jaeryang Baek authored
Co-authored-by: rootedbox
-
Patrick Devine authored
-