- 28 Sep, 2023 1 commit
-
-
Michael Yang authored
-
- 12 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
* linux gpu support * handle multiple gpus * add cuda docker image (#488) --------- Co-authored-by:Michael Yang <mxyng@pm.me>
-
- 06 Sep, 2023 1 commit
-
-
Patrick Devine authored
-
- 31 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 30 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
-
- 29 Aug, 2023 1 commit
-
-
Patrick Devine authored
-
- 17 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 10 Aug, 2023 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Patrick Devine authored
-
Bruce MacDonald authored
Co-Authored-By:Jeffrey Morgan <jmorganca@gmail.com>
-
- 08 Aug, 2023 2 commits
-
-
Bruce MacDonald authored
- default to embeddings enabled - move embedding logic for loaded model to request - allow embedding full directory - close llm on reload
-
Jeffrey Morgan authored
Fixes #297 Fixes #296
-
- 07 Aug, 2023 1 commit
-
-
Michael Yang authored
num_keep defines how many tokens to keep in the context when truncating inputs. if left to its default value of -1, the server will calculate num_keep to be the left of the system instructions
-
- 04 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 01 Aug, 2023 2 commits
-
-
Bruce MacDonald authored
- read runner options from map to see what was specified explicitly and overwrite zero values
-
Jeffrey Morgan authored
-
- 28 Jul, 2023 3 commits
-
-
Bruce MacDonald authored
-
Jeffrey Morgan authored
-
Michael Yang authored
-
- 27 Jul, 2023 8 commits
-
-
Michael Yang authored
-
Jeffrey Morgan authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 25 Jul, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 24 Jul, 2023 1 commit
-
-
Patrick Devine authored
-
- 21 Jul, 2023 1 commit
-
-
Patrick Devine authored
-
- 20 Jul, 2023 2 commits
-
-
Patrick Devine authored
-
Bruce MacDonald authored
-
- 19 Jul, 2023 1 commit
-
-
Michael Yang authored
-
- 18 Jul, 2023 1 commit
-
-
Patrick Devine authored
-
- 17 Jul, 2023 1 commit
-
-
Patrick Devine authored
* basic distribution w/ push/pull * add the parser * add create, pull, and push * changes to the parser, FROM line, and fix commands * mkdirp new manifest directories * make `blobs` directory if it does not exist * fix go warnings * add progressbar for model pulls * move model struct --------- Co-authored-by:Jeffrey Morgan <jmorganca@gmail.com>
-
- 14 Jul, 2023 2 commits
-
-
Michael Yang authored
-
Michael Yang authored
feed responses back into the llm
-
- 13 Jul, 2023 1 commit
-
-
Michael Yang authored
-
- 11 Jul, 2023 1 commit
-
-
Michael Yang authored
-