- 02 Nov, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 19 Oct, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
add error for falcon and starcoder vocab compatibility --------- Co-authored-by:Bruce MacDonald <brucewmacdonald@gmail.com>
-
- 13 Oct, 2023 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
- 11 Oct, 2023 1 commit
-
-
Michael Yang authored
-
- 05 Oct, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 25 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
--------- Co-authored-by:Michael Yang <mxyng@pm.me>
-
- 21 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
* remove tmp directories created by previous servers * clean up on server stop * Update routes.go * Update server/routes.go Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> * create top-level temp ollama dir * check file exists before creating --------- Co-authored-by:
Jeffrey Morgan <jmorganca@gmail.com> Co-authored-by:
Michael Yang <mxyng@pm.me>
-
- 12 Sep, 2023 1 commit
-
-
Michael Yang authored
get model and file type from bin file
-
- 07 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 30 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
-
- 26 Aug, 2023 2 commits
-
-
Michael Yang authored
warning F16 uses significantly more memory than quantized model so the standard requires don't apply.
-
Michael Yang authored
-
- 17 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 14 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 10 Aug, 2023 4 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-
Michael Yang authored
-