- 20 Sep, 2023 6 commits
-
-
Michael Yang authored
-
Michael Yang authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
Bruce MacDonald authored
-
- 18 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
* subprocess improvements - increase start-up timeout - when runner fails to start fail rather than timing out - try runners in order rather than choosing 1 runner - embed metal runner in metal dir rather than gpu - refactor logging and error messages * Update llama.go * Update llama.go * simplify by using glob
-
- 14 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
* enable packaging multiple cuda versions * use nvcc cuda version if available --------- Co-authored-by:Michael Yang <mxyng@pm.me>
-
- 12 Sep, 2023 2 commits
-
-
Bruce MacDonald authored
-
Bruce MacDonald authored
* linux gpu support * handle multiple gpus * add cuda docker image (#488) --------- Co-authored-by:Michael Yang <mxyng@pm.me>
-
- 07 Sep, 2023 1 commit
-
-
Bruce MacDonald authored
-
- 06 Sep, 2023 2 commits
-
-
Jeffrey Morgan authored
-
Jeffrey Morgan authored
-
- 05 Sep, 2023 2 commits
-
-
Bruce MacDonald authored
-
Jeffrey Morgan authored
-
- 30 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
-
- 26 Aug, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 25 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 14 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 13 Aug, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 10 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 03 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 01 Aug, 2023 1 commit
-
-
Michael Yang authored
-
- 28 Jul, 2023 1 commit
-
-
Jeffrey Morgan authored
-
- 27 Jul, 2023 1 commit
-
-
Michael Yang authored
-
- 20 Jul, 2023 1 commit
-
-
Michael Yang authored
-
- 11 Jul, 2023 1 commit
-
-
Michael Yang authored
-