"test/srt/test_mla_fp8.py" did not exist on "8207637029082563cab74951fe8d5f86b574b85e"
- 05 Sep, 2023 2 commits
-
-
Bruce MacDonald authored
-
Jeffrey Morgan authored
-
- 30 Aug, 2023 1 commit
-
-
Bruce MacDonald authored
* remove c code * pack llama.cpp * use request context for llama_cpp * let llama_cpp decide the number of threads to use * stop llama runner when app stops * remove sample count and duration metrics * use go generate to get libraries * tmp dir for running llm
-