"flash_attn/vscode:/vscode.git/clone" did not exist on "78b7a1dc1869e03a39cf3d2e2d9e5dbb1f669810"
- 20 Jun, 2023 1 commit
-
-
lvhan028 authored
* add scripts for deploying llama family models via fastertransformer * fix * fix * set symlinks True when copying triton models templates * pack model repository for triton inference server * add exception * fix * update config.pbtxt and launching scripts
-
- 18 Jun, 2023 1 commit
-
-
lvhan028 authored
-