update scripts for deploying llama family model to fastertransformer triton models (#4)
* add scripts for deploying llama family models via fastertransformer * fix * fix * set symlinks True when copying triton models templates * pack model repository for triton inference server * add exception * fix * update config.pbtxt and launching scripts
Showing
Please register or sign in to comment