- 05 Jul, 2023 1 commit
-
-
q.yao authored
* wip * wip * example finish * fix include and namespace * wtf * install lib * batchize * update cmake install * multithread * fix comment * fix * add mmengine * bind llamamodel --------- Co-authored-by:grimoire <yaoqian@pjlab.org.cn>
-
- 04 Jul, 2023 1 commit
-
-
lvhan028 authored
* check-in script for tokenizing a file * use max_input_len
-
- 20 Jun, 2023 2 commits
-
-
Li Zhang authored
* add ft code * gitignore * fix lint * revert fmha
-
lvhan028 authored
* add scripts for deploying llama family models via fastertransformer * fix * fix * set symlinks True when copying triton models templates * pack model repository for triton inference server * add exception * fix * update config.pbtxt and launching scripts
-
- 18 Jun, 2023 1 commit
-
-
lvhan028 authored
-