"include/vscode:/vscode.git/clone" did not exist on "389e84a83b1cc3a0ebfe20f7932ab1928f93bad1"
[Feature] Support Qwen-7B, dynamic NTK scaling and logN scaling in turbomind (#230)
* qwen support * dynamic ntk & logn attn * fix ntk & add chat template * fix ntk scaling & stop words * fix lint * add tiktoken to requirements.txt * fix tokenizer, set model format automatically * update model.py * update readme * fix lint
Showing
| ... | ... | @@ -8,6 +8,7 @@ pybind11 |
| safetensors | ||
| sentencepiece | ||
| setuptools | ||
| tiktoken | ||
| torch | ||
| transformers | ||
| tritonclient[all] |
Please register or sign in to comment