[Feature] Support Qwen-7B, dynamic NTK scaling and logN scaling in turbomind (#230)
* qwen support
* dynamic ntk & logn attn
* fix ntk & add chat template
* fix ntk scaling & stop words
* fix lint
* add tiktoken to requirements.txt
* fix tokenizer, set model format automatically
* update model.py
* update readme
* fix lint
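
For readers unfamiliar with the two scaling techniques named in the title, here is a minimal Python sketch of one commonly cited formulation of each. It is not taken from this commit: the function names (`dynamic_ntk_base`, `rotary_inv_freq`, `logn_scale`), the `factor` parameter, and the exact formulas are assumptions for illustration, and the turbomind kernels may compute these quantities differently.

```python
import math


def dynamic_ntk_base(base: float, dim: int, seq_len: int, max_pos: int,
                     factor: float = 1.0) -> float:
    """Return the rotary base to use for a context of `seq_len` tokens.

    Assumed formulation: within the trained length `max_pos` the base is
    unchanged; beyond it, the base grows with the ratio seq_len / max_pos.
    """
    if seq_len <= max_pos:
        return base
    alpha = factor * seq_len / max_pos - (factor - 1.0)
    return base * alpha ** (dim / (dim - 2))


def rotary_inv_freq(base: float, dim: int) -> list[float]:
    """Inverse rotary frequencies, one per pair of head dimensions."""
    return [base ** (-2.0 * i / dim) for i in range(dim // 2)]


def logn_scale(position: int, max_pos: int) -> float:
    """Multiplier applied to the query at a 1-based token position.

    Assumed formulation: 1 inside the trained length, otherwise
    log(position) / log(max_pos).
    """
    if position <= max_pos:
        return 1.0
    return math.log(position) / math.log(max_pos)


if __name__ == "__main__":
    # Hypothetical settings: head dim 128, trained length 2048, base 10000.
    new_base = dynamic_ntk_base(10000.0, 128, seq_len=4096, max_pos=2048)
    print(new_base, rotary_inv_freq(new_base, 128)[:2], logn_scale(4096, 2048))
```

In this sketch the rotary base is recomputed only when the running sequence length exceeds the trained length, so short prompts behave exactly as in the unscaled model.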
@@ -8,6 +8,7 @@ pybind11
 safetensors
 sentencepiece
 setuptools
+tiktoken
 torch
 transformers
 tritonclient[all]