"vscode:/vscode.git/clone" did not exist on "208b6841d3942b1cf1f120c1caa803a760411897"
[Feature] Support Qwen-7B, dynamic NTK scaling and logN scaling in turbomind (#230)
* qwen support * dynamic ntk & logn attn * fix ntk & add chat template * fix ntk scaling & stop words * fix lint * add tiktoken to requirements.txt * fix tokenizer, set model format automatically * update model.py * update readme * fix lint
Showing
Please register or sign in to comment