"docs/vscode:/vscode.git/clone" did not exist on "877a88c57e5f25cec3c9b3748bd0525fceec4908"
[Qwen3-Next] switch to triton and cache conv states to accelerate MTP from 300...
[Qwen3-Next] switch to triton and cache conv states to accelerate MTP from 300 tok/s to 341 tok/s (#10335)
Co-authored-by:
Binyao Jiang <byjiang1996@gmail.com>
Showing
Please register or sign in to comment