"packaging/vscode:/vscode.git/clone" did not exist on "d367a01a18a3ae6bee13d8be3b63fd6a581ea46f"
Remove vllm ops scaled fp8 quant and accelerate per token quant by 20-28% (#4215)
Co-authored-by:
Stefan He <bhe@linkedin.com>
Showing
Please register or sign in to comment