Remove vllm ops scaled fp8 quant and accelerate per token quant by 20-28% (#4215)
Co-authored-by: Stefan He <bhe@linkedin.com>