"vllm/vscode:/vscode.git/clone" did not exist on "67cee40da035b7478483c76dfbe0bfc321c3822f"
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for...
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) (#34243) Signed-off-by:Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com>
Showing
Please register or sign in to comment