[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for...
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) (#34243) Signed-off-by:Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com>
Showing
Please register or sign in to comment