-
Eldar Kurtić authored
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) (#34243) Signed-off-by:
Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com>
11c7ace3
[Bugfix] Enable attn quantization of Llama-4 by correctly permuting scales for rope (int8, fp8) (#34243) Signed-off-by:Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com>