-
Eldar Kurtić authored
[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers (#34471) Signed-off-by:
Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
ee1d25f1