[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in...
[Llama4,Quantization] Simplify and generalize logic for Q/K permutations in quantized self-attn layers (#34471) Signed-off-by:Your Name <you@example.com> Co-authored-by:
Your Name <you@example.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com>
Showing
Please register or sign in to comment