[Kernel][Performance] Enable smaller Scaling Factor tiling for NVFP4 small-batch decoding (#30885)
Signed-off-by:LopezCastroRoberto <roberto.lopez.castro@udc.es> Signed-off-by:
Roberto L. Castro <38211239+LopezCastroRoberto@users.noreply.github.com> Signed-off-by:
LopezCastroRoberto <rocastro@redhat.com>
Showing
Please register or sign in to comment