[Kernel] [V1] Fix performance regression for triton unified attention (#18161)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>