[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py...
[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix (#18100) Signed-off-by:Hongxia Yang <hongxia.yang@amd.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
Showing
Please register or sign in to comment