TJian authored
[Bug] [ROCm] Fix Llama 4 Enablement Bug on ROCm: V0 ROCmFlashAttentionImpl and Triton Fused MoE bugs (#16198)

Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
Co-authored-by: Hongxia Yang <hongxia.yang@amd.com>
Co-authored-by: kliuae <kuanfu.liu@embeddedllm.com>
2976dc27