[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in...
[Bugfix] [ROCm]: Remove assertion logic when using AITER fused moe in unquantizedMethod to reenable LLama4 BF16 (#18205)
Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com>
Showing
Please register or sign in to comment