"cacheflow/model_executor/memory_analyzer.py" did not exist on "721fa3df155e5649bbe2188517594f24f4e63523"
[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py...
[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix (#18100) Signed-off-by:Hongxia Yang <hongxia.yang@amd.com> Signed-off-by:
tjtanaa <tunjian.tan@embeddedllm.com> Co-authored-by:
tjtanaa <tunjian.tan@embeddedllm.com>
Showing
Please register or sign in to comment