Commit 6dec9f61, authored by Lucas Wilkinson, committed by GitHub

[BugFix] Fix DeepGEMM over-allocating workspace (#28254)


Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
parent bf6a3d0f
@@ -215,7 +215,7 @@ class DeepGemmExperts(mk.FusedMoEPermuteExpertsUnpermute):
 )
 assert M_sum % block_m == 0
-workspace1 = (M_sum, max(N, K))
+workspace1 = (M_sum, N)
 workspace2 = (M_sum, max(N // 2, K))
 output = (M, K)
 return (workspace1, workspace2, output)
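
To illustrate the effect of this change, here is a minimal sketch that compares the old and new `workspace1` shapes. The helper `workspace_shapes` and the sizes used are hypothetical and not part of vLLM; only the shape expressions are taken from the diff. The sketch suggests that the saving appears only when `K > N`, since otherwise `max(N, K)` already equals `N`.

```python
from math import prod


def workspace_shapes(M_sum, N, K, fixed=True):
    """Hypothetical helper mirroring the shape expressions in the diff."""
    # Before the fix: (M_sum, max(N, K)); after the fix: (M_sum, N).
    workspace1 = (M_sum, N) if fixed else (M_sum, max(N, K))
    # workspace2 is unchanged by this commit.
    workspace2 = (M_sum, max(N // 2, K))
    return workspace1, workspace2


# Illustrative sizes only (assumed, not taken from the commit).
M_sum, N, K = 1024, 4096, 7168

old_w1, _ = workspace_shapes(M_sum, N, K, fixed=False)
new_w1, _ = workspace_shapes(M_sum, N, K, fixed=True)

# Number of workspace1 elements no longer allocated after the fix.
saved = prod(old_w1) - prod(new_w1)
print(old_w1, new_w1, saved)
```

With these assumed sizes, `workspace1` shrinks from `(1024, 7168)` to `(1024, 4096)`; with `N >= K` the two shapes would be identical.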