Add option to use DeepGemm contiguous grouped gemm kernel for fused MoE operations. (#13932)

Signed-off-by: Bill Nell <bnell@redhat.com>