"examples/git@developer.sourcefind.cn:OpenDAS/dgl.git" did not exist on "6cbdf37c37904146c4baf12c212d05d824ecd3da"
Add gridwise GEMM pipeline (#89)
* clean up * add mutilple thread scratch to ThreadwiseTensorSliceTransfer_v3r1 * add 2 stage prefetch * add more sanity check into transform_tensor_descriptor * tweak * enabling 2 stage prefetch to exsiting gridwise gemm; tweak * enabling 2 stage prefetch to exsiting gridwise gemm * move gridwise gemm pipeline in class; clean up * add some irregular tile size * update CalculateHasMainK0BlockLoop for multi-stage-prefetch * refactor gridwise gemm pipeline class
Showing
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment