"...composable_kernel.git" did not exist on "6063db7dddbc54d01d0617a23c786a106f9e5a39"
1. change blockwise gemm loopover direction from kmn to mnk ( ~1% improvement)
2. change kernel timing mode to 50 warmup + 50 timed repeat
Showing
Please register or sign in to comment