1. Change the blockwise GEMM loop order from kmn to mnk (~1% improvement)
2. Change the kernel timing mode to 50 warm-up runs followed by 50 timed repeats
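
The timing mode in item 2 can be illustrated with a minimal host-side sketch. This is not the code from this commit: `time_kernel`, its parameters, and the use of `std::chrono` are assumptions made for illustration. A real GPU benchmark would typically use device events and synchronize before reading the clock.

```cpp
#include <chrono>

// Hypothetical helper (not taken from the commit): calls `kernel` n_warmup
// times without timing, then n_repeat times with timing, and returns the
// mean wall-clock time per timed call in milliseconds.
// Assumption: `kernel` is synchronous. An asynchronous GPU launch would need
// a device synchronization before each clock read.
template <typename Kernel>
double time_kernel(Kernel&& kernel, int n_warmup = 50, int n_repeat = 50)
{
    // Warm-up runs: let caches, clocks and lazy initialization settle.
    for(int i = 0; i < n_warmup; ++i)
        kernel();

    const auto start = std::chrono::steady_clock::now();
    for(int i = 0; i < n_repeat; ++i)
        kernel();
    const auto stop = std::chrono::steady_clock::now();

    const std::chrono::duration<double, std::milli> elapsed = stop - start;
    return elapsed.count() / n_repeat;
}
```

Calling `time_kernel([&] { run_gemm(); })`, where `run_gemm` is a placeholder for the kernel launch, would execute it 100 times in total and average only the last 50. Discarding the warm-up runs keeps one-time startup costs out of the reported figure, which matters when the change being measured is on the order of 1%.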