Few fixes.
* Prevent clearing c_thread_buffer between the GEMMs of consecutive k-dim data tiles.
* Limit the number of launched thread blocks.
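
The two fixes can be illustrated with a minimal CPU-side sketch. This is not the kernel code from this change; it only models the idea under stated assumptions. The function name `tiled_gemm`, the tile sizes, the row-major layout, and the `max_blocks` parameter are all hypothetical. Only `c_thread_buffer` is a name taken from the commit message. The accumulator is zeroed once per output tile and kept across all k-dim tiles, and a capped number of "blocks" loops over the output tiles instead of one block being launched per tile.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical tile sizes, chosen small for illustration only.
constexpr int kTileM = 2, kTileN = 2, kTileK = 2;

// CPU model of C (m x n) = A (m x k) * B (k x n), all row-major.
// max_blocks models the cap on the number of launched thread blocks.
std::vector<float> tiled_gemm(const std::vector<float>& a,
                              const std::vector<float>& b,
                              int m, int n, int k, int max_blocks)
{
    assert(a.size() == static_cast<std::size_t>(m) * k);
    assert(b.size() == static_cast<std::size_t>(k) * n);

    std::vector<float> c(static_cast<std::size_t>(m) * n, 0.0f);
    const int tiles_m = (m + kTileM - 1) / kTileM;
    const int tiles_n = (n + kTileN - 1) / kTileN;
    const int num_tiles = tiles_m * tiles_n;

    // Fix 2: never use more blocks than the cap; each block strides over tiles.
    const int num_blocks = std::max(1, std::min(num_tiles, max_blocks));

    for (int block = 0; block < num_blocks; ++block) {
        for (int tile = block; tile < num_tiles; tile += num_blocks) {
            const int m0 = (tile / tiles_n) * kTileM;
            const int n0 = (tile % tiles_n) * kTileN;

            // Fix 1: clear the accumulator once per output tile,
            // not once per k-dim tile.
            float c_thread_buffer[kTileM][kTileN] = {};

            for (int k0 = 0; k0 < k; k0 += kTileK) {
                const int k_end = std::min(k0 + kTileK, k);
                for (int i = 0; i < kTileM && m0 + i < m; ++i)
                    for (int j = 0; j < kTileN && n0 + j < n; ++j)
                        for (int kk = k0; kk < k_end; ++kk)
                            c_thread_buffer[i][j] +=
                                a[(m0 + i) * k + kk] * b[kk * n + (n0 + j)];
            }

            // Write the finished tile back to C.
            for (int i = 0; i < kTileM && m0 + i < m; ++i)
                for (int j = 0; j < kTileN && n0 + j < n; ++j)
                    c[(m0 + i) * n + (n0 + j)] = c_thread_buffer[i][j];
        }
    }
    return c;
}
```

If the accumulator were instead reset inside the `k0` loop, only the contribution of the last k-dim tile would survive, which is the kind of bug the first fix guards against.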