"src/include/threadwise_direct_convolution.hpp" did not exist on "0b8e67ef08b28447509fd3e0f26d8e276b06cbf0"
Hotfix LDS data hazard in fused attention (#360)
* avoid LDS data hazard in gemm_softmax_gemm pipeline * trivial refactors * comments * shrink blockwise gemm v2 thread buffer size * reclaim A block lds space when during 2nd gemm * amend * amend
Showing
Please register or sign in to comment