Merge pull request #1060 from ROCmSoftwarePlatform/mha-train-develop-fix-d-calculate
Fix y and y_grad data vector load size for flash attention