Merge pull request #835 from ROCmSoftwarePlatform/mha-train-bias
Add bias for flashattention fwd(v2)
Showing
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
This diff is collapsed.
Please register or sign in to comment