Merge pull request #1039 from ROCmSoftwarePlatform/mha-train-develop-outaccu
Use Flash-Attention-2 method to do output accumulation for infer/forward
Showing
Please register or sign in to comment
Use Flash-Attention-2 method to do output accumulation for infer/forward