fix bwd error of context parallelism implementation with FA v2 (#498)
fix bwd error with FA v2 Signed-off-by:Xiaowei Ren <xren@nvidia.com> Co-authored-by:
Kirthi Shankar Sivamani <ksivamani@nvidia.com>
Showing
Please register or sign in to comment