Merge branch 'develop_v2.3' into 'main'
[DCU] remove cudaStreamSynchronize for tp overlap

See merge request dcutoolkit/deeplearing/TransformerEngine!13