Merge branch 'develop_v2.3' into 'main'
[DCU] cudaStreamSynchronize for tp gemm overlap See merge request dcutoolkit/deeplearing/TransformerEngine!11
Showing
Please register or sign in to comment
[DCU] cudaStreamSynchronize for tp gemm overlap See merge request dcutoolkit/deeplearing/TransformerEngine!11