"vscode:/vscode.git/clone" did not exist on "b0e53e2d64db0f37ceb73984b9cfad6ad3577a87"
Enable TP-AG overlap with return_layernorm_output (#727)
* Enable TP-AG overlap with return_layernorm_output Signed-off-by:Jaemin Choi <jaeminc@nvidia.com> * Use ub_overlap_ag Signed-off-by:
Jaemin Choi <jaeminc@nvidia.com> --------- Signed-off-by:
Jaemin Choi <jaeminc@nvidia.com> Co-authored-by:
Jaemin Choi <jaeminc@nvidia.com>
Showing
Please register or sign in to comment