"vscode:/vscode.git/clone" did not exist on "5c4930cda058ece236ff5ac58265111d32cc6260"
Sequence parallel perf updates (#1437)
* use _all_gather_base Signed-off-by:ericharper <complex451@gmail.com> * use _reduce_scatter_base Signed-off-by:
ericharper <complex451@gmail.com> * remove torch empty in backward Signed-off-by:
ericharper <complex451@gmail.com> * check self.attn_mask_type Signed-off-by:
ericharper <complex451@gmail.com> * remove extra arg Signed-off-by:
ericharper <complex451@gmail.com> * update get_tensor_shapes logic Signed-off-by:
ericharper <complex451@gmail.com>
Showing
Please register or sign in to comment