Sequence parallel perf updates (#1437)
* use _all_gather_base
  Signed-off-by: ericharper <complex451@gmail.com>
* use _reduce_scatter_base
  Signed-off-by: ericharper <complex451@gmail.com>
* remove torch empty in backward
  Signed-off-by: ericharper <complex451@gmail.com>
* check self.attn_mask_type
  Signed-off-by: ericharper <complex451@gmail.com>
* remove extra arg
  Signed-off-by: ericharper <complex451@gmail.com>
* update get_tensor_shapes logic
  Signed-off-by: ericharper <complex451@gmail.com>