[zero] optimize the optimizer step time (#4221)
* optimize the optimizer step time
* fix corner case
* polish
* replace all-reduce with all-gather
* set comm device to cuda