[perf] SyncBatchNorm: avoid 2nd set of all_reduce when wrapped by checkpoint_wrapper (#694)
This change also ensure that we calculate running_{mean,var} correctly
when wrapped.
Showing
Please register or sign in to comment
This change also ensure that we calculate running_{mean,var} correctly
when wrapped.