apex/parallel/sync_batchnorm_kernel.py · 8421cfb44630ca285c7b1d5eb6715e1feabea406 · OpenDAS / apex

[syncBN] · fa719e8b

Jie authored Jan 02, 2019

Replace new_group with torch.distributed.group.WORLD, which avoids creating a
new process group in every iteration.

This should resolve issue #105, "Training gets stuck when using SyncBN".
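
To illustrate the idea in the commit message, here is a minimal sketch (not the code from this commit) of reducing a per-process statistic over the default process group instead of building a group on each call. The helper names `allreduce_mean` and `run_single_process_demo` are hypothetical, and the single-process `gloo` setup with a temporary file store is an assumption made only so the example can run on a CPU-only machine.

```python
import os
import tempfile

import torch
import torch.distributed as dist


def allreduce_mean(local_stat: torch.Tensor) -> torch.Tensor:
    """Average a per-process statistic across all ranks (hypothetical helper)."""
    total = local_stat.clone()
    # group.WORLD refers to the default group set up by init_process_group.
    # Reusing it avoids calling dist.new_group() on every forward pass.
    dist.all_reduce(total, op=dist.ReduceOp.SUM, group=dist.group.WORLD)
    return total / dist.get_world_size()


def run_single_process_demo() -> torch.Tensor:
    """Run the reduction in a one-process group (assumption: gloo backend on CPU)."""
    # The file store creates this file itself; it lives in a fresh temp directory.
    store_path = os.path.join(tempfile.mkdtemp(), "dist_store")
    dist.init_process_group(
        backend="gloo",
        init_method=f"file://{store_path}",
        rank=0,
        world_size=1,
    )
    try:
        return allreduce_mean(torch.tensor([1.0, 2.0, 3.0]))
    finally:
        dist.destroy_process_group()
```

With more than one process, each rank would call `allreduce_mean` on its own local statistic after the usual `init_process_group` setup; the one-process demo only shows that the call pattern needs no per-iteration group creation.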

sync_batchnorm_kernel.py 3.67 KB
