[fix]: support pytorch SyncBatchNorm under AMP & checkpointing with FSDP (#659)
* [test]: add a more general test case
- also rebalance the tests a bit
* added missing arg
* balance
* better checking
* balance
* make test smaller and faster
* make ddp results cached and enable sync_bn
* clean up
* fix tests
* changelog
* blance
* fix
* addressing comments
Co-authored-by:
Min Xu <min.xu@acm.org>
Showing
Please register or sign in to comment