[fix] FSDP: multi-pass autograd graph and mixed precision (#513)
* FSDP: multi-pass autograd graph and mixed precision - added BACKWARD_PRE/POST checking - better assert_state - fixed issue of backward hook misfiring * fix * cleanup * Update fairscale/nn/data_parallel/fully_sharded_data_parallel.py Co-authored-by:Myle Ott <myleott@fb.com> Co-authored-by:
Myle Ott <myleott@fb.com>
Showing
Please register or sign in to comment