[fix] [FSDP] making sure we use full params for multiple backwards within an iteration (#775)
* [bug] [FSDP] making sure we use full params for multiple backwards within an iteration
* changelog
Co-authored-by:
Min Xu <min.xu.public@gmail.com>
Showing
Please register or sign in to comment