[minor] clarify a comment (#673)
- we do have a use case of empty params inside a FSDP -- for the overlapping fsdp unit test, we use it to measure timing of compute when no params is needed for all_gather - therefore, I updated the comment to be more correct there. - fixes #661
Showing
Please register or sign in to comment