1. 15 Apr, 2021 1 commit
  2. 13 Apr, 2021 3 commits
  3. 08 Apr, 2021 1 commit
  4. 07 Apr, 2021 2 commits
  5. 06 Apr, 2021 1 commit
  6. 05 Apr, 2021 1 commit
  7. 04 Apr, 2021 3 commits
  8. 02 Apr, 2021 1 commit
  9. 01 Apr, 2021 1 commit
  10. 31 Mar, 2021 4 commits
    • msbaines's avatar
    • anj-s's avatar
      [offload] Audit OffloadModel API, add error messages and remove redundant code path. (#557) · 34384e1b
      anj-s authored
      * renaming/adding error messages
      
      * address comments
      
      * address comments
      
      * add more comments
      
      * add more comments
      34384e1b
    • Min Xu's avatar
      [fix] FSDP: disable single rank process group for auto_wrap_bn and fixed mixed... · a0458b98
      Min Xu authored
      [fix] FSDP: disable single rank process group for auto_wrap_bn and fixed mixed precision regnet test (#556)
      
      * [fix] disable single rank process group for auto_wrap_bn
      
      - beefed up unit test with regnet-like model
      - found that single-rank process group is causing problem
      - disabled it to enable convergence tests on the vissl side
      - use `raise e from None` to get a better assertion output
        in testing.py.
      
      * [test] fix regnet test for ddp+mixed_precision
      
      - need AMP context in FSDP
      - workaround different between ddp & fsdp when bias=True
      - fixed a bug in input data generation that caused different ranks have
        the same data with wrong iteration count.
      - added TODO for need a better loss and grad_scaler and reduced
        iters so there is no nan.
      - added a (disabled) debugging code
      
      * lint
      
      * lint
      
      * add scaler
      
      * lint
      
      * scaler
      
      * add a real loss
      
      * seeding in the ranks
      
      * blance tests
      
      * run AMP DDP==FSDP test only on cuda version 11 and up
      
      * add relu inplace and comment
      
      * make wrap_bn covers more cases in full precision mode
      a0458b98
    • msbaines's avatar
      acb9ef00
  11. 30 Mar, 2021 1 commit
  12. 29 Mar, 2021 1 commit
  13. 28 Mar, 2021 1 commit
  14. 26 Mar, 2021 1 commit
  15. 25 Mar, 2021 2 commits
  16. 22 Mar, 2021 1 commit
  17. 20 Mar, 2021 1 commit
  18. 19 Mar, 2021 3 commits
  19. 18 Mar, 2021 5 commits
  20. 17 Mar, 2021 2 commits
  21. 15 Mar, 2021 1 commit
  22. 12 Mar, 2021 2 commits
  23. 11 Mar, 2021 1 commit