1. 08 Oct, 2020 1 commit
    • Min Xu's avatar
      [test] Add unittest for checkpoint & DDP (#126) · 6658be22
      Min Xu authored
      * Add unittest for checkpoint & DDP
      
      - this change adds test cases to reproduce the error with checkpoint & DDP
      - mandeep mentioned that there is also deadlock in this case, but this
        change doesn't cover that.
      - we cover cases where weight sharing is OK
      - however, same module multiple checkpoint or find_unused_parameters are
        both not OK
      
      * added norm checks
      6658be22
  2. 06 Oct, 2020 2 commits
  3. 05 Oct, 2020 1 commit
  4. 02 Oct, 2020 1 commit
  5. 01 Oct, 2020 3 commits
  6. 29 Sep, 2020 1 commit
  7. 24 Sep, 2020 3 commits
  8. 22 Sep, 2020 3 commits
  9. 17 Sep, 2020 6 commits
  10. 16 Sep, 2020 2 commits
  11. 15 Sep, 2020 2 commits
  12. 14 Sep, 2020 1 commit
  13. 12 Sep, 2020 1 commit
  14. 11 Sep, 2020 1 commit
  15. 10 Sep, 2020 3 commits
  16. 09 Sep, 2020 7 commits
  17. 08 Sep, 2020 1 commit
    • Benjamin Lefaudeux's avatar
      [feat] OSS: Sync all attributes (#67) · 5a268b25
      Benjamin Lefaudeux authored
      Make sure that all attributes (not just LR) are in sync in between the OSS.param_groups and the actual wrapped optimizer. Some frameworks make it possible to alter any attribute on a scheduled basis, which proves useful depending on the optimizer, so the keys need to be generically supported (not just "lr"). Not syncing these attributes is a worst case scenario, since these adjustments are silently not propagated, fixing that. 
      5a268b25
  18. 04 Sep, 2020 1 commit