1. 17 Dec, 2020 3 commits
  2. 16 Dec, 2020 6 commits
  3. 15 Dec, 2020 1 commit
  4. 14 Dec, 2020 1 commit
  5. 10 Dec, 2020 2 commits
  6. 09 Dec, 2020 1 commit
  7. 07 Dec, 2020 1 commit
  8. 06 Dec, 2020 1 commit
  9. 05 Dec, 2020 1 commit
  10. 04 Dec, 2020 2 commits
  11. 03 Dec, 2020 1 commit
    • [feat] AdaScale: Gradient Accumulation and Add PyTest unit tests (#202) · ce5860ea
      Min Xu authored
      * added AdaScale to README
      
      * [adascale] added gradient accumulation
      
      - added gradient accumulation
      - tested with full CIFAR training runs using different accumulation values
      and verified that full accuracy is obtained
      - also removed the patch optimize flag until we need it
      
      * [adascale] adding pytest
      
      - added basic, DDP, and grad_accum tests
      - closes #195
      
      * added changelog
      
      * added ddp grad_accum test
      
      * moved ddp and non-ddp tests into separate files
      
      * added checkpoint test
      
      * more doc
      
      * addressed Mike's comments
  12. 02 Dec, 2020 1 commit
  13. 01 Dec, 2020 4 commits
  14. 30 Nov, 2020 1 commit
  15. 27 Nov, 2020 1 commit
  16. 26 Nov, 2020 1 commit
  17. 24 Nov, 2020 1 commit
  18. 22 Nov, 2020 1 commit
  19. 21 Nov, 2020 1 commit
    • [feat] ShardedDataParallel with autoreduce (#157) · ad933b34
      Benjamin Lefaudeux authored
      * rewrite using autograd and Variable execution queue to make the reduce automatic
      * share buckets with OSS to remove duplication
      * some speed is likely still on the table, since the speed vs. bucketing relationship does not match expectations; this could be a follow-up
  20. 20 Nov, 2020 1 commit
  21. 19 Nov, 2020 4 commits
  22. 18 Nov, 2020 2 commits
  23. 17 Nov, 2020 1 commit
    • [doc] add AdaScale API doc (#191) · 587b707d
      Min Xu authored
      - removed the experimental warning, as we have validated it on CIFAR and
      ImageNet; the transformer is looking good so far too.
      - fixed API doc formatting
      - made it consistent with the other code in the repo
      - tested by building the docs locally and inspecting the results
  24. 16 Nov, 2020 1 commit