1. 03 Dec, 2020 1 commit
    • [feat] AdaScale: Gradient Accumulation and Add PyTest unit tests (#202) · ce5860ea
      Min Xu authored
      * added AdaScale to README
      
      * [adascale] added gradient accumulation
      
      - added gradient accumulation
      - tested with full CIFAR training runs using different accumulation values
      and verified that full accuracy is obtained
      - also removed the patch optimize flag until we need it
      
      * [adascale] adding pytest
      
      - added basic, ddp, and grad_accum tests
      - closes #195
      
      * added changelog
      
      * added ddp grad_accum test
      
      * moved ddp and non-ddp tests into separate files
      
      * added checkpoint test
      
      * more doc
      
      * addressed Mike's comments
  2. 01 Dec, 2020 1 commit
  3. 24 Nov, 2020 1 commit
  4. 30 Oct, 2020 1 commit
  5. 29 Oct, 2020 1 commit
  6. 18 Oct, 2020 1 commit
  7. 16 Oct, 2020 1 commit
  8. 24 Sep, 2020 1 commit
  9. 17 Sep, 2020 4 commits
  10. 14 Sep, 2020 1 commit
  11. 09 Sep, 2020 1 commit
  12. 31 Jul, 2020 1 commit
  13. 08 Jul, 2020 1 commit