1. 18 Nov, 2020 2 commits
  2. 17 Nov, 2020 1 commit
    • [doc] add AdaScale API doc (#191) · 587b707d
      Min Xu authored
      - removed the experimental warning, since we have validated it on CIFAR and
        ImageNet; the transformer is looking good so far too
      - fixed API doc formatting
      - made it consistent with the other code in the repo
      - tested by building the doc locally and inspecting the results
      587b707d
  3. 16 Nov, 2020 1 commit
  4. 12 Nov, 2020 2 commits
  5. 11 Nov, 2020 2 commits
  6. 10 Nov, 2020 1 commit
    • Single-process control via PipeRPCWrapper (#156) · 5d4f50fb
      Tom Birch authored
      Adds support for:
      * Reused layers (e.g. for weight sharing)
      * Lazily-constructed layers
      * Single-process control via PipeRPCWrapper
      * PipelineStyle.AsyncSchedule, which lays the foundation for asynchronous pipeline work by introducing an event loop for each rank/worker to process either activations or gradients as they arrive
      
      Also added examples for multi-process and PipeRPCWrapper
      5d4f50fb
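
The event-loop idea mentioned in this commit can be illustrated with a minimal, standard-library-only sketch. It is not the fairscale implementation: `Message`, `STOP`, `event_loop`, and the two callbacks are hypothetical names chosen for illustration, and a `queue.Queue` stands in for whatever transport carries tensors between ranks.

```python
import queue
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Message:
    kind: str       # "activation" (forward pass) or "gradient" (backward pass)
    payload: float  # stand-in for a tensor


STOP: Optional[Message] = None  # hypothetical sentinel that ends the loop


def event_loop(
    inbox: "queue.Queue",
    on_activation: Callable[[float], None],
    on_gradient: Callable[[float], None],
) -> List[str]:
    """Handle messages in arrival order until the STOP sentinel is received."""
    handled: List[str] = []
    while True:
        msg = inbox.get()  # blocks until something arrives
        if msg is STOP:
            break
        if msg.kind == "activation":
            on_activation(msg.payload)
        elif msg.kind == "gradient":
            on_gradient(msg.payload)
        else:
            raise ValueError(f"unknown message kind: {msg.kind}")
        handled.append(msg.kind)
    return handled


# Usage: one worker receives interleaved forward and backward work.
inbox: "queue.Queue" = queue.Queue()
for m in [Message("activation", 1.0), Message("gradient", 0.5), Message("activation", 2.0)]:
    inbox.put(m)
inbox.put(STOP)

forward_seen: List[float] = []
backward_seen: List[float] = []
handled = event_loop(inbox, forward_seen.append, backward_seen.append)
```

The point of the design is that the worker does not follow a fixed forward-then-backward schedule; it reacts to whichever kind of work shows up next.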
  7. 06 Nov, 2020 2 commits
  8. 04 Nov, 2020 1 commit
  9. 30 Oct, 2020 2 commits
  10. 29 Oct, 2020 1 commit
  11. 28 Oct, 2020 2 commits
  12. 26 Oct, 2020 1 commit
  13. 23 Oct, 2020 3 commits
  14. 22 Oct, 2020 3 commits
  15. 21 Oct, 2020 7 commits
    • [fix] fixing adascale all_reduce (#155) · 6802ad49
      Min Xu authored
      - Aurick noticed this bug and I ran into it yesterday
      - after the fix, our cifar training now shows the same gain values
        on different replicas:
      
      ```
      20-Oct-20 16:00:19 - DEBUG - rank1 - scale 2, gain ratio 1.3512124098087777
      20-Oct-20 16:00:19 - DEBUG - rank0 - scale 2, gain ratio 1.3512124098087777
      20-Oct-20 16:00:19 - DEBUG - rank1 - timing: data 0:00:00.000600 fwd 0:00:00.003678 loss 0:00:00.000086 bwd 0:00:00.314158 update 0:00:00.002132 rest 0:00:00.000399
      20-Oct-20 16:00:19 - DEBUG - rank0 - timing: data 0:00:00.000643 fwd 0:00:00.003460 loss 0:00:00.000084 bwd 0:00:00.314678 update 0:00:00.002001 rest 0:00:00.000408
      20-Oct-20 16:00:19 - DEBUG - rank1 - scale 2, gain ratio 1.3514997779980324
      20-Oct-20 16:00:19 - DEBUG - rank0 - scale 2, gain ratio 1.3514997779980324
      20-Oct-20 16:00:19 - DEBUG - rank1 - timing: data 0:00:00.000732 fwd 0:00:00.003689 loss 0:00:00.000086 bwd 0:00:00.314176 update 0:00:00.002146 rest 0:00:00.000397
      20-Oct-20 16:00:19 - DEBUG - rank0 - timing: data 0:00:00.000646 fwd 0:00:00.003542 loss 0:00:00.000089 bwd 0:00:00.314549 update 0:00:00.001956 rest 0:00:00.000392
      20-Oct-20 16:00:19 - DEBUG - rank1 - scale 2, gain ratio 1.352149646693932
      20-Oct-20 16:00:19 - DEBUG - rank0 - scale 2, gain ratio 1.352149646693932
      ```
      6802ad49
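
To illustrate why an all-reduce matters for getting identical gain values on every replica, here is a simplified sketch in plain Python. It is an assumption-laden toy, not the fairscale code and not a description of the actual bug: `all_reduce_mean` is a hypothetical stand-in for a collective such as `torch.distributed.all_reduce`, and the gain is approximated by the simple ratio of the mean squared norm of the per-replica gradients to the squared norm of the averaged gradient, which is cruder than the estimator AdaScale really uses.

```python
from typing import List


def all_reduce_mean(values: List[float]) -> List[float]:
    """Hypothetical stand-in for a collective: every replica gets the mean."""
    mean = sum(values) / len(values)
    return [mean] * len(values)


def sq_norm(vec: List[float]) -> float:
    return sum(x * x for x in vec)


def gain_per_replica(local_grads: List[List[float]]) -> List[float]:
    """Return the (simplified) gain ratio as computed on each replica."""
    scale = len(local_grads)
    dim = len(local_grads[0])
    # Averaged gradient, identical on every replica after gradient averaging.
    avg_grad = [sum(g[i] for g in local_grads) / scale for i in range(dim)]
    # Each replica only knows its own local squared norm ...
    local_sq = [sq_norm(g) for g in local_grads]
    # ... so the statistic is all-reduced before it is used.
    reduced_sq = all_reduce_mean(local_sq)
    avg_sq = sq_norm(avg_grad)
    return [r / avg_sq for r in reduced_sq]


# Usage: two replicas (scale 2) with different local gradients.
grads = [[2.0, 1.0], [1.0, 1.0]]
gains = gain_per_replica(grads)

# Without the all-reduce, each replica would use only its own squared norm.
unreduced = [sq_norm(g) / sq_norm([1.5, 1.0]) for g in grads]
```

In this toy, `gains` holds one shared value for both replicas, while `unreduced` holds two different values, which is the kind of divergence the log output above no longer shows.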
    • [feature] OSS: Use MNIST to benchmark (#159) · 6f8a8652
      Benjamin Lefaudeux authored
      * switching to MNIST
      * updating the reference values, should be good to go
      * download dataset once for all processes
      6f8a8652
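
A common way to download a dataset once for all processes is to let a single rank fetch it while the others wait at a barrier. The sketch below shows that pattern with threads standing in for processes; it is an illustration under stated assumptions, not the benchmark's code. `fetch_dataset`, `worker`, and `run` are hypothetical names, and in a `torch.distributed` job the barrier would typically be `torch.distributed.barrier()` instead of `threading.Barrier`.

```python
import tempfile
import threading
from pathlib import Path
from typing import List, Optional, Tuple


def fetch_dataset(path: Path, download_count: List[int]) -> None:
    """Hypothetical stand-in for a real dataset download."""
    path.write_text("fake dataset")
    download_count.append(1)


def worker(
    rank: int,
    barrier: threading.Barrier,
    path: Path,
    download_count: List[int],
    results: List[Optional[str]],
) -> None:
    if rank == 0:
        fetch_dataset(path, download_count)  # only rank 0 downloads
    barrier.wait()  # every rank waits here until rank 0 has finished
    results[rank] = path.read_text()  # now safe to read on all ranks


def run(world_size: int = 4) -> Tuple[int, List[Optional[str]]]:
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "dataset.txt"
        barrier = threading.Barrier(world_size)
        download_count: List[int] = []
        results: List[Optional[str]] = [None] * world_size
        threads = [
            threading.Thread(
                target=worker, args=(rank, barrier, path, download_count, results)
            )
            for rank in range(world_size)
        ]
        for t in threads:
            t.start()
        for t in threads:
            t.join()
        return len(download_count), results


downloads, contents = run(world_size=4)
```

Because rank 0 reaches the barrier only after the write completes, no other rank can read a missing or partially written file.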
    • Update index.rst · 577dcd98
      Vittorio Caggiano authored
      fix max depth
      577dcd98
    • Update index.rst · eb2cabdc
      Vittorio Caggiano authored
      fix maxdepth
      eb2cabdc
    • Update index.rst · 16b50272
      Vittorio Caggiano authored
      16b50272
    • Classification Examples of oss + pipe | tutorials/doc update (#119) · 53043d26
      Vittorio Caggiano authored
      * wip_example
      * [wip]mnist_pipe_example
      * [wip]mnist_pipe_example
      * [wip]mnist_pipe_example
      * [wip]mnist_pipe_example
      * [wip]mnist_oss_example
      * working prototype
      * added tutorial script
      * update tutorial
      * Update mnist_test_oss.py
      * Update mnist_test_oss.py
      * Update mnist_test_oss.py
      * Update mnist_test_pipe.py
      * Update tutorial_oss.py
      * Update tutorial_pipe.py
      * Update tutorial_pipe.py
      * Update mnist_test_oss.py
      * Update tutorial_pipe.py
      * Update mnist_test_pipe.py
      * Update tutorial_pipe.py
      * fix black
      * fix flake8
      * general fixes
      * add example oss+pipe
      * fix isort
      * Update mnist_test_pipe.py
      * fix black

      Co-authored-by: Vittorio Caggiano <caggiano@devfair0253.h2.fair>
      53043d26
    • msbaines
      c6d9be79
  16. 20 Oct, 2020 4 commits
  17. 18 Oct, 2020 1 commit
  18. 17 Oct, 2020 2 commits
  19. 16 Oct, 2020 2 commits