  1. 24 Mar, 2021 1 commit
    • Initial check-in of the transducer extensions (#1069) · d86d1b09
      Nan Zheng authored
      * Initial check-in of the transducer extension.
      
      * Added more comments to help explain the code
      
      * Corrected minor typos
      
      * 1. Renamed variable in tests to match the extension
      2. Disabled ninja build option
  2. 23 Feb, 2021 1 commit
  3. 10 Feb, 2021 1 commit
  4. 20 Jan, 2021 1 commit
  5. 17 Dec, 2020 2 commits
  6. 04 Dec, 2020 3 commits
  7. 02 Dec, 2020 1 commit
  8. 01 Dec, 2020 1 commit
  9. 19 Oct, 2020 1 commit
    • Optimize the sync batchnorm by batching the communication (#980) · 8a1ed9e8
      lly-zero-one authored
      This PR optimizes the performance of SyncBatchNorm and fixes a potential issue in the welford_parallel kernel implementation.
      
      To improve performance, we batch the mean/var/count all_gather communication and send it once in the forward path.
      We also batch the all_reduce in the backward path.
      We add a contiguous call on the input of the welford_parallel kernel.
      If there is a standard perf benchmark, I would be happy to run it.
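The batching described in this commit message can be illustrated with a minimal sketch. It is not the code from the PR: it only shows the general idea of packing per-channel mean, variance, and the sample count into one flat buffer so that a single gather replaces three. The helper names (`pack_stats`, `unpack_stats`, `fake_all_gather`) and the buffer layout are assumptions made for illustration, and the collective is simulated in plain Python rather than run through a real process group.

```python
from typing import List, Tuple


def pack_stats(mean: List[float], var: List[float], count: int) -> List[float]:
    """Concatenate per-channel mean, per-channel variance and the sample
    count into one flat buffer (hypothetical layout: mean | var | count)."""
    if len(mean) != len(var):
        raise ValueError("mean and var must have one entry per channel")
    return list(mean) + list(var) + [float(count)]


def unpack_stats(buf: List[float], num_channels: int) -> Tuple[List[float], List[float], int]:
    """Split a packed buffer back into (mean, var, count)."""
    if len(buf) != 2 * num_channels + 1:
        raise ValueError("unexpected buffer length")
    mean = buf[:num_channels]
    var = buf[num_channels:2 * num_channels]
    count = int(buf[-1])
    return mean, var, count


def fake_all_gather(local_buffers: List[List[float]]) -> List[List[float]]:
    """Stand-in for a collective: returns a copy of every rank's buffer.
    A real implementation would use a distributed all_gather instead."""
    return [list(b) for b in local_buffers]


# Two hypothetical ranks with two channels each and different sample counts.
rank_buffers = [
    pack_stats([0.5, 1.0], [0.25, 0.5], 4),
    pack_stats([1.5, 2.0], [0.75, 1.0], 8),
]

# One gather moves all three statistics for every rank at once.
gathered = fake_all_gather(rank_buffers)
stats = [unpack_stats(buf, num_channels=2) for buf in gathered]
```

Each entry of `stats` is a `(mean, var, count)` triple for one rank, ready to be merged into global statistics.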
  10. 29 Sep, 2020 1 commit
  11. 15 Sep, 2020 1 commit
  12. 14 Sep, 2020 2 commits
  13. 15 Aug, 2020 1 commit
  14. 10 Aug, 2020 1 commit
  15. 06 Aug, 2020 1 commit
  16. 05 Aug, 2020 1 commit
  17. 01 Aug, 2020 1 commit
  18. 30 Jul, 2020 1 commit
  19. 23 Jul, 2020 1 commit
  20. 22 Jul, 2020 3 commits
  21. 21 Jul, 2020 1 commit
  22. 20 Jul, 2020 3 commits
  23. 16 Jul, 2020 2 commits
  24. 09 Jul, 2020 1 commit
  25. 06 Jul, 2020 1 commit
    • [sync BN] (#792) · 1ff54b8f
      jjsjann123 authored
      * [sync BN]
      
      Support non-uniform batch sizes across the process group.
      
      TODO: tests should be added once the code is cleaned up.
      
      * updating unit tests
      
      * new unit tests for different inputs
      
      * cleaning
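Why non-uniform batch sizes matter can be shown with a small sketch. When ranks hold different numbers of samples, per-rank statistics cannot simply be averaged; each rank's contribution has to be weighted by its count. The code below uses the standard pairwise formula for combining means and variances (the approach behind Welford-style parallel updates). It is an illustration in plain Python under that assumption, not the kernel from this commit, and `local_stats` and `merge_stats` are hypothetical names.

```python
from typing import Iterable, Sequence, Tuple

Stats = Tuple[float, float, int]  # (mean, biased variance, sample count)


def local_stats(xs: Sequence[float]) -> Stats:
    """Mean, biased variance and count of one rank's samples."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return mean, var, n


def merge_stats(stats: Iterable[Stats]) -> Stats:
    """Combine per-rank statistics, weighting each rank by its count."""
    total_n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the merged mean
    for mean_b, var_b, n_b in stats:
        if n_b == 0:
            continue  # a rank with no samples contributes nothing
        delta = mean_b - mean
        new_n = total_n + n_b
        mean += delta * n_b / new_n
        m2 += var_b * n_b + delta * delta * total_n * n_b / new_n
        total_n = new_n
    if total_n == 0:
        raise ValueError("no samples on any rank")
    return mean, m2 / total_n, total_n


# Hypothetical ranks with different batch sizes (3 samples vs. 1 sample).
rank0 = [1.0, 2.0, 3.0]
rank1 = [10.0]

merged = merge_stats([local_stats(rank0), local_stats(rank1)])
reference = local_stats(rank0 + rank1)  # statistics over all samples at once
```

Here `merged` agrees with `reference` up to floating-point rounding, whereas an unweighted average of the two per-rank means would give 6.0 instead of the true mean of 4.0.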
  26. 01 Jul, 2020 1 commit
  27. 30 Jun, 2020 1 commit
  28. 23 Jun, 2020 4 commits