1. 04 Mar, 2021 1 commit
  2. 25 Feb, 2021 1 commit
  3. 25 Jan, 2021 1 commit
      fix bugs in syncbn (#46) · 3f49dbf0
      Jeff Daily authored
      - incorrect use of __shfl_down
      - fix warp size assumptions
      - update unit tests to exit on failure
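The warp-size item above can be illustrated with a small simulation. This is only a sketch in plain Python, not the kernel code from the commit: `shfl_down` and `warp_reduce_sum` are hypothetical helpers that mimic a shuffle-down tree reduction, and the 64-lane width is an assumption chosen to show what happens when a reduction hard-codes a 32-lane warp.

```python
def shfl_down(vals, delta):
    """Simulate a shuffle-down: lane i reads lane i + delta.

    Lanes whose source would be out of range keep their own value.
    """
    width = len(vals)
    return [vals[i + delta] if i + delta < width else vals[i] for i in range(width)]


def warp_reduce_sum(vals, start_offset):
    """Tree reduction over simulated lanes; the result is read from lane 0."""
    vals = list(vals)
    offset = start_offset
    while offset > 0:
        shifted = shfl_down(vals, offset)
        vals = [a + b for a, b in zip(vals, shifted)]
        offset //= 2
    return vals[0]


lanes = [1] * 64  # assumed 64-lane width, one value per lane

# Starting offset derived from the actual width: every lane is summed.
print(warp_reduce_sum(lanes, start_offset=len(lanes) // 2))  # 64

# Starting offset hard-coded for a 32-lane warp: half the lanes are missed.
print(warp_reduce_sum(lanes, start_offset=16))  # 32
```

Deriving the starting offset from the real lane count, rather than assuming 32, is the kind of change the sketch is meant to motivate.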
  4. 21 Jan, 2021 2 commits
  5. 19 Jan, 2021 1 commit
  6. 18 Jan, 2021 5 commits
  7. 15 Jan, 2021 1 commit
  8. 31 Dec, 2020 3 commits
  9. 17 Dec, 2020 3 commits
  10. 16 Dec, 2020 1 commit
  11. 15 Dec, 2020 4 commits
  12. 10 Dec, 2020 1 commit
  13. 09 Dec, 2020 2 commits
  14. 04 Dec, 2020 3 commits
  15. 02 Dec, 2020 1 commit
  16. 01 Dec, 2020 1 commit
  17. 04 Nov, 2020 1 commit
  18. 19 Oct, 2020 1 commit
      Optimize the sync batchnorm by batching the communication (#980) · 8a1ed9e8
      lly-zero-one authored
      This PR optimizes the performance of SyncBatchNorm and fixes one potential issue in the welford_parallel kernel implementation.
      
      - Forward path: batch the mean/var/count all_gather communication together so it is sent once.
      - Backward path: batch the all_reduce as well.
      - Add a contiguous call on the input of the welford_parallel kernel.
      
      If there is a standard perf benchmark, I would be happy to run it.
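To make the batched forward-path idea concrete, here is a minimal single-process sketch of how per-rank mean/var/count triples, once gathered together, can be merged into global statistics with the standard parallel Welford combination. It does not use torch.distributed or the actual welford_parallel kernel; `Stats`, `local_stats`, `merge`, and `combine_gathered` are hypothetical names, and the list of triples stands in for the result of a single all_gather.

```python
from dataclasses import dataclass


@dataclass
class Stats:
    mean: float
    var: float  # biased (population) variance
    count: int


def local_stats(xs):
    """Statistics one rank would compute over its own shard."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return Stats(mean, var, n)


def merge(a, b):
    """Combine two partial results (parallel Welford update)."""
    n = a.count + b.count
    if n == 0:
        return Stats(0.0, 0.0, 0)
    delta = b.mean - a.mean
    mean = a.mean + delta * b.count / n
    m2 = a.var * a.count + b.var * b.count + delta * delta * a.count * b.count / n
    return Stats(mean, m2 / n, n)


def combine_gathered(packed):
    """packed: one (mean, var, count) triple per rank, as if gathered in one call."""
    total = Stats(0.0, 0.0, 0)
    for mean, var, count in packed:
        total = merge(total, Stats(mean, var, count))
    return total


# Three simulated ranks with uneven shard sizes.
shards = [[1.0, 2.0, 3.0], [4.0, 5.0], [6.0, 7.0, 8.0, 9.0]]
packed = [(s.mean, s.var, s.count) for s in map(local_stats, shards)]
result = combine_gathered(packed)
print(result.mean, result.var, result.count)
```

Packing the three quantities into one record per rank is what allows a single collective call to replace three separate ones; the merge step itself is unchanged.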
  19. 29 Sep, 2020 1 commit
  20. 15 Sep, 2020 1 commit
  21. 14 Sep, 2020 2 commits
  22. 21 Aug, 2020 1 commit
  23. 18 Aug, 2020 1 commit
  24. 17 Aug, 2020 1 commit