1. 25 Apr, 2022 1 commit
    • user4543's avatar
      Bug - Fix bug of duration feature for model benchmarks in distributed mode. (#347) · b5b1c3da
      user4543 authored
      **Description**
      Fix bug of duration feature for model benchmarks in distributed mode.
      
      **Major Revision**
      - Add all_reduce to sync the result of is_finished(the function to judge whether the model benchmark should be stopped) in each step 
        - to avoid inconsistency between different ranks to determine duration end (some rank may enter one more step and can never finish)
      - Add torch.cuda.synchronize() before and after step time measuring in train_step() for all model benchmarks
        - some operations in train_step() maybe async resulting incorrect step time records (for example, lstm) 
      b5b1c3da
  2. 21 Apr, 2022 1 commit
  3. 19 Apr, 2022 1 commit
  4. 18 Apr, 2022 1 commit
  5. 16 Apr, 2022 1 commit
  6. 11 Apr, 2022 2 commits
  7. 10 Apr, 2022 1 commit
  8. 08 Apr, 2022 2 commits
  9. 01 Apr, 2022 1 commit
  10. 31 Mar, 2022 1 commit
  11. 24 Mar, 2022 1 commit
  12. 22 Mar, 2022 1 commit
  13. 21 Mar, 2022 1 commit
  14. 17 Mar, 2022 1 commit
  15. 16 Mar, 2022 1 commit
    • rafsalas19's avatar
      Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324) · ff51a3ce
      rafsalas19 authored
      **Description**
      Modifications adding GPU-Burn to SuperBench.
      - added third party submodule
      - modified Makefile to make gpu-burn binary
      - added/modified microbenchmarks to add gpu-burn python scripts
      - modified default and azure_ndv4 configs to add gpu-burn
      ff51a3ce
  16. 15 Mar, 2022 2 commits
  17. 09 Mar, 2022 1 commit
  18. 07 Mar, 2022 2 commits
  19. 06 Mar, 2022 1 commit
  20. 28 Feb, 2022 2 commits
  21. 25 Feb, 2022 1 commit
  22. 24 Feb, 2022 2 commits
  23. 22 Feb, 2022 1 commit
  24. 21 Feb, 2022 1 commit
  25. 20 Feb, 2022 2 commits
  26. 15 Feb, 2022 2 commits
  27. 10 Feb, 2022 1 commit
  28. 09 Feb, 2022 2 commits
  29. 08 Feb, 2022 2 commits
  30. 07 Feb, 2022 1 commit
    • Ziyue Yang's avatar
      Benchmarks: Revise Code - Reduce result variance in gpu_copy benchmark (#298) · 85389055
      Ziyue Yang authored
      **Description**
      This commit does the following to optimize result variance in gpu_copy benchmark:
      1) Add warmup phase for gpu_copy benchmark to avoid timing instability caused by first-time CUDA kernel launch overhead;
      2) Use CUDA events for timing instead of CPU timestamps;
      3) Make data checking an option that is not preferred to be enabled in performance test;
      4) Enlarge message size in performance benchmark.
      85389055