1. 25 Mar, 2023 1 commit
  2. 21 Mar, 2023 1 commit
    • Benchmark - Fix torch.dist init issue with multiple models (#495) · 644b5395
      Yifan Xiong authored
      Fix a potential barrier timeout in init_process_group caused by a race
      condition when the same port is reused. Use a different port for each
      model when running multiple models sequentially in one process.
      For example, when running vgg11/13/16/19, ports 29501 to 29504 are used,
      respectively.
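
The port scheme described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper names `assign_ports` and `init_method_for`, the base port default, and the localhost address are assumptions chosen to reproduce the vgg11/13/16/19 example.

```python
def assign_ports(model_names, base_port=29501):
    """Give each model its own rendezvous port, in run order (hypothetical helper)."""
    return {name: base_port + offset for offset, name in enumerate(model_names)}


def init_method_for(port, host="127.0.0.1"):
    """Build a TCP init_method URL of the form accepted by
    torch.distributed.init_process_group(init_method=...)."""
    return f"tcp://{host}:{port}"


# Each model run in the same process gets a distinct port, so a later
# init_process_group call does not race with a port still held by an earlier one.
ports = assign_ports(["vgg11", "vgg13", "vgg16", "vgg19"])
for name, port in ports.items():
    print(name, init_method_for(port))
```

Deriving the port from the model's position in the run order keeps the assignment deterministic, so every rank computes the same port without extra coordination.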
  3. 28 Jan, 2023 1 commit
  4. 04 Jan, 2023 1 commit
    • Benchmarks - Support FP8 in BERT models (#446) · 5197cdf5
      Yifan Xiong authored
      Support FP8 in PyTorch BERT models:
      
      * add fp8 hybrid/e4m3/e5m2 in precision arguments
      * build BERT encoders with `te.TransformerLayer` to replace
      `transformers.BertModel`
      * wrap forward steps with fp8 autocast
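
As a rough illustration of the last bullet, a forward step might be wrapped in FP8 autocast along these lines. This is a sketch under assumptions, not the code from the commit: the precision strings (`fp8_hybrid`, `fp8_e4m3`, `fp8_e5m2`) and the helper names are hypothetical, and Transformer Engine is imported lazily because it requires a GPU with FP8 support.

```python
# Hypothetical precision strings mapped to Transformer Engine format names;
# the real argument names in the benchmark may differ.
FP8_FORMATS = {"fp8_hybrid": "HYBRID", "fp8_e4m3": "E4M3", "fp8_e5m2": "E5M2"}


def fp8_format_name(precision):
    """Return the FP8 format name for a precision string, or None if not FP8."""
    return FP8_FORMATS.get(precision)


def forward_step(layer, inputs, precision):
    """Run one forward step, using FP8 autocast only for FP8 precisions."""
    name = fp8_format_name(precision)
    if name is None:
        # Non-FP8 precision: call the layer directly.
        return layer(inputs)
    # Imported lazily so the non-FP8 path works without Transformer Engine.
    import transformer_engine.pytorch as te
    from transformer_engine.common import recipe

    fp8_recipe = recipe.DelayedScaling(fp8_format=getattr(recipe.Format, name))
    with te.fp8_autocast(enabled=True, fp8_recipe=fp8_recipe):
        return layer(inputs)
```

On the FP8 path, `layer` would be a module built from `te.TransformerLayer`; any callable works on the non-FP8 path.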
  5. 30 Dec, 2022 1 commit
    • Executor - Add stdout logging util module and enable real-time logging flushing in executor (#445) · 9dfefce3
      Yuting Jiang authored
      **Description**
      Add stdout logging util module and enable real-time logging flushing in executor
      
      **Major Revision**
      - Add stdout logging util module to redirect stdout into a log file
      - Enable stdout logging in executor to write benchmark output to both stdout and the file `sb-bench.log`
      - Enable real-time log flushing in run_command of microbenchmarks through the config `log_flushing`
      
      **Minor Revision**
      - Add log_n_step argument to enable regular step time logging in model benchmarks
      - Update related docs
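
The idea of writing benchmark output to both stdout and a log file, flushing as it goes, can be sketched as below. This is a generic tee pattern under assumptions, not the util module added in this commit: the class name `StdoutTee` and the helper `run_with_log` are hypothetical.

```python
import io
import sys


class StdoutTee:
    """Write every chunk to both the original stream and a log file."""

    def __init__(self, stream, log_file):
        self.stream = stream
        self.log_file = log_file

    def write(self, text):
        self.stream.write(text)
        self.log_file.write(text)
        # Flush on every write so the log can be followed in real time.
        self.flush()
        return len(text)

    def flush(self):
        self.stream.flush()
        self.log_file.flush()


def run_with_log(func, log_path="sb-bench.log"):
    """Run func with stdout mirrored into log_path, then restore stdout."""
    original = sys.stdout
    with open(log_path, "a") as log_file:
        sys.stdout = StdoutTee(original, log_file)
        try:
            return func()
        finally:
            sys.stdout = original
```

Flushing on every write trades some throughput for output that appears in the log as soon as it is printed.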
  6. 29 Apr, 2022 1 commit
    • Release - SuperBench v0.5.0 (#350) · 6681c720
      Yifan Xiong authored
      
      
      **Description**
      
      Cherry-pick bug fixes from v0.5.0 to main.
      
      **Major Revisions**
      
      * Bug - Force to fix ort version as '1.10.0' (#343)
      * Bug - Support no matching rules and unify the output name in result_summary (#345)
      * Analyzer - Support regex in annotations of benchmark naming for metrics in rules (#344)
      * Bug - Fix bugs in sync results on root rank for e2e model benchmarks (#342)
      * Bug - Fix bug of duration feature for model benchmarks in distributed mode (#347)
      * Docs - Upgrade version and release note (#348)
      Co-authored-by: Yuting Jiang <v-yutjiang@microsoft.com>
  7. 01 Apr, 2022 1 commit
  8. 22 Mar, 2022 1 commit
  9. 17 Mar, 2022 1 commit
  10. 06 Mar, 2022 1 commit
  11. 10 Feb, 2022 1 commit
  12. 28 Jan, 2022 1 commit
  13. 19 Jan, 2022 1 commit
  14. 13 Dec, 2021 1 commit
  15. 09 Dec, 2021 1 commit
  16. 28 Sep, 2021 1 commit
  17. 27 Sep, 2021 1 commit
  18. 26 Sep, 2021 1 commit
    • Release - SuperBench v0.3.0 (#212) · dfbd70b1
      Yifan Xiong authored
      
      
      **Description**
      
      Cherry-pick bug fixes from v0.3.0 to main.
      
      **Major Revisions**
      * Docs - Upgrade version and release note (#209)
      * Benchmarks: Build Pipeline - Update rccl-test git submodule to dc1ad48 (#210)
      * Benchmarks: Update - Update benchmarks in configuration file (#208)
      * CI/CD - Update GitHub Action VM (#211)
      * Benchmarks: Fix Bug - Fix wrong parameters for gpu-sm-copy-bw in configuration examples (#203)
      * CI/CD - Fix bug in build image for push event (#205)
      * Benchmark: Fix Bug - fix error message of communication-computation-overlap (#204)
      * Tool: Fix bug - Fix function naming issue in system info (#200)
      * CI/CD - Push images in GitHub Action (#202)
      * Bug - Fix torch.distributed command for single node (#201)
      * CLI - Integrate system info for node (#199)
      * Benchmarks: Code Revision - Revise CMake files for microbenchmarks. (#196)
      * CI/CD - Add ROCm image build in GitHub Actions (#194)
      * Bug: Fix bug - fix bug of hipBusBandwidth build (#193)
      * Benchmarks: Build Pipeline - Restore rocblas build logic (#197)
      * Bug: Fix Bug - Add barrier before 'destroy_process_group' in model benchmarks (#198)
      * Bug - Revise 'docker run' in sb deploy (#195)
      * Bug - Fix Bug: fix wrong param `operations` to `operation` in rccl-bw of hpe config (#190)
      Co-authored-by: Yuting Jiang <v-yujiang@microsoft.com>
      Co-authored-by: Guoshuai Zhao <guzhao@microsoft.com>
      Co-authored-by: Ziyue Yang <ziyyang@microsoft.com>
  19. 06 Aug, 2021 2 commits
  20. 29 Jul, 2021 1 commit
    • Release - SuperBench v0.2.1 (#142) · 69b2c631
      Yifan Xiong authored
      __Description__
      Cherry-pick bug fixes from v0.2.1 to main.
      
      __Major Revisions__
      * Fix bug where VGG models failed on A100 GPU with batch_size=128.
      * Fix Ansible connection issue when running on localhost.
      * Update version in packages and docs.
  21. 28 Jun, 2021 2 commits
  22. 21 Jun, 2021 1 commit
  23. 16 Jun, 2021 1 commit
    • Bug bash - Fix bugs and refine log in single GPU benchmarks (#97) · ddbc51a1
      Yifan Xiong authored
      Fix bugs and refine log in single GPU benchmarks:
      
      * Fix none framework issue
      * Fix empty parameter bug
      * Remove missed mobilenet_v3 models
      * Change benchmark registration log to debug level
      * Add pid in logging
      * Add missing benchmarks in default config
      * Fix deprecated logging warn
  24. 07 Jun, 2021 1 commit
  25. 04 Jun, 2021 1 commit
  26. 19 May, 2021 1 commit
  27. 26 Apr, 2021 2 commits
  28. 20 Apr, 2021 2 commits
  29. 16 Apr, 2021 2 commits
  30. 12 Apr, 2021 1 commit
  31. 08 Apr, 2021 1 commit
  32. 26 Mar, 2021 1 commit
  33. 22 Mar, 2021 2 commits
  34. 18 Mar, 2021 1 commit