1. 30 Dec, 2021 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.4.0 (#278) · ff563b66
      Yifan Xiong authored
      
      
      __Description__
      
      Cherry-pick  bug fixes from v0.4.0 to main.
      
      __Major Revisions__
      
      * Bug - Fix issues for Ansible and benchmarks (#267)
      * Tests - Refine test cases for microbenchmark (#268)
      * Bug - Build openmpi with ucx support in rocm dockerfiles (#269)
      * Benchmarks: Fix Bug - Fix fio build issue (#272)
      * Docs - Unify metric and add doc for cublas and cudnn functions (#271)
      * Monitor: Revision - Add 'monitor/' prefix to monitor metrics in result summary (#274)
      * Bug - Fix bug of detecting if gpu_index is none (#275)
      * Bug - Fix bugs in data diagnosis (#273)
      * Bug - Fix issue that the root mpi rank may not be the first in the hostfile (#270)
      * Benchmarks: Configuration - Update inference and network benchmarks in configs (#276)
      * Docs - Upgrade version and release note (#277)
      Co-authored-by: default avatarYuting Jiang <v-yutjiang@microsoft.com>
      ff563b66
  2. 14 Dec, 2021 1 commit
  3. 13 Dec, 2021 6 commits
  4. 10 Dec, 2021 5 commits
  5. 09 Dec, 2021 1 commit
  6. 08 Dec, 2021 2 commits
  7. 07 Dec, 2021 1 commit
  8. 06 Dec, 2021 1 commit
  9. 03 Dec, 2021 1 commit
  10. 02 Dec, 2021 3 commits
  11. 01 Dec, 2021 1 commit
  12. 30 Nov, 2021 1 commit
  13. 29 Nov, 2021 1 commit
  14. 26 Nov, 2021 1 commit
  15. 25 Nov, 2021 1 commit
  16. 18 Nov, 2021 1 commit
  17. 15 Nov, 2021 1 commit
    • guoshzhao's avatar
      Benchmarks: Add Feature - Extend the device manager utility to support more functions. (#239) · cc70f9c1
      guoshzhao authored
      **Description**
      Rename `nvidia_helper` utility as `device_manager` module and support more functions:
      ```
      device_manager.get_device_count()
      device_manager.get_device_utilization(idx)
      device_manager.get_device_temperature(idx)
      device_manager.get_device_power_limit(idx)
      device_manager.get_device_memory(idx)
      device_manager.get_device_row_remapped_info(idx)
      device_manager.get_device_ecc_error(idx)
      ```
      cc70f9c1
  18. 12 Nov, 2021 1 commit
  19. 10 Nov, 2021 1 commit
  20. 09 Nov, 2021 2 commits
  21. 30 Oct, 2021 1 commit
  22. 29 Oct, 2021 1 commit
  23. 27 Oct, 2021 2 commits
  24. 22 Oct, 2021 2 commits
  25. 21 Oct, 2021 1 commit