1. 14 Apr, 2023 1 commit
  2. 12 Apr, 2023 2 commits
  3. 07 Apr, 2023 1 commit
  4. 06 Apr, 2023 4 commits
  5. 03 Apr, 2023 1 commit
    • guoshzhao's avatar
      Monitor - Fix the cgroup version checking logic. (#502) · 26373edb
      guoshzhao authored
      **Description**
      Looks `grep cgroup /proc/filesystems` doesn't work for NDv4 whose cgroup
      version is v1, but the result of this command got v2 for NDv4. Instead,
      checking the file existence to judge the cgroup version.
      26373edb
  6. 28 Mar, 2023 1 commit
  7. 25 Mar, 2023 1 commit
  8. 24 Mar, 2023 1 commit
  9. 22 Mar, 2023 2 commits
  10. 21 Mar, 2023 2 commits
  11. 20 Mar, 2023 2 commits
  12. 27 Feb, 2023 1 commit
    • Yuting Jiang's avatar
      Benchmarks: Revision - Support flexible warmup and non-random data... · eba298f5
      Yuting Jiang authored
      Benchmarks: Revision - Support flexible warmup and non-random data initialization in cublas-benchmark  (#479)
      
      **Description**
      revise cublas-benchmark for flexible warmup and fill data with fixed
      number for perf test to improve the running efficiency.
      
      **Major Revision**
      - remove num_in_steps for warmup to support more flexible warmup setting
      for users
      - Add support to generate input with fixed number for perf test
      eba298f5
  13. 13 Feb, 2023 2 commits
  14. 28 Jan, 2023 1 commit
  15. 17 Jan, 2023 1 commit
  16. 04 Jan, 2023 3 commits
  17. 03 Jan, 2023 6 commits
  18. 30 Dec, 2022 2 commits
  19. 29 Dec, 2022 1 commit
  20. 14 Dec, 2022 1 commit
  21. 29 Nov, 2022 1 commit
    • Yang Wang's avatar
      Runner - support 'pattern' in 'mpi' mode to run tasks in parallel (#430) · e4eeda0a
      Yang Wang authored
      * add mpi-parallels mode
      
      * update according to comments
      
      * fix and update doc
      
      * update
      
      * merge into 'mpi' mode
      
      * udpate according to comments
      
      * fix testcases
      
      * fix ansible
      
      * regard pattern as field
      
      * udpate
      
      * fix flake8 version
      
      * add flake8 range
      
      * remove map-by from host config
      
      * udpate comments
      e4eeda0a
  22. 01 Nov, 2022 1 commit
    • Yifan Xiong's avatar
      CLI - Add non-zero return code for `sb [deploy,run]` (#425) · 1b86503d
      Yifan Xiong authored
      Add non-zero return code for `sb deploy` and `sb run` command when
      there're Ansible failures in control plane.
      Return code is set to count of failure.
      
      For failures caused by benchmarks, return code is still set per benchmark
      in results json file.
      1b86503d
  23. 31 Oct, 2022 1 commit
  24. 18 Oct, 2022 1 commit