1. 02 Nov, 2024 1 commit
    • Yifan Xiong's avatar
      CI/CD - Update Image Build Pipeline (#659) · 61770b89
      Yifan Xiong authored
      **Description**
      
      Update image build.
      
      **Major Revision**
      
      * Remove ROCm 6.0 image due to outdated packages
      * Remove build tag for ROCm
      * Preserve build cache for 30 days
      61770b89
  2. 10 Oct, 2024 1 commit
  3. 19 Sep, 2024 1 commit
  4. 20 Aug, 2024 1 commit
  5. 16 Aug, 2024 1 commit
  6. 13 Aug, 2024 1 commit
  7. 08 Aug, 2024 1 commit
  8. 28 Jul, 2024 1 commit
  9. 26 Jul, 2024 2 commits
  10. 25 Jul, 2024 3 commits
  11. 24 Jul, 2024 2 commits
  12. 23 Jul, 2024 1 commit
  13. 22 Apr, 2024 1 commit
  14. 18 Apr, 2024 1 commit
  15. 02 Apr, 2024 1 commit
  16. 21 Mar, 2024 1 commit
  17. 08 Jan, 2024 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.10.0 (#607) · 2c88db90
      Yifan Xiong authored
      **Description**
      
      Cherry-pick bug fixes from v0.10.0 to main.
      
      **Major Revisions**
      
      * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
      * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
      * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
      * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
      * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
      * CI/CD - Add ndv5 topo file #597
      * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
      * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
      * Dockerfile - Bug fix for rocm docker build and deploy #598
      * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
      * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
      * Monitor - U...
      2c88db90
  18. 11 Dec, 2023 1 commit
  19. 10 Dec, 2023 1 commit
  20. 09 Dec, 2023 1 commit
  21. 08 Dec, 2023 1 commit
  22. 07 Dec, 2023 2 commits
  23. 05 Dec, 2023 1 commit
  24. 04 Dec, 2023 1 commit
  25. 27 Nov, 2023 1 commit
    • guoshzhao's avatar
      Monitor - Add support for AMD GPU. (#580) · 028819b3
      guoshzhao authored
      **Description**
      Add AMD support in monitor.
      
      **Major Revision**
      - Add library pyrsmi to collect metrics.
      - Currently can get device_utilization, device_power, device_used_memory
      and device_total_memory.
      028819b3
  26. 22 Nov, 2023 4 commits
  27. 20 Nov, 2023 1 commit
  28. 14 Nov, 2023 1 commit
  29. 07 Nov, 2023 1 commit
  30. 05 Nov, 2023 1 commit
  31. 23 Oct, 2023 1 commit
  32. 22 Aug, 2023 1 commit