1. 22 Nov, 2024 1 commit
  2. 21 Nov, 2024 2 commits
  3. 20 Nov, 2024 1 commit
  4. 15 Nov, 2024 1 commit
  5. 07 Nov, 2024 2 commits
  6. 06 Nov, 2024 1 commit
    • pdr's avatar
      Dockerfile - Add support for arm64 build (#660) · 47949127
      pdr authored
      Add support for arm64 build:
      
      - Updated dockerfile for arm64 build
      - extend cpu stream compilation for neoverse 
      - handle onnxruntime-gpu installation
      - third party builds filtering based on arch
      - disable cuda decode perf build for non x86
      47949127
  7. 05 Nov, 2024 1 commit
    • pdr's avatar
      Bug Fix - Fix numa error on grace cpu in gpu-copy (#658) · 59d36f7f
      pdr authored
      The current GPU Copy BW Performance fails on Nvidia Grace systems. This
      is due to the memory only numa node and thus the numa_run_on_node fails
      for such nodes and halts completely.
      
      This fix checks for the presence of assigned CPU cores for the numa
      node, on checking if it has no cpu cores assigned, it skips that
      specific node during the args creation and continues.
      59d36f7f
  8. 02 Nov, 2024 1 commit
    • Yifan Xiong's avatar
      CI/CD - Update Image Build Pipeline (#659) · 61770b89
      Yifan Xiong authored
      **Description**
      
      Update image build.
      
      **Major Revision**
      
      * Remove ROCm 6.0 image due to outdated packages
      * Remove build tag for ROCm
      * Preserve build cache for 30 days
      61770b89
  9. 10 Oct, 2024 1 commit
  10. 19 Sep, 2024 1 commit
  11. 20 Aug, 2024 1 commit
  12. 16 Aug, 2024 1 commit
  13. 13 Aug, 2024 1 commit
  14. 08 Aug, 2024 1 commit
  15. 28 Jul, 2024 1 commit
  16. 26 Jul, 2024 2 commits
  17. 25 Jul, 2024 3 commits
  18. 24 Jul, 2024 2 commits
  19. 23 Jul, 2024 1 commit
  20. 22 Apr, 2024 1 commit
  21. 18 Apr, 2024 1 commit
  22. 02 Apr, 2024 1 commit
  23. 21 Mar, 2024 1 commit
  24. 08 Jan, 2024 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.10.0 (#607) · 2c88db90
      Yifan Xiong authored
      **Description**
      
      Cherry-pick bug fixes from v0.10.0 to main.
      
      **Major Revisions**
      
      * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
      * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
      * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
      * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
      * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
      * CI/CD - Add ndv5 topo file #597
      * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
      * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
      * Dockerfile - Bug fix for rocm docker build and deploy #598
      * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
      * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
      * Monitor - U...
      2c88db90
  25. 11 Dec, 2023 1 commit
  26. 10 Dec, 2023 1 commit
  27. 09 Dec, 2023 1 commit
  28. 08 Dec, 2023 1 commit
  29. 07 Dec, 2023 2 commits
  30. 05 Dec, 2023 1 commit
  31. 04 Dec, 2023 1 commit
  32. 27 Nov, 2023 1 commit
    • guoshzhao's avatar
      Monitor - Add support for AMD GPU. (#580) · 028819b3
      guoshzhao authored
      **Description**
      Add AMD support in monitor.
      
      **Major Revision**
      - Add library pyrsmi to collect metrics.
      - Currently can get device_utilization, device_power, device_used_memory
      and device_total_memory.
      028819b3
  33. 22 Nov, 2023 1 commit