1. 04 Feb, 2025 2 commits
  2. 08 Jan, 2025 1 commit
  3. 28 Nov, 2024 2 commits
  4. 27 Nov, 2024 1 commit
  5. 22 Nov, 2024 1 commit
  6. 21 Nov, 2024 2 commits
  7. 20 Nov, 2024 1 commit
  8. 15 Nov, 2024 1 commit
  9. 07 Nov, 2024 2 commits
  10. 06 Nov, 2024 1 commit
    • pdr's avatar
      Dockerfile - Add support for arm64 build (#660) · 47949127
      pdr authored
      Add support for arm64 build:
      
      - Updated dockerfile for arm64 build
      - extend cpu stream compilation for neoverse 
      - handle onnxruntime-gpu installation
      - third party builds filtering based on arch
      - disable cuda decode perf build for non x86
      47949127
  11. 05 Nov, 2024 1 commit
    • pdr's avatar
      Bug Fix - Fix numa error on grace cpu in gpu-copy (#658) · 59d36f7f
      pdr authored
      The current GPU Copy BW Performance fails on Nvidia Grace systems. This
      is due to the memory only numa node and thus the numa_run_on_node fails
      for such nodes and halts completely.
      
      This fix checks for the presence of assigned CPU cores for the numa
      node, on checking if it has no cpu cores assigned, it skips that
      specific node during the args creation and continues.
      59d36f7f
  12. 02 Nov, 2024 1 commit
    • Yifan Xiong's avatar
      CI/CD - Update Image Build Pipeline (#659) · 61770b89
      Yifan Xiong authored
      **Description**
      
      Update image build.
      
      **Major Revision**
      
      * Remove ROCm 6.0 image due to outdated packages
      * Remove build tag for ROCm
      * Preserve build cache for 30 days
      61770b89
  13. 10 Oct, 2024 1 commit
  14. 19 Sep, 2024 1 commit
  15. 20 Aug, 2024 1 commit
  16. 16 Aug, 2024 1 commit
  17. 13 Aug, 2024 1 commit
  18. 08 Aug, 2024 1 commit
  19. 28 Jul, 2024 1 commit
  20. 26 Jul, 2024 2 commits
  21. 25 Jul, 2024 3 commits
  22. 24 Jul, 2024 2 commits
  23. 23 Jul, 2024 1 commit
  24. 22 Apr, 2024 1 commit
  25. 18 Apr, 2024 1 commit
  26. 02 Apr, 2024 1 commit
  27. 21 Mar, 2024 1 commit
  28. 08 Jan, 2024 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.10.0 (#607) · 2c88db90
      Yifan Xiong authored
      **Description**
      
      Cherry-pick bug fixes from v0.10.0 to main.
      
      **Major Revisions**
      
      * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
      * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
      * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
      * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
      * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
      * CI/CD - Add ndv5 topo file #597
      * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
      * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
      * Dockerfile - Bug fix for rocm docker build and deploy #598
      * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
      * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
      * Monitor - U...
      2c88db90
  29. 11 Dec, 2023 1 commit
  30. 10 Dec, 2023 1 commit
  31. 09 Dec, 2023 1 commit
  32. 08 Dec, 2023 1 commit