1. 21 Apr, 2026 1 commit
    • Hongtao Zhang's avatar
      CI/CD - Fix setuptools-scm 10.x compatibility for Python 3.12 (#805) · 8c7e2be0
      Hongtao Zhang authored
      
      
      ## Description
      This affects all PRs running `python3 setup.py lint` on the Python 3.12
      CI job.
      
      ## Root Cause
      
      Comparing the last successful cpu-unit-test build (58939, Mar 25) with a
      recent failing build (58996, Apr 14), the Python 3.12 "Install
      dependencies" step shows:
      
      | Package | Successful (Mar 25) | Failing (Apr 14) |
      |---|---|---|
      | `setuptools-scm` | < 10.0 (no `vcs-versioning` dep) | 10.0.5 (requires
      `vcs-versioning`) |
      
      `setuptools-scm` 10.0.5 was released between the two runs and added
      `vcs-versioning` as a new dependency. The `setup_requires` mechanism in
      `setup.py` does not install transitive dependencies, so `vcs-versioning`
      is missing at runtime.
      
      The successful build lint log (Python 3.12): "ModuleNotFoundError: No
      module named 'vcs_versioning'"
      
      This affects all PRs running `python3 setup.py lint` on the Python 3.12
      CI job.
      
      ## Changes
      
      - Add `vcs_versioning` explicitly to `setup_requires` in `setup.py` so
      it is available when `setuptools-scm` is imported during `setup.py`
      execution.
      
      ## Testing
      
      Verified that `setuptools-scm` 10.0.5 declares `vcs-versioning` as a
      dependency, and the CI failure matches the missing transitive dependency
      pattern.
      Co-authored-by: default avatarHongtao Zhang <hongtaozhang@microsoft.com>
      8c7e2be0
  2. 18 Apr, 2026 1 commit
  3. 17 Apr, 2026 1 commit
  4. 28 Jan, 2026 1 commit
  5. 08 Oct, 2025 1 commit
  6. 01 Oct, 2025 1 commit
  7. 12 Aug, 2025 1 commit
  8. 25 Jun, 2025 1 commit
  9. 05 Jun, 2025 1 commit
  10. 30 Apr, 2025 1 commit
  11. 09 Apr, 2025 1 commit
  12. 21 Mar, 2025 1 commit
  13. 12 Mar, 2025 1 commit
    • Hongtao Zhang's avatar
      CI/CD - Update label in the ROCm image build (#693) · 48cd8a3c
      Hongtao Zhang authored
      
      
      Due to the matrix strategy’s default "fail-fast" setting. In GitHub
      Actions, when running a job with a matrix, the individual configurations
      run in parallel. By default, if one matrix job (for example, the one
      labeled "rocm6_2_rocm6_2_x_superbe") fails, the remaining parallel jobs
      are canceled automatically.
      
      In our current build image pipeline, the arm64 build job always are
      canceled by the rocm build job. So, using a non-existent label in the
      job config to prevent rocm build job from scheduling for a temporary
      solution.
      
      ---------
      Co-authored-by: default avatarhongtaozhang <hongtaozhang@microsoft.com>
      48cd8a3c
  14. 07 Mar, 2025 1 commit
  15. 21 Nov, 2024 1 commit
  16. 06 Nov, 2024 1 commit
    • pdr's avatar
      Dockerfile - Add support for arm64 build (#660) · 47949127
      pdr authored
      Add support for arm64 build:
      
      - Updated dockerfile for arm64 build
      - extend cpu stream compilation for neoverse 
      - handle onnxruntime-gpu installation
      - third party builds filtering based on arch
      - disable cuda decode perf build for non x86
      47949127
  17. 02 Nov, 2024 1 commit
    • Yifan Xiong's avatar
      CI/CD - Update Image Build Pipeline (#659) · 61770b89
      Yifan Xiong authored
      **Description**
      
      Update image build.
      
      **Major Revision**
      
      * Remove ROCm 6.0 image due to outdated packages
      * Remove build tag for ROCm
      * Preserve build cache for 30 days
      61770b89
  18. 10 Oct, 2024 1 commit
  19. 28 Jul, 2024 1 commit
  20. 22 Apr, 2024 1 commit
  21. 08 Jan, 2024 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.10.0 (#607) · 2c88db90
      Yifan Xiong authored
      **Description**
      
      Cherry-pick bug fixes from v0.10.0 to main.
      
      **Major Revisions**
      
      * Benchmarks: Microbenchmark - Support different hipblasLt data types in dist_inference #590
      * Benchmarks: Microbenchmark - Support in-place for NCCL/RCCL benchmark #591
      * Bug Fix - Fix NUMA Domains Swap Issue in NDv4 Topology File #592
      * Benchmarks: Microbenchmark - Add data type option for NCCL and RCCL tests #595
      * Benchmarks: Bug Fix - Make metrics of dist-inference-cpp aligned with PyTorch version #596
      * CI/CD - Add ndv5 topo file #597
      * Benchmarks: Microbenchmark - Improve AMD GPU P2P performance with fine-grained GPU memory #593
      * Benchmarks: Build Pipeline - fix nccl and nccl test version to 2.18.3 to resolve hang issue in cuda12.2 docker #599
      * Dockerfile - Bug fix for rocm docker build and deploy #598
      * Benchmarks: Microbenchmark - Adapt to hipblasLt data type changes #603
      * Benchmarks: Micro benchmarks - Update hipblaslt metric unit to tflops #604
      * Monitor - U...
      2c88db90
  22. 09 Dec, 2023 1 commit
  23. 07 Dec, 2023 1 commit
  24. 22 Nov, 2023 1 commit
  25. 22 Aug, 2023 1 commit
  26. 18 Aug, 2023 1 commit
  27. 27 Jul, 2023 1 commit
    • Yuting Jiang's avatar
      Release - SuperBench v0.9.0 (#558) · e1df877b
      Yuting Jiang authored
      **Description**
      Cherry-pick bug fixes from v0.9.0 to main.
      
      **Major Revision**
      - CI/CD: pipeline - clean more disk space to fix rocm building image
      pipeline(#555 )
      - Benchmarks: bug fix - use absolute path for input file in
      DirectXEncodingLatency(#554)
      - CI/CD - add push win docker image on release branch in pipeline (#552)
      - Docs - Upgrade version and release note(#557)
      e1df877b
  28. 05 Jul, 2023 3 commits
  29. 28 Jun, 2023 1 commit
  30. 14 Apr, 2023 1 commit
    • Yifan Xiong's avatar
      Release - SuperBench v0.8.0 (#517) · 51761b3a
      Yifan Xiong authored
      
      
      **Description**
      
      Cherry-pick bug fixes from v0.8.0 to main.
      
      **Major Revisions**
      
      * Monitor - Fix the cgroup version checking logic (#502)
      * Benchmark - Fix matrix size overflow issue in cuBLASLt GEMM (#503)
      * Fix wrong torch usage in communication wrapper for Distributed
      Inference Benchmark (#505)
      * Analyzer: Fix bug in python3.8 due to pandas api change (#504)
      * Bug - Fix bug to get metric from cmd when error happens (#506)
      * Monitor - Collect realtime GPU power when benchmarking (#507)
      * Add num_workers argument in model benchmark (#511)
      * Remove unreachable condition when write host list (#512)
      * Update cuda11.8 image to cuda12.1 based on nvcr23.03 (#513)
      * Doc - Fix wrong unit of cpu-memory-bw-latency in doc (#515)
      * Docs - Upgrade version and release note (#508)
      Co-authored-by: default avatarguoshzhao <guzhao@microsoft.com>
      Co-authored-by: default avatarZiyue Yang <ziyyang@microsoft.com>
      Co-authored-by: default avatarYuting Jiang <yutingjiang@microsoft.com>
      51761b3a
  31. 23 Feb, 2023 1 commit
  32. 29 Dec, 2022 1 commit
  33. 18 Oct, 2022 1 commit
  34. 06 Jul, 2022 1 commit
    • Yifan Xiong's avatar
      Update dependencies and Dockerfile (#371) · 9f03d568
      Yifan Xiong authored
      Update dependencies and Dockerfile:
      * upgrade nccl-tests and rccl-tests to current latest version to match
        NCCL/RCCL versions
      * unify image tag names on DockerHub
      * remove verbose output in Dockerfile and minor fix some flags
      9f03d568
  35. 19 Jun, 2022 1 commit
    • Yifan Xiong's avatar
      Update ROCm Dockerfile (#361) · 483bf782
      Yifan Xiong authored
      **Description**
      
      Update ROCm Dockerfile.
      
      **Major Revisions**
      - Add dockerfile for ROCm 5.1.3
      - Merge 5.1.x and 5.0.x dockerfile
      - Remove 4.2 and 4.0 legacy
      - Update build pipeline accordingly
      483bf782
  36. 25 May, 2022 1 commit
  37. 28 Feb, 2022 1 commit
  38. 25 Feb, 2022 1 commit