1. 23 Apr, 2026 1 commit
      Benchmarks: Add gpu-hpl and gpu-hpl-mxp micro benchmarks (#15) · 4fa10f4d
      one authored
      Add gpu-hpl and gpu-hpl-mxp micro benchmarks backed by rocHPL and rocHPL-MxP.
      
      Implement a shared GPU HPL base that:
      - Generates per-workload HPL dat files and parses the corresponding output files.
      - Supports common HPL inputs such as process grid, matrix size, block size, broadcast topology, warmup, iterations, and reduce operator.
      - Adds rocHPL-specific tuning parameters for gpu-hpl.
      - Formats metric keys from input-derived workload attributes.
      - Reports `flops`, `time`, and `tests_pass` metrics with warmup-aware aggregation.
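The warmup-aware aggregation mentioned above could look roughly like the sketch below. This is not the code from the commit: the helper names `aggregate_runs` and `summarize`, the per-iteration record layout, and the choice of an arithmetic mean over the non-warmup iterations are all assumptions made for illustration. Only the metric names `flops`, `time`, and `tests_pass` come from the commit message.

```python
from statistics import mean


def aggregate_runs(values, warmup=0):
    """Average per-iteration values, skipping the first `warmup` entries.

    Hypothetical helper: the real benchmark may aggregate differently.
    """
    if warmup < 0:
        raise ValueError("warmup must be non-negative")
    measured = values[warmup:]
    if not measured:
        raise ValueError("no measured iterations left after warmup")
    return mean(measured)


def summarize(runs, warmup=0):
    """Build a metric dict from per-iteration records.

    `runs` is an assumed layout: one dict per iteration with the keys
    'flops', 'time', and 'passed'.
    """
    measured = runs[warmup:]
    return {
        "flops": aggregate_runs([r["flops"] for r in runs], warmup),
        "time": aggregate_runs([r["time"] for r in runs], warmup),
        # Report a pass only if every measured iteration passed.
        "tests_pass": int(all(r["passed"] for r in measured)),
    }


runs = [
    {"flops": 1.0e12, "time": 10.0, "passed": True},  # warmup iteration
    {"flops": 2.0e12, "time": 5.0, "passed": True},
    {"flops": 4.0e12, "time": 3.0, "passed": True},
]
summary = summarize(runs, warmup=1)
```

Dropping the warmup iterations before averaging keeps one-time start-up costs out of the reported numbers, which is the usual motivation for a warmup parameter.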
      
      Add benchmark registrations, parser tests, sample output fixtures, documentation, and recommended configurations for gpu-hpl and gpu-hpl-mxp.
      
      Update rocHPL and rocHPL-MxP third-party integration with build patches, install targets, and SuperBench run helper scripts.
      
      Also update gpu-hpcg metric naming to use flops instead of gflops, remove standalone domain/verification-style metrics from the documented metric surface, and refresh Hygon HPCG documentation/config references accordingly.
  2. 20 Apr, 2026 1 commit
  3. 27 Mar, 2026 1 commit
  4. 25 Mar, 2026 1 commit
  5. 19 Mar, 2026 1 commit
      Enhance DTK platform support and GPU detection · 1a57f2d6
      one authored
      - Added Platform.DTK in the microbenchmark framework.
      - Introduced new DTK hipblaslt benchmark class and corresponding tests.
      - Updated Dockerfile to include hipblaslt-bench and its permissions.
      - Registered DTK benchmarks in the benchmark registry for various performance tests.
      - Enhanced GPU detection logic to recognize HYGON GPUs.
      
      This update extends the benchmarking support to the DTK platform.
  6. 18 Jun, 2025 1 commit
      Benchmarks - Add GPU Stream Micro Benchmark (#697) · 4eddd50a
      WenqingLan1 authored
      Added the GPU Stream benchmark, which measures GPU memory bandwidth and
      efficiency for the double data type through the copy, scale, add, and
      triad memory operations.
      - Added documentation for `gpu-stream` covering its introduction,
      metrics, and descriptions.
      - Added unit tests for `gpu-stream`. Example output is in
      `superbenchmark/tests/data/gpu_stream.log`.
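For readers unfamiliar with the four operations named above, here is a minimal CPU-side sketch of the classic STREAM kernels in plain Python. It is not the `gpu-stream` implementation: the function name `stream_kernels`, the array size, and the scalar value are illustrative assumptions, and the bandwidth figures it produces say nothing about GPU performance. The kernel definitions and the bytes-moved accounting follow the standard STREAM convention with 8-byte doubles.

```python
import time


def stream_kernels(n=1000, scalar=3.0):
    """Run the four STREAM operations once and estimate bandwidth in bytes/s."""
    a = [1.0] * n
    b = [2.0] * n
    c = [0.0] * n
    elem = 8  # size of one double in bytes
    bandwidth = {}

    def timed(name, arrays_touched, kernel):
        start = time.perf_counter()
        out = kernel()
        # Guard against a zero reading on very coarse timers.
        elapsed = max(time.perf_counter() - start, 1e-12)
        bandwidth[name] = arrays_touched * n * elem / elapsed
        return out

    # copy:  c[i] = a[i]               (1 read + 1 write)
    c = timed("copy", 2, lambda: [x for x in a])
    # scale: b[i] = scalar * c[i]      (1 read + 1 write)
    b = timed("scale", 2, lambda: [scalar * x for x in c])
    # add:   c[i] = a[i] + b[i]        (2 reads + 1 write)
    c = timed("add", 3, lambda: [x + y for x, y in zip(a, b)])
    # triad: a[i] = b[i] + scalar*c[i] (2 reads + 1 write)
    a = timed("triad", 3, lambda: [y + scalar * z for y, z in zip(b, c)])
    return a, b, c, bandwidth


a, b, c, bandwidth = stream_kernels()
```

Copy and scale each touch two arrays per element while add and triad touch three, which is why the bytes-moved factor differs between the two pairs.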
  7. 22 Nov, 2024 1 commit
  8. 22 Nov, 2023 1 commit
  9. 06 Jul, 2023 1 commit
  10. 05 Jul, 2023 3 commits
  11. 24 Mar, 2023 1 commit
  12. 21 Mar, 2023 1 commit
  13. 13 Feb, 2023 1 commit
  14. 03 Jan, 2023 1 commit
  15. 16 Mar, 2022 1 commit
      Benchmarks: Add Feature - Add GPU-Burn as microbenchmark (#324) · ff51a3ce
      rafsalas19 authored
      **Description**
      Modifications adding GPU-Burn to SuperBench.
      - added third party submodule
      - modified Makefile to make gpu-burn binary
      - added/modified microbenchmarks to add gpu-burn python scripts
      - modified default and azure_ndv4 configs to add gpu-burn
  16. 13 Dec, 2021 1 commit
  17. 10 Dec, 2021 1 commit
  18. 12 Nov, 2021 1 commit
  19. 09 Nov, 2021 1 commit
  20. 30 Oct, 2021 1 commit
  21. 22 Oct, 2021 1 commit
  22. 12 Oct, 2021 1 commit
  23. 30 Aug, 2021 3 commits
  24. 27 Aug, 2021 1 commit
  25. 25 Aug, 2021 1 commit
  26. 26 Jul, 2021 1 commit
  27. 23 Jul, 2021 2 commits
  28. 13 Jul, 2021 1 commit
  29. 02 Jun, 2021 1 commit
  30. 01 Jun, 2021 1 commit
  31. 31 May, 2021 1 commit
  32. 19 May, 2021 1 commit
  33. 13 May, 2021 1 commit
  34. 14 Apr, 2021 1 commit
  35. 09 Apr, 2021 1 commit