Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
tsoc
superbenchmark
Commits
f076f38f
Commit
f076f38f
authored
Apr 18, 2026
by
one
Browse files
Docs: Simplify gpu-hpcg metric list
parent
338eb5a7
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
15 additions
and
39 deletions
+15
-39
docs/user-tutorial/benchmarks/micro-benchmarks.md
docs/user-tutorial/benchmarks/micro-benchmarks.md
+15
-39
No files found.
docs/user-tutorial/benchmarks/micro-benchmarks.md
View file @
f076f38f
...
...
@@ -200,45 +200,21 @@ When collecting multiple MPI scales or problem sizes, use separate benchmark sec
#### Metrics
rocHPCG reports operation-level metrics for
`final`
,
`ddot`
,
`waxpby`
,
`spmv`
,
`mg`
, and
`total`
.
| Name | Unit | Description |
|---------------------------------------|--------------------|------------------------------------------------------|
| gpu-hpcg/final_gflops | FLOPS (GFLOPS) | Final rocHPCG score. |
| gpu-hpcg/final_bandwidth | bandwidth (GB/s) | Final rocHPCG bandwidth. |
| gpu-hpcg/final_gflops_per_process | FLOPS (GFLOPS) | Final rocHPCG score per process. |
| gpu-hpcg/final_bandwidth_per_process | bandwidth (GB/s) | Final rocHPCG bandwidth per process. |
| gpu-hpcg/ddot_gflops | FLOPS (GFLOPS) | DDOT throughput. |
| gpu-hpcg/ddot_bandwidth | bandwidth (GB/s) | DDOT bandwidth. |
| gpu-hpcg/ddot_gflops_per_process | FLOPS (GFLOPS) | DDOT throughput per process. |
| gpu-hpcg/ddot_bandwidth_per_process | bandwidth (GB/s) | DDOT bandwidth per process. |
| gpu-hpcg/waxpby_gflops | FLOPS (GFLOPS) | WAXPBY throughput. |
| gpu-hpcg/waxpby_bandwidth | bandwidth (GB/s) | WAXPBY bandwidth. |
| gpu-hpcg/waxpby_gflops_per_process | FLOPS (GFLOPS) | WAXPBY throughput per process. |
| gpu-hpcg/waxpby_bandwidth_per_process | bandwidth (GB/s) | WAXPBY bandwidth per process. |
| gpu-hpcg/spmv_gflops | FLOPS (GFLOPS) | SpMV throughput. |
| gpu-hpcg/spmv_bandwidth | bandwidth (GB/s) | SpMV bandwidth. |
| gpu-hpcg/spmv_gflops_per_process | FLOPS (GFLOPS) | SpMV throughput per process. |
| gpu-hpcg/spmv_bandwidth_per_process | bandwidth (GB/s) | SpMV bandwidth per process. |
| gpu-hpcg/mg_gflops | FLOPS (GFLOPS) | MG throughput. |
| gpu-hpcg/mg_bandwidth | bandwidth (GB/s) | MG bandwidth. |
| gpu-hpcg/mg_gflops_per_process | FLOPS (GFLOPS) | MG throughput per process. |
| gpu-hpcg/mg_bandwidth_per_process | bandwidth (GB/s) | MG bandwidth per process. |
| gpu-hpcg/total_gflops | FLOPS (GFLOPS) | Aggregate rocHPCG throughput. |
| gpu-hpcg/total_bandwidth | bandwidth (GB/s) | Aggregate rocHPCG bandwidth. |
| gpu-hpcg/total_gflops_per_process | FLOPS (GFLOPS) | Aggregate rocHPCG throughput per process. |
| gpu-hpcg/total_bandwidth_per_process | bandwidth (GB/s) | Aggregate rocHPCG bandwidth per process. |
|-----------------------------------------------------|------------------|---------------------------------------------------------|
| gpu-hpcg/${operation}
\_
gflops | FLOPS (GFLOPS) | Throughput for the specified rocHPCG operation. |
| gpu-hpcg/${operation}
\_
bandwidth | bandwidth (GB/s) | Bandwidth for the specified rocHPCG operation. |
| gpu-hpcg/${operation}
\_
gflops_per_process | FLOPS (GFLOPS) | Per-process throughput for the specified operation. |
| gpu-hpcg/${operation}
\_
bandwidth_per_process | bandwidth (GB/s) | Per-process bandwidth for the specified operation. |
| gpu-hpcg/setup_time | time (s) | Setup phase duration. |
| gpu-hpcg/optimization_time | time (s) | Optimization phase duration. |
| gpu-hpcg/total_time | time (s) | Total runtime. |
| gpu-hpcg/is_valid | | Run validity inferred from rocHPCG invalid markers. |
| gpu-hpcg/local_domain_x | | Local domain size in x dimension. |
| gpu-hpcg/local_domain_y | | Local domain size in y dimension. |
| gpu-hpcg/local_domain_z | | Local domain size in z dimension. |
| gpu-hpcg/global_domain_x | | Global domain size in x dimension. |
| gpu-hpcg/global_domain_y | | Global domain size in y dimension. |
| gpu-hpcg/global_domain_z | | Global domain size in z dimension. |
| gpu-hpcg/process_domain_x | | Process topology in x dimension. |
| gpu-hpcg/process_domain_y | | Process topology in y dimension. |
| gpu-hpcg/process_domain_z | | Process topology in z dimension. |
| gpu-hpcg/local_domain_[x
\|
y
\|
z] | | Local domain size for each dimension. |
| gpu-hpcg/global_domain_[x
\|
y
\|
z] | | Global domain size for each dimension. |
| gpu-hpcg/process_domain_[x
\|
y
\|
z] | | Process topology for each dimension. |
### `cpu-stream`
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment