• WenqingLan1's avatar
    Benchmarks - Add GPU Stream Micro Benchmark (#697) · 4eddd50a
    WenqingLan1 authored
    Added GPU Stream benchmark - measures the GPU memory bandwidth and
    efficiency for double datatype through various memory operations
    including copy, scale, add, and triad.
    - added documentation for `gpu-stream` detailing its introduction,
    metrics, and descriptions.
    - added unit tests for `gpu-stream`. Example output is in
    `superbenchmark/tests/data/gpu_stream.log`.
    4eddd50a
test_gpu_stream.py 5.29 KB