Add kernel launch overhead benchmark and associated build scripts
- Introduce kernel_launch_overhead.cu to measure kernel launch latency, system throughput, CPU dispatch overhead, and GPU dispatch time. - Create Makefile for building the benchmark with support for nvcc and hipcc. - Add run-all.sh script to execute the benchmark with specified device settings.
Showing
Please register or sign in to comment