## Simple Benchmark

### Network Benchmark without batchnorm (TF32/F16) in Different GPUs

Basic (120k voxels): ```python -m spconv.benchmark bench_basic f16``` and ```python -m spconv.benchmark bench_basic tf32```

| GPUs                   | F16-Forward | F16-Backward | TF32-Forward | TF32-Backward |
| ---------------------- |:-----------:|-------------:|-------------:|--------------:|
| T4                     | 18.74       | 25.51        | N/A          | N/A           |
| RTX 3080 Laptop (150W) | 8.2         | 11.51        | 15.04        | 26.90         |
| A100                   | 13.02       | 12.43        | 12.35        | 14.93         |
| RTX 3090               | 11.84       | 11.84        | 13.23        | 15.79         |
| RTX A6000              | 11.11       | 8.97         | 12.30        | 12.79         |
| TESLA V100-32G         | 15.55       | 14.90        | N/A          | N/A           |
| TESLA V100-16G         | 10.61       | 13.91        | N/A          | N/A           |

Large (900k voxels): ```python -m spconv.benchmark bench_large f16``` and ```python -m spconv.benchmark bench_large tf32```

| GPUs                   | F16-Forward | F16-Backward | TF32-Forward | TF32-Backward |
| ---------------------- |:-----------:|-------------:|-------------:|--------------:|
| T4                     | 128.7       | 203.3        | N/A          | N/A           |
| RTX 3080 Laptop (150W) | 43.15       | 74.57        | 84.65        | 165.19        |
| A100                   | 19.85       | 31.24        | 29.58        | 55.63         |
| RTX 3090               | 27.83       | 40.45        | 44.51        | 73.17         |
| RTX A6000              | 28.62       | 39.86        | 45.43        | 74.11         |
| TESLA V100-32G         | 50.37       | 72.99        | N/A          | N/A           |
| TESLA V100-16G         | 38.65       | 61.47        | N/A          | N/A           |

**NOTE** When benchmarking a network on a laptop, close all applications except the terminal. Other applications consume GPU resources and make kernels run slower.
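
One way to confirm that nothing else is using the GPU before starting a benchmark is to query `nvidia-smi`. The script below is a minimal sketch, not part of spconv: the helper names (`parse_gpu_usage`, `query_gpu_usage`, `is_idle`) and the idle thresholds (5% utilization, 500 MiB) are illustrative assumptions, and it assumes `nvidia-smi` is on `PATH`.

```python
import shutil
import subprocess


def parse_gpu_usage(csv_text):
    """Parse 'utilization.gpu, memory.used' CSV rows into (util %, MiB) tuples.

    Rows that are not two integers (e.g. '[N/A]') are skipped.
    """
    rows = []
    for line in csv_text.strip().splitlines():
        fields = [field.strip() for field in line.split(",")]
        if len(fields) != 2:
            continue
        try:
            rows.append((int(fields[0]), int(fields[1])))
        except ValueError:
            continue
    return rows


def query_gpu_usage():
    """Return current per-GPU usage, or [] if nvidia-smi is unavailable or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=utilization.gpu,memory.used",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_gpu_usage(result.stdout)


def is_idle(rows, max_util=5, max_mem_mib=500):
    """Hypothetical idle check: thresholds are arbitrary and should be tuned."""
    return all(util <= max_util and mem <= max_mem_mib for util, mem in rows)


if __name__ == "__main__":
    usage = query_gpu_usage()
    if not usage:
        print("Could not read GPU usage (is nvidia-smi installed?).")
    elif is_idle(usage):
        print("GPU looks idle; benchmark numbers should be representative.")
    else:
        print("GPU is busy; close other applications before benchmarking:", usage)
```

Run it immediately before the `python -m spconv.benchmark ...` commands. On a laptop the desktop compositor usually holds some GPU memory, so the memory threshold may need to be raised.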