<!--
 Copyright 2021 Yan Yan
 
 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
 You may obtain a copy of the License at
 
     http://www.apache.org/licenses/LICENSE-2.0
 
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->

## Simple Benchmark

### Network Benchmark without BatchNorm (TF32/F16) on Different GPUs

Basic (120k voxels): ```python -m spconv.benchmark bench_basic f16``` and ```python -m spconv.benchmark bench_basic tf32```

| GPU | F16-Forward | F16-Backward | TF32-Forward | TF32-Backward |
| -------------- | ---------------------:| ---------------------:| ---------------------:| ---------------------:|
| T4 | 18.74 | 25.51 | N/A | N/A |
| RTX 3080 Laptop (150W) | 8.2 | 11.51 | 15.04 | 26.90 |
| A100 | 13.02 | 12.43 | 12.35 | 14.93 |
| RTX 3090 | 11.84 | 11.84 | 13.23 | 15.79 |
| RTX A6000 | 11.11 | 8.97 | 12.30 | 12.79 |
| TESLA V100-32G | 15.55 | 14.90 | N/A | N/A |
| TESLA V100-16G | 10.61 | 13.91 | N/A | N/A |
| RTX 4090 | 7.37 | 6.87 | 8.76 | 9.16 |
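
To reproduce a row of the Basic table, the two commands can be run back to back from a small driver script. The following is a minimal sketch, not part of spconv: `basic_benchmark_commands` and `run_basic` are hypothetical helper names, and it assumes spconv is installed for the interpreter that runs the script. On GPUs where the table lists TF32 as N/A, the `tf32` run may fail or be meaningless.

```python
import subprocess
import sys

# Precision arguments taken from the two Basic commands in this document.
PRECISIONS = ("f16", "tf32")


def basic_benchmark_commands(python=sys.executable):
    """Return one argv list per precision for the Basic benchmark.

    Hypothetical helper: it only assembles the command lines shown above.
    """
    return [
        [python, "-m", "spconv.benchmark", "bench_basic", precision]
        for precision in PRECISIONS
    ]


def run_basic():
    """Run the Basic benchmark for each precision, stopping on the first failure."""
    for cmd in basic_benchmark_commands():
        print("running:", " ".join(cmd))
        subprocess.run(cmd, check=True)
```

Calling `run_basic()` launches each benchmark as a separate process, so one precision's GPU state does not carry over into the next run.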

Large (900k voxels): ```python -m spconv.benchmark bench_large f16``` and ```python -m spconv.benchmark bench_large tf32```

| GPU | F16-Forward | F16-Backward | TF32-Forward | TF32-Backward |
| -------------- | ---------------------:| ---------------------:| ---------------------:| ---------------------:|
| T4 | 128.7 | 203.3 | N/A | N/A |
| RTX 3080 Laptop (150W) | 43.15 | 74.57 | 84.65 | 165.19 |
| A100 | 19.85 | 31.24 | 29.58 | 55.63 |
| RTX 3090 | 27.83 | 40.45 | 44.51 | 73.17 |
| RTX A6000 | 28.62 | 39.86 | 45.43 | 74.11 |
| TESLA V100-32G | 50.37 | 72.99 | N/A | N/A |
| TESLA V100-16G | 38.65 | 61.47 | N/A | N/A |
| RTX 4090 | 13.93 | 22.92 | 22.58 | 41.49 |
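
One way to read the Large table is as a ratio of TF32 time to F16 time per GPU. The sketch below copies the rows that have TF32 numbers and computes that ratio for the forward and backward pass. The dictionary name `LARGE` and the helper `tf32_over_f16` are illustrative and not part of spconv.

```python
# Large-benchmark timings copied from the table above.
# Columns: (F16 forward, F16 backward, TF32 forward, TF32 backward).
# GPUs listed as N/A for TF32 are left out.
LARGE = {
    "RTX 3080 Laptop (150W)": (43.15, 74.57, 84.65, 165.19),
    "A100": (19.85, 31.24, 29.58, 55.63),
    "RTX 3090": (27.83, 40.45, 44.51, 73.17),
    "RTX A6000": (28.62, 39.86, 45.43, 74.11),
    "RTX 4090": (13.93, 22.92, 22.58, 41.49),
}


def tf32_over_f16(row):
    """Return (forward, backward) ratios of TF32 time to F16 time for one row."""
    f16_fwd, f16_bwd, tf32_fwd, tf32_bwd = row
    return tf32_fwd / f16_fwd, tf32_bwd / f16_bwd


for gpu, row in LARGE.items():
    fwd, bwd = tf32_over_f16(row)
    print(f"{gpu}: forward {fwd:.2f}x, backward {bwd:.2f}x")
```

A ratio above 1 means the F16 run was faster than the TF32 run on that GPU for this workload.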


**NOTE**
When benchmarking a network on a laptop, close every application except the terminal first. Other applications consume GPU resources and make the kernels run slower.
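
A quick way to check whether the GPU is already busy before benchmarking is to query `nvidia-smi`. The sketch below is an assumption-laden example, not part of spconv: the helper names and the idle thresholds (`max_util`, `max_mem_mib`) are arbitrary choices for illustration, and it assumes `nvidia-smi` is on the `PATH` and reports numeric values for both fields.

```python
import subprocess


def parse_gpu_status(csv_text):
    """Parse 'utilization, memory' CSV lines into (util_percent, mem_mib) tuples."""
    status = []
    for line in csv_text.strip().splitlines():
        util, mem = (field.strip() for field in line.split(","))
        status.append((int(util), int(mem)))
    return status


def query_gpu_status():
    """Ask nvidia-smi for utilization (%) and used memory (MiB) of every GPU."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=utilization.gpu,memory.used",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_status(result.stdout)


def looks_idle(status, max_util=5, max_mem_mib=500):
    """Return True if every GPU is under both (illustrative) thresholds."""
    return all(util <= max_util and mem <= max_mem_mib for util, mem in status)
```

For example, `looks_idle(query_gpu_status())` returning `False` suggests that another application is still using the GPU and the timings may be inflated.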