README.md 9.51 KB
Newer Older
yan.yan's avatar
yan.yan committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
<!--
 Copyright 2021 Yan Yan
 
 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
 You may obtain a copy of the License at
 
     http://www.apache.org/licenses/LICENSE-2.0
 
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->
16
17
18
19
[pypi-ver-cpu]: https://img.shields.io/pypi/v/spconv
[pypi-ver-114]: https://img.shields.io/pypi/v/spconv-cu114
[pypi-ver-111]: https://img.shields.io/pypi/v/spconv-cu111
[pypi-ver-113]: https://img.shields.io/pypi/v/spconv-cu113
yan.yan's avatar
yan.yan committed
20
[pypi-ver-120]: https://img.shields.io/pypi/v/spconv-cu120
21
22
[pypi-ver-102]: https://img.shields.io/pypi/v/spconv-cu102

yan.yan's avatar
yan.yan committed
23
24
[pypi-url-102]: https://pypi.org/project/spconv-cu102/
[pypi-download-102]: https://img.shields.io/pypi/dm/spconv-cu102
25
26
27
28
29
30
[pypi-url-111]: https://pypi.org/project/spconv-cu111/
[pypi-download-111]: https://img.shields.io/pypi/dm/spconv-cu111
[pypi-url-113]: https://pypi.org/project/spconv-cu113/
[pypi-download-113]: https://img.shields.io/pypi/dm/spconv-cu113
[pypi-url-114]: https://pypi.org/project/spconv-cu114/
[pypi-download-114]: https://img.shields.io/pypi/dm/spconv-cu114
yan.yan's avatar
yan.yan committed
31
32
[pypi-url-120]: https://pypi.org/project/spconv-cu120/
[pypi-download-120]: https://img.shields.io/pypi/dm/spconv-cu120
33
34
[pypi-url-cpu]: https://pypi.org/project/spconv/
[pypi-download-cpu]: https://img.shields.io/pypi/dm/spconv
yan.yan's avatar
yan.yan committed
35

yan.yan's avatar
v2.1  
yan.yan committed
36
# SpConv: Spatially Sparse Convolution Library
37
38
[![Build Status](https://github.com/traveller59/spconv/workflows/build/badge.svg)](https://github.com/traveller59/spconv/actions?query=workflow%3Abuild) 

traveller59's avatar
traveller59 committed
39

40
|                | PyPI   | Install  |Downloads  |
yan.yan's avatar
yan.yan committed
41
42
| -------------- |:---------------------:| ---------------------:| ---------------------:| 
| CPU (Linux Only) | [![PyPI Version][pypi-ver-cpu]][pypi-url-cpu] | ```pip install spconv``` | [![pypi monthly download][pypi-download-cpu]][pypi-url-cpu] | 
yan.yan's avatar
yan.yan committed
43
| CUDA 10.2 | [![PyPI Version][pypi-ver-102]][pypi-url-102] | ```pip install spconv-cu102```| [![pypi monthly download][pypi-download-102]][pypi-url-102]| 
yan.yan's avatar
yan.yan committed
44
45
46
| CUDA 11.1 | [![PyPI Version][pypi-ver-111]][pypi-url-111] | ```pip install spconv-cu111```| [![pypi monthly download][pypi-download-111]][pypi-url-111]| 
| CUDA 11.3 (Linux Only) | [![PyPI Version][pypi-ver-113]][pypi-url-113] | ```pip install spconv-cu113```| [![pypi monthly download][pypi-download-113]][pypi-url-113]| 
| CUDA 11.4 | [![PyPI Version][pypi-ver-114]][pypi-url-114] | ```pip install spconv-cu114```| [![pypi monthly download][pypi-download-114]][pypi-url-114]|
yan.yan's avatar
yan.yan committed
47
<!-- | CUDA 12.0 | [![PyPI Version][pypi-ver-120]][pypi-url-120] | ```pip install spconv-cu120```| [![pypi monthly download][pypi-download-120]][pypi-url-120]| -->
tusimple's avatar
tusimple committed
48

49
```spconv``` is a project that provide heavily-optimized sparse convolution implementation with tensor core support. check [benchmark](docs/BENCHMARK.md) to see how fast spconv 2.x runs.
50

yan.yan's avatar
v2.1  
yan.yan committed
51
[Spconv 1.x code](https://github.com/traveller59/spconv/tree/v1.2.1). We won't provide any support for spconv 1.x since it's deprecated. use spconv 2.x if possible. <!--remove this message in spconv 2.2-->
FindDefinition's avatar
FindDefinition committed
52

yan.yan's avatar
yan.yan committed
53
54
Check [spconv 2.x algorithm introduction](docs/spconv2_algo.pdf) to understand sparse convolution algorithm in spconv 2.x!

yan.yan's avatar
yan.yan committed
55
## NEWS
FindDefinition's avatar
FindDefinition committed
56

yan.yan's avatar
yan.yan committed
57
* spconv 2.2: ampere feature support (by [EvernightAurora](https://github.com/EvernightAurora)), pure c++ code generation, nvrtc, drop python 3.6
FindDefinition's avatar
FindDefinition committed
58

yan.yan's avatar
v2.1  
yan.yan committed
59
## Spconv 2.1 vs Spconv 1.x
traveller59's avatar
traveller59 committed
60

yan.yan's avatar
yan.yan committed
61
* spconv now can be installed by **pip**. see install section in readme for more details. Users don't need to build manually anymore!
yan.yan's avatar
v2.1  
yan.yan committed
62
63
64
65
* Microsoft Windows support (only windows 10 has been tested).
* fp32 (not tf32) training/inference speed is increased (+50~80%)
* fp16 training/inference speed is greatly increased when your layer support tensor core (channel size must be multiple of 8).
* int8 op is ready, but we still need some time to figure out how to run int8 in pytorch.
66
* [doesn't depend on pytorch binary](docs/FAQ.md#What-does-no-dependency-on-pytorch-mean), but you may need at least pytorch >= 1.5.0 to run spconv 2.x.
yan.yan's avatar
yan.yan committed
67
* since spconv 2.x doesn't depend on pytorch binary (never in future), it's impossible to support torch.jit/libtorch inference.
traveller59's avatar
traveller59 committed
68

yan.yan's avatar
yan.yan committed
69
## Spconv 2.2 vs Spconv 2.1
70

yan.yan's avatar
yan.yan committed
71
* faster fp16 kernels (~5-30%) in ampere GPUs (tested in RTX 3090)
yan.yan's avatar
yan.yan committed
72
73
* greatly faster int8 kernels (~1.2x~2.7x) in ampere GPUs (tested in RTX 3090)
* no python 3.6 support
yan.yan's avatar
yan.yan committed
74
* nvrtc support: kernel in old GPUs will be compiled in runtime.
yan.yan's avatar
yan.yan committed
75
* [libspconv](docs/PURE_CPP_BUILD.md): pure c++ build of all spconv ops. see [example](example/libspconv/run_build.sh)
76

yan.yan's avatar
yan.yan committed
77
78
79
80
## Usage

Firstly you need to use ```import spconv.pytorch as spconv``` in spconv 2.x.

yan.yan's avatar
v2.1  
yan.yan committed
81
Then see [this](docs/USAGE.md).
yan.yan's avatar
yan.yan committed
82

yan.yan's avatar
v2.1  
yan.yan committed
83
Don't forget to check [performance guide](docs/PERFORMANCE_GUIDE.md).
traveller59's avatar
traveller59 committed
84

yan.yan's avatar
yan.yan committed
85
## Install
traveller59's avatar
traveller59 committed
86

yan.yan's avatar
yan.yan committed
87
You need to install python >= 3.7 first to use spconv 2.x.
traveller59's avatar
traveller59 committed
88

yan.yan's avatar
yan.yan committed
89
You need to install CUDA toolkit first before using prebuilt binaries or build from source.
traveller59's avatar
traveller59 committed
90

yan.yan's avatar
yan.yan committed
91
You need at least CUDA 11.0 to build and run spconv 2.x. We won't offer any support for CUDA < 11.0.
traveller59's avatar
traveller59 committed
92

yan.yan's avatar
yan.yan committed
93
### Prebuilt
traveller59's avatar
traveller59 committed
94

yan.yan's avatar
yan.yan committed
95
We offer python 3.7-3.11 and cuda 10.2/11.1/11.3/11.4/12.0 prebuilt binaries for linux (manylinux).
yan.yan's avatar
yan.yan committed
96

yan.yan's avatar
yan.yan committed
97
We offer python 3.7-3.11 and cuda 10.2/11.1/11.4/12.0 prebuilt binaries for windows 10/11.
traveller59's avatar
traveller59 committed
98

yan.yan's avatar
yan.yan committed
99
For Linux users, you need to install pip >= 20.3 first to install prebuilt.
traveller59's avatar
traveller59 committed
100

yan.yan's avatar
v2.1  
yan.yan committed
101
102
103
104
```pip install spconv``` for CPU only (**Linux Only**). you should only use this for debug usage, the performance isn't optimized due to manylinux limit (no omp support).

```pip install spconv-cu102``` for CUDA 10.2

yan.yan's avatar
yan.yan committed
105
```pip install spconv-cu111``` for CUDA 11.1
traveller59's avatar
traveller59 committed
106

yan.yan's avatar
yan.yan committed
107
```pip install spconv-cu113``` for CUDA 11.3 (**Linux Only**)
yan.yan's avatar
v2.1  
yan.yan committed
108

yan.yan's avatar
yan.yan committed
109
```pip install spconv-cu114``` for CUDA 11.4
traveller59's avatar
traveller59 committed
110

yan.yan's avatar
yan.yan committed
111
```pip install spconv-cu120``` for CUDA 12.0
112

yan.yan's avatar
yan.yan committed
113
**NOTE** It's safe to have different **minor** cuda version between system and conda (pytorch) in **CUDA >= 11.0** because of [CUDA Minor Version Compatibility](https://docs.nvidia.com/deploy/cuda-compatibility/#minor-version-compatibility). For example, you can use spconv-cu114 with anaconda version of pytorch cuda 11.1 in a OS with CUDA 11.2 installed.
yan.yan's avatar
v2.1  
yan.yan committed
114

yan.yan's avatar
yan.yan committed
115
**NOTE** In Linux, you can install spconv-cuxxx without install CUDA to system! only suitable NVIDIA driver is required. for CUDA 11, we need driver >= 450.82.
yan.yan's avatar
v2.1  
yan.yan committed
116

117
118
119
120
#### Prebuilt GPU Support Matrix

See [this page](https://arnon.dk/matching-sm-architectures-arch-and-gencode-for-various-nvidia-cards/) to check supported GPU names by arch.

yan.yan's avatar
yan.yan committed
121
122
If you use a GPU architecture that isn't compiled in prebuilt, spconv will use NVRTC to compile a slightly slower kernel.

123
124
125
| CUDA version | GPU Arch List  |
| -------------- |:---------------------:|
| 11.x       | 52,60,61,70,75,80,86     | 
yan.yan's avatar
yan.yan committed
126
| 12.x       | 70,75,80,86,90     | 
127

yan.yan's avatar
v2.1  
yan.yan committed
128
129
130
131
### Build from source for development (JIT, recommend)

The c++ code will be built automatically when you change c++ code in project.

132
For NVIDIA Embedded Platforms, you need to specify cuda arch before build: ```export CUMM_CUDA_ARCH_LIST="7.2"``` for xavier, ```export CUMM_CUDA_ARCH_LIST="6.2"``` for TX2, ```export CUMM_CUDA_ARCH_LIST="8.7"``` for orin.
yan.yan's avatar
v2.1  
yan.yan committed
133

134
135
You need to remove ```cumm``` in ```requires``` section in pyproject.toml after install editable ```cumm``` and before install spconv due to pyproject limit (can't find editable installed ```cumm```).

136
137
You need to ensure ```pip list | grep spconv``` and ```pip list | grep cumm``` show nothing before install editable spconv/cumm.

yan.yan's avatar
v2.1  
yan.yan committed
138
#### Linux
139

yan.yan's avatar
v2.1  
yan.yan committed
140
141
142
143
144
145
146
147
148
149
150
151
152
153
0. uninstall spconv and cumm installed by pip
1. install build-essential, install CUDA
2. ```git clone https://github.com/FindDefinition/cumm```, ```cd ./cumm```, ```pip install -e .```
3. ```git clone https://github.com/traveller59/spconv```, ```cd ./spconv```, ```pip install -e .```
4. in python, ```import spconv``` and wait for build finish.

#### Windows
0. uninstall spconv and cumm installed by pip
1. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
2. set [powershell script execution policy](https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_execution_policies?view=powershell-7.1)
3. start a new powershell, run ```tools/msvc_setup.ps1```
4. ```git clone https://github.com/FindDefinition/cumm```, ```cd ./cumm```, ```pip install -e .```
5. ```git clone https://github.com/traveller59/spconv```, ```cd ./spconv```, ```pip install -e .```
6. in python, ```import spconv``` and wait for build finish.
yan.yan's avatar
yan.yan committed
154

yan.yan's avatar
v2.1  
yan.yan committed
155
### Build wheel from source (not recommend, this is done in CI.)
traveller59's avatar
traveller59 committed
156

yan.yan's avatar
yan.yan committed
157
You need to rebuild ```cumm``` first if you are build along a CUDA version that not provided in prebuilts.
traveller59's avatar
traveller59 committed
158

yan.yan's avatar
yan.yan committed
159
#### Linux
traveller59's avatar
traveller59 committed
160

yan.yan's avatar
yan.yan committed
161
162
1. install build-essential, install CUDA
2. run ```export SPCONV_DISABLE_JIT="1"```
yan.yan's avatar
v2.1  
yan.yan committed
163
164
3. run ```pip install pccm cumm wheel```
4. run ```python setup.py bdist_wheel```+```pip install dists/xxx.whl```
traveller59's avatar
traveller59 committed
165

yan.yan's avatar
v2.1  
yan.yan committed
166
#### Windows
traveller59's avatar
traveller59 committed
167

yan.yan's avatar
v2.1  
yan.yan committed
168
1. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
yan.yan's avatar
yan.yan committed
169
170
171
2. set [powershell script execution policy](https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_execution_policies?view=powershell-7.1)
3. start a new powershell, run ```tools/msvc_setup.ps1```
4. run ```$Env:SPCONV_DISABLE_JIT = "1"```
yan.yan's avatar
v2.1  
yan.yan committed
172
173
5. run ```pip install pccm cumm wheel```
6. run ```python setup.py bdist_wheel```+```pip install dists/xxx.whl```
traveller59's avatar
traveller59 committed
174

yan.yan's avatar
yan.yan committed
175
## Contributers
176

yan.yan's avatar
yan.yan committed
177
* [EvernightAurora](https://github.com/EvernightAurora): add ampere feature.
yan.yan's avatar
yan.yan committed
178

yan.yan's avatar
yan.yan committed
179
## Note
traveller59's avatar
traveller59 committed
180

yan.yan's avatar
yan.yan committed
181
The work is done when the author is an employee at [Tusimple](https://www.tusimple.com/).
traveller59's avatar
traveller59 committed
182

yan.yan's avatar
yan.yan committed
183
## LICENSE
traveller59's avatar
traveller59 committed
184

FindDefinition's avatar
FindDefinition committed
185
Apache 2.0