README.md 8.81 KB
Newer Older
yan.yan's avatar
yan.yan committed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
<!--
 Copyright 2021 Yan Yan
 
 Licensed under the Apache License, Version 2.0 (the "License");
 you may not use this file except in compliance with the License.
 You may obtain a copy of the License at
 
     http://www.apache.org/licenses/LICENSE-2.0
 
 Unless required by applicable law or agreed to in writing, software
 distributed under the License is distributed on an "AS IS" BASIS,
 WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 See the License for the specific language governing permissions and
 limitations under the License.
-->
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
[pypi-ver-cpu]: https://img.shields.io/pypi/v/spconv
[pypi-ver-114]: https://img.shields.io/pypi/v/spconv-cu114
[pypi-ver-111]: https://img.shields.io/pypi/v/spconv-cu111
[pypi-ver-113]: https://img.shields.io/pypi/v/spconv-cu113
[pypi-ver-102]: https://img.shields.io/pypi/v/spconv-cu102

[pypi-url-111]: https://pypi.org/project/spconv-cu111/
[pypi-download-111]: https://img.shields.io/pypi/dm/spconv-cu111
[pypi-url-113]: https://pypi.org/project/spconv-cu113/
[pypi-download-113]: https://img.shields.io/pypi/dm/spconv-cu113
[pypi-url-102]: https://pypi.org/project/spconv-cu102/
[pypi-download-102]: https://img.shields.io/pypi/dm/spconv-cu102
[pypi-url-114]: https://pypi.org/project/spconv-cu114/
[pypi-download-114]: https://img.shields.io/pypi/dm/spconv-cu114
[pypi-url-cpu]: https://pypi.org/project/spconv/
[pypi-download-cpu]: https://img.shields.io/pypi/dm/spconv
yan.yan's avatar
yan.yan committed
32

yan.yan's avatar
v2.1  
yan.yan committed
33
# SpConv: Spatially Sparse Convolution Library
34
35
36
37
38
39
40
41
42
[![Build Status](https://github.com/traveller59/spconv/workflows/build/badge.svg)](https://github.com/traveller59/spconv/actions?query=workflow%3Abuild) 

|                | PyPi Version  | Downloads  |
| -------------- |:---------------------:| ---------------------:| 
| CPU (Linux Only) | [![PyPI Version][pypi-ver-cpu]][pypi-url-cpu] | [![pypi monthly download][pypi-download-cpu]][pypi-url-cpu] | 
| CUDA 10.2 | [![PyPI Version][pypi-ver-102]][pypi-url-102] | [![pypi monthly download][pypi-download-102]][pypi-url-102] | 
| CUDA 11.1 | [![PyPI Version][pypi-ver-111]][pypi-url-111] | [![pypi monthly download][pypi-download-111]][pypi-url-111]| 
| CUDA 11.3 (Linux Only) | [![PyPI Version][pypi-ver-113]][pypi-url-113] |[![pypi monthly download][pypi-download-113]][pypi-url-113]| 
| CUDA 11.4 | [![PyPI Version][pypi-ver-114]][pypi-url-114] | [![pypi monthly download][pypi-download-114]][pypi-url-114]| 
traveller59's avatar
traveller59 committed
43

tusimple's avatar
tusimple committed
44

45
```spconv``` is a project that provide heavily-optimized sparse convolution implementation with tensor core support. check [benchmark](docs/BENCHMARK.md) to see how fast spconv 2.x runs.
46

yan.yan's avatar
v2.1  
yan.yan committed
47
[Spconv 1.x code](https://github.com/traveller59/spconv/tree/v1.2.1). We won't provide any support for spconv 1.x since it's deprecated. use spconv 2.x if possible. <!--remove this message in spconv 2.2-->
FindDefinition's avatar
FindDefinition committed
48

49
50
**WARNING** spconv < 2.1.4 users need to upgrade your version to 2.1.4, it fix a serious bug in SparseInverseConvXd.

yan.yan's avatar
v2.1  
yan.yan committed
51
## Breaking changes in Spconv 2.x
FindDefinition's avatar
FindDefinition committed
52

yan.yan's avatar
v2.1  
yan.yan committed
53
Spconv 1.x users **NEED READ [THIS](docs/SPCONV_2_BREAKING_CHANGEs.md)** before using spconv 2.x.
FindDefinition's avatar
FindDefinition committed
54

yan.yan's avatar
v2.1  
yan.yan committed
55
## Spconv 2.1 vs Spconv 1.x
traveller59's avatar
traveller59 committed
56

yan.yan's avatar
yan.yan committed
57
* spconv now can be installed by **pip**. see install section in readme for more details. Users don't need to build manually anymore!
yan.yan's avatar
v2.1  
yan.yan committed
58
59
60
61
* Microsoft Windows support (only windows 10 has been tested).
* fp32 (not tf32) training/inference speed is increased (+50~80%)
* fp16 training/inference speed is greatly increased when your layer support tensor core (channel size must be multiple of 8).
* int8 op is ready, but we still need some time to figure out how to run int8 in pytorch.
yan.yan's avatar
yan.yan committed
62
63
* doesn't depend on pytorch binary. 
* since spconv 2.x doesn't depend on pytorch binary (never in future), it's impossible to support torch.jit/libtorch inference.
traveller59's avatar
traveller59 committed
64

yan.yan's avatar
v2.1  
yan.yan committed
65
Spconv 2.1 vs 1.x speed:
66
67
68
69
70
71

|                | 1080Ti Spconv 1.x F32 | 1080Ti Spconv 2.0 F32 | 3080M* Spconv 2.1 F16  |
| -------------- |:---------------------:| ---------------------:| ----------:|
| 27x128x128 Fwd | 11ms                  | 5.4ms                 | 1.4ms      |

\* 3080M (Laptop) ~= 3070 Desktop
yan.yan's avatar
yan.yan committed
72

yan.yan's avatar
v2.1  
yan.yan committed
73
74
75
76
77

<!--
TODO Spconv vs [MinkowskiEngine](https://github.com/NVIDIA/MinkowskiEngine) vs [torchsparse](https://github.com/mit-han-lab/torchsparse)
-->

yan.yan's avatar
yan.yan committed
78
79
80
81
## Usage

Firstly you need to use ```import spconv.pytorch as spconv``` in spconv 2.x.

yan.yan's avatar
v2.1  
yan.yan committed
82
Then see [this](docs/USAGE.md).
yan.yan's avatar
yan.yan committed
83

yan.yan's avatar
v2.1  
yan.yan committed
84
Don't forget to check [performance guide](docs/PERFORMANCE_GUIDE.md).
traveller59's avatar
traveller59 committed
85

yan.yan's avatar
yan.yan committed
86
## Install
traveller59's avatar
traveller59 committed
87

yan.yan's avatar
yan.yan committed
88
You need to install python >= 3.7 first to use spconv 2.x.
traveller59's avatar
traveller59 committed
89

yan.yan's avatar
yan.yan committed
90
You need to install CUDA toolkit first before using prebuilt binaries or build from source.
traveller59's avatar
traveller59 committed
91

yan.yan's avatar
yan.yan committed
92
You need at least CUDA 10.2 to build and run spconv 2.x. We won't offer any support for CUDA < 10.2.
traveller59's avatar
traveller59 committed
93

yan.yan's avatar
yan.yan committed
94
### Prebuilt
traveller59's avatar
traveller59 committed
95

yan.yan's avatar
v2.1  
yan.yan committed
96
We offer python 3.7-3.10 and cuda 10.2/11.1/11.3/11.4 prebuilt binaries for linux (manylinux) and windows 10/11.
traveller59's avatar
traveller59 committed
97

yan.yan's avatar
v2.1  
yan.yan committed
98
We will provide prebuilts for CUDA versions supported by latest pytorch release. For example, pytorch 1.10 provide cuda 10.2 and 11.3 prebuilts, so we provide them too.
traveller59's avatar
traveller59 committed
99

yan.yan's avatar
yan.yan committed
100
For Linux users, you need to install pip >= 20.3 first to install prebuilt.
traveller59's avatar
traveller59 committed
101

yan.yan's avatar
v2.1  
yan.yan committed
102
103
104
105
106
107
CUDA 11.1 will be removed in spconv 2.2 because pytorch 1.10 don't provide prebuilts for it.

```pip install spconv``` for CPU only (**Linux Only**). you should only use this for debug usage, the performance isn't optimized due to manylinux limit (no omp support).

```pip install spconv-cu102``` for CUDA 10.2

yan.yan's avatar
yan.yan committed
108
```pip install spconv-cu111``` for CUDA 11.1
traveller59's avatar
traveller59 committed
109

yan.yan's avatar
yan.yan committed
110
```pip install spconv-cu113``` for CUDA 11.3 (**Linux Only**)
yan.yan's avatar
v2.1  
yan.yan committed
111

yan.yan's avatar
yan.yan committed
112
```pip install spconv-cu114``` for CUDA 11.4
traveller59's avatar
traveller59 committed
113

yan.yan's avatar
v2.1  
yan.yan committed
114
115
**NOTE** It's safe to have different **minor** cuda version between system and conda (pytorch) **in Linux**. for example, you can use spconv-cu114 with anaconda version of pytorch cuda 11.1 in a OS with CUDA 11.2 installed.

yan.yan's avatar
yan.yan committed
116
**NOTE** In Linux, you can install spconv-cuxxx without install CUDA to system! only suitable NVIDIA driver is required. for CUDA 11, we need driver >= 450.82.
yan.yan's avatar
v2.1  
yan.yan committed
117
118
119
120
121
122
123

### Build from source for development (JIT, recommend)

The c++ code will be built automatically when you change c++ code in project.

For NVIDIA Embedded Platforms, you need to specify cuda arch before build: ```export CUMM_CUDA_ARCH_LIST="7.2"``` for xavier.

124
125
You need to remove ```cumm``` in ```requires``` section in pyproject.toml after install editable ```cumm``` and before install spconv due to pyproject limit (can't find editable installed ```cumm```).

yan.yan's avatar
v2.1  
yan.yan committed
126
#### Linux
127

yan.yan's avatar
v2.1  
yan.yan committed
128
129
130
131
132
133
134
135
136
137
138
139
140
141
0. uninstall spconv and cumm installed by pip
1. install build-essential, install CUDA
2. ```git clone https://github.com/FindDefinition/cumm```, ```cd ./cumm```, ```pip install -e .```
3. ```git clone https://github.com/traveller59/spconv```, ```cd ./spconv```, ```pip install -e .```
4. in python, ```import spconv``` and wait for build finish.

#### Windows
0. uninstall spconv and cumm installed by pip
1. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
2. set [powershell script execution policy](https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_execution_policies?view=powershell-7.1)
3. start a new powershell, run ```tools/msvc_setup.ps1```
4. ```git clone https://github.com/FindDefinition/cumm```, ```cd ./cumm```, ```pip install -e .```
5. ```git clone https://github.com/traveller59/spconv```, ```cd ./spconv```, ```pip install -e .```
6. in python, ```import spconv``` and wait for build finish.
yan.yan's avatar
yan.yan committed
142

yan.yan's avatar
v2.1  
yan.yan committed
143
### Build wheel from source (not recommend, this is done in CI.)
traveller59's avatar
traveller59 committed
144

yan.yan's avatar
yan.yan committed
145
You need to rebuild ```cumm``` first if you are build along a CUDA version that not provided in prebuilts.
traveller59's avatar
traveller59 committed
146

yan.yan's avatar
yan.yan committed
147
#### Linux
traveller59's avatar
traveller59 committed
148

yan.yan's avatar
yan.yan committed
149
150
1. install build-essential, install CUDA
2. run ```export SPCONV_DISABLE_JIT="1"```
yan.yan's avatar
v2.1  
yan.yan committed
151
152
3. run ```pip install pccm cumm wheel```
4. run ```python setup.py bdist_wheel```+```pip install dists/xxx.whl```
traveller59's avatar
traveller59 committed
153

yan.yan's avatar
v2.1  
yan.yan committed
154
#### Windows
traveller59's avatar
traveller59 committed
155

yan.yan's avatar
v2.1  
yan.yan committed
156
1. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
yan.yan's avatar
yan.yan committed
157
158
159
2. set [powershell script execution policy](https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_execution_policies?view=powershell-7.1)
3. start a new powershell, run ```tools/msvc_setup.ps1```
4. run ```$Env:SPCONV_DISABLE_JIT = "1"```
yan.yan's avatar
v2.1  
yan.yan committed
160
161
5. run ```pip install pccm cumm wheel```
6. run ```python setup.py bdist_wheel```+```pip install dists/xxx.whl```
traveller59's avatar
traveller59 committed
162

yan.yan's avatar
yan.yan committed
163
164
165
166
167
168
169
## Roadmap for Spconv 2.2-2.3: 
* TensorFormat32 support for faster fp32 training when you use NVIDIA Geforce RTX 30x0/Tesla A100/Quadro RTX Ax000 (2.2)
* change implicit gemm weight layout from KRSC to RSKC to make sure we can use native algorithm with implicit gemm weight. (2.2)
* documents (2.2)
* Ampere feature support (2.3)
* pytorch int8 inference, and QAT support (2.3)

yan.yan's avatar
yan.yan committed
170
171
172
173
174
175
176
## TODO in Spconv 2.x
- [ ] Ampere (A100 / RTX 3000 series) feature support (work in progress)
- [ ] torch QAT support (work in progress)
- [ ] TensorRT (torch.fx based)
- [ ] Build C++ only package
- [ ] JIT compilation for CUDA kernels
- [ ] Document (low priority)
yan.yan's avatar
yan.yan committed
177

yan.yan's avatar
yan.yan committed
178
## Note
traveller59's avatar
traveller59 committed
179

yan.yan's avatar
yan.yan committed
180
The work is done when the author is an employee at [Tusimple](https://www.tusimple.com/).
traveller59's avatar
traveller59 committed
181

yan.yan's avatar
yan.yan committed
182
## LICENSE
traveller59's avatar
traveller59 committed
183

FindDefinition's avatar
FindDefinition committed
184
Apache 2.0