# Changelog

## [2.1.11] - 2021-11-22
### Fixed
- Fixed a bug in Volta kernels (TITAN V, Tesla V100): backward weight kernels used f16 as the accumulator. They now use f32.
- Fixed a corner case when the user sets kernel size = 1x1 but stride != 1.
- Fixed a corner case in maxpool when the input feature is non-contiguous.
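
To illustrate why the accumulator precision in the first fix matters, here is a minimal sketch that emulates f16 and f32 rounding with Python's standard `struct` module. It is not the library's kernel code; `round_to` and `accumulate` are hypothetical helper names used only for this illustration.

```python
import struct


def round_to(value, fmt):
    """Round a Python float to IEEE half ('e') or single ('f') precision."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def accumulate(values, acc_fmt):
    """Sum f16 inputs, rounding the running total to acc_fmt after each add."""
    total = 0.0
    for v in values:
        total = round_to(total + round_to(v, "e"), acc_fmt)
    return total


grads = [1.0] * 4096
print(accumulate(grads, "e"))  # f16 accumulator: 2048.0
print(accumulate(grads, "f"))  # f32 accumulator: 4096.0
```

With an f16 accumulator the running sum stalls at 2048, because 2049 is not representable in half precision and rounds back to 2048. An f32 accumulator keeps the exact total.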

## [2.1.10] - 2021-11-19
### Fixed
- Fixed a bug in utils.PointToVoxel: CPU code should not get the CUDA stream.

## [2.1.9] - 2021-11-18
### Removed
- Remove a wrong assert

## [2.1.8] - 2021-11-15
### Added
- Add support for PyTorch 1.5

## [2.1.7] - 2021-11-11
### Fixed
- Fix a bug when a network contains inverse layers and runs inference in eval mode.

## [2.1.6] - 2021-11-10
### Fixed
- Fix missing -fopenmp linker flag in the CPU-only build
### Removed
- Remove stale comment sending in CI

## [2.1.5] - 2021-11-10
### Added
- Add a CUDA profile tool
- Add Python 3.6 support
### Changed
- Format all code
### Removed
- Remove an unnecessary device sync, which slightly improves performance.

## [2.1.4] - 2021-11-10
### Fixed
- Fix a bug in SparseInverseConv3d

## [2.1.3] - 2021-11-08
### Fixed
- Fix a bug in the CPU-only package

## [2.1.2] - 2021-11-06
### Fixed
- Fix a bug with Python 3.7

## [2.1.0] - 2021-10-31
### Added
- Add an implicit GEMM algorithm for all kinds of convolution with kernel volume <= 32. This algorithm is very fast with float16.
- Add a PyTorch wrapper for the voxel generator.
- Add CPU support and a CPU-only build.
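
As a rough guide to which kernels fall under the implicit GEMM limit above, the sketch below assumes "kernel volume" means the product of the kernel sizes along each dimension. The function names are hypothetical and are not part of the spconv API.

```python
from math import prod


def kernel_volume(kernel_size):
    """Number of kernel offsets, e.g. 27 for a 3x3x3 kernel (assumed definition)."""
    return prod(kernel_size)


def fits_implicit_gemm(kernel_size, limit=32):
    """True if the kernel volume is within the limit quoted in this changelog."""
    return kernel_volume(kernel_size) <= limit


print(fits_implicit_gemm((3, 3, 3)))  # True: volume 27
print(fits_implicit_gemm((5, 5, 5)))  # False: volume 125
```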

## [2.0.2] - 2021-10-26
### Fixed
- Fix a serious bug where SparseSequential did nothing with non-spconv layers
- Fix a bug in ProxyableClassMeta

## [2.0.0] - 2021-10-16
### Changed
- Change build system from cmake to pccm.
- Move PyTorch Python code to spconv.pytorch.
- Rewrite all C++ code.

## [1.2.1] - 2020-06-04
### Changed
- Subm indice pair generation is much faster thanks to two tricks:
  1. Most subm convs use only kernelsize=3, so the loops can be unrolled for a 100% performance increase.
  2. Subm indice pairs have the property indicePairs[0, i] = indicePairs[1, kernelVolume - i - 1], so only half of the pairs need to be generated, giving another 100% performance increase.
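
The symmetry property in this entry can be checked with a small pure-Python sketch. It is not the library's implementation: the function names and the `(input index, output index)` pair layout are assumptions made for illustration, and the kernel size is assumed to be odd so that the offsets are symmetric around zero.

```python
from itertools import product


def kernel_offsets(ndim, ksize=3):
    """Kernel offsets in lexicographic order; reversing the list negates each offset."""
    half = ksize // 2
    return list(product(range(-half, half + 1), repeat=ndim))


def subm_pairs_bruteforce(coords, ksize=3):
    """For every kernel offset, list (input index, output index) pairs."""
    index_of = {c: i for i, c in enumerate(coords)}
    pairs = []
    for off in kernel_offsets(len(coords[0]), ksize):
        cur = []
        for out_idx, c in enumerate(coords):
            neighbor = tuple(x + d for x, d in zip(c, off))
            in_idx = index_of.get(neighbor)
            if in_idx is not None:
                cur.append((in_idx, out_idx))
        pairs.append(cur)
    return pairs


def subm_pairs_symmetric(coords, ksize=3):
    """Compute only the first half of the offsets and mirror the rest."""
    index_of = {c: i for i, c in enumerate(coords)}
    offsets = kernel_offsets(len(coords[0]), ksize)
    volume = len(offsets)
    pairs = [None] * volume
    for i in range(volume // 2 + 1):
        cur = []
        for out_idx, c in enumerate(coords):
            neighbor = tuple(x + d for x, d in zip(c, offsets[i]))
            in_idx = index_of.get(neighbor)
            if in_idx is not None:
                cur.append((in_idx, out_idx))
        pairs[i] = cur
        # Mirrored offset: inputs and outputs swap roles.
        pairs[volume - 1 - i] = [(o, n) for (n, o) in cur]
    return pairs


coords = [(0, 0), (0, 1), (1, 1), (3, 3)]
full = subm_pairs_bruteforce(coords)
mirrored = subm_pairs_symmetric(coords)
print(all(sorted(a) == sorted(b) for a, b in zip(full, mirrored)))  # True
```

The mirrored version visits roughly half of the kernel offsets, which matches the direction of the speedup described above, although the actual gain depends on the real kernel code.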


## [1.2.0] - 2020-05-28
### Added
- Add batch GEMM support. It gives a small performance increase at the cost of more GPU memory usage. Pass algo=spconv.ConvAlgo.Batch to use it.

### Changed
- Replace most 'functor' usage with C++14 dispatch in C++ code.

### Fixed
- Change gather/scatterAdd kernel parameters to support large numbers of points.