Commits · 837335f8014e909749f8ea15f25dde7ec39103aa · OpenDAS / torch-harmonics

17 Dec, 2024 1 commit
- bumping version number and modified changelog (#61) · 837335f8
  Boris Bonev authored Dec 17, 2024
  
  837335f8
13 Dec, 2024 1 commit

Thorsten Kurth authored Dec 13, 2024

This small fix removes the maximum version number requirement of torch harmonics

8fae572c

12 Dec, 2024 1 commit
- changing default grid to equiangular in all SHT routines for consistency (#59) · 1063926f
  Boris Bonev authored Dec 12, 2024
  
  1063926f
01 Oct, 2024 1 commit
- safeguarding all custom CUDA and C++ routines via the _cuda_extension… (#54) · 663bea1f
  Boris Bonev authored Oct 01, 2024
```
* safeguarding all custom CUDA and C++ routines via the _cuda_extension_available flag

* bumping up version number
```
  663bea1f
20 Sep, 2024 1 commit

Bbonev/gradient test fix (#53) · 4fea88bf

Boris Bonev authored Sep 20, 2024

* added analytic gradients to the gradient_analysis notebook

* fixing sht unittest to not check the roundtrip gradient but sht and isht individually

* Updated changelog

4fea88bf

09 Sep, 2024 1 commit

Changing channel dimension of distributed SHT from 1 to -3 (#52) · 60b3b5a2

Boris Bonev authored Sep 09, 2024

* changing distributed SHT to use dim=-3 as the channel dimension for distributed transpose

* Fixed formatting in tests

* Adding bash script for running test suite

* Replaced asserts regarding number of dimensions in tensor with checks

60b3b5a2
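
To see why indexing the channel axis from the end can matter, the sketch below compares `dim=1` and `dim=-3` on plain shape tuples. It also shows an explicit dimension check that raises an error instead of relying on `assert`. The helper `channel_size` and the example shapes are assumptions made for illustration, not code from the repository.

```python
def channel_size(shape, channel_dim=-3):
    """Return the size of the channel axis for a (..., C, H, W) shape."""
    # Explicit check instead of `assert`, so it still runs under `python -O`.
    if len(shape) < 3:
        raise ValueError(f"expected at least 3 dimensions, got {len(shape)}")
    return shape[channel_dim]


shape_4d = (8, 16, 64, 128)      # (batch, channel, lat, lon)
shape_5d = (8, 4, 16, 64, 128)   # (batch, time, channel, lat, lon)

# For 4D input both conventions agree.
same_4d = channel_size(shape_4d, 1) == channel_size(shape_4d, -3)

# With an extra leading axis, only the negative index still finds the channels.
dim1_5d = channel_size(shape_5d, 1)
dimm3_5d = channel_size(shape_5d, -3)
```
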

29 Aug, 2024 3 commits
- Fixing authors in pyproject.toml (#51) · 4a6ed467
  Boris Bonev authored Aug 29, 2024
  
  4a6ed467
- adding routines for cleaning up distributed process groups (#50) · b2ce5906
  Thorsten Kurth authored Aug 29, 2024
  
  b2ce5906
- Bbonev/resampling (#49) · 24fcb06e
  Boris Bonev authored Aug 29, 2024
```
* Adding spherical upsampling routine and example notebook for resampling
```
  24fcb06e
28 Aug, 2024 1 commit

fixing amp stuff again (#48) · f1a965bd

Thorsten Kurth authored Aug 28, 2024

- doing manual type conversion in custom autograd
- fixing stride issues in backward pass by making some output tensors contiguous

f1a965bd

27 Aug, 2024 5 commits

AMP hotfix (#47) · 5d7e9b06
Boris Bonev authored Aug 27, 2024
```
* AMP hotfix

* Bumping up version to 0.7.1
```
5d7e9b06

Reworked setup system using both pyproject.toml and setup.py. Reworked CI (#46) · 1bfda531

Boris Bonev authored Aug 27, 2024

* reworked the setup system to use pyproject.toml

* changed the way custom CUDA extension build flag is passed

* reworked the CI

* changed to source distribution only in the CI

1bfda531

Bbonev/ci deploy fix (#45) · ef5bf03f

Boris Bonev authored Aug 27, 2024

* reverting deploy script

* moving setuptools install

* removing requirements

* bumping up version

* bumping up the torch-harmonics version to 0.7.0

ef5bf03f

removing some redundant code (#44) · 418d9576
Thorsten Kurth authored Aug 27, 2024

418d9576

Tkurth/distributed memory reduction (#43) · 78365cb9

Thorsten Kurth authored Aug 27, 2024

* updating amp to new torch.amp
* using amp autocasts to FP32 for disco convolution kernels
* implemented reduce_scatter routines but disabled those because of memory fluctuations which can cause OOM on big networks

78365cb9

22 Aug, 2024 1 commit

Bbonev/disco fused quadrature (#42) · 77a64b2c

Boris Bonev authored Aug 22, 2024

* missed one instance of python3

* fused multiplication of quadrature into multiplication of the Psi tensor

* Added quadrature change to changelog

* removing persistent quad weights tensor as it isn't needed anymore

* added pretty-printing for convolution modules

* adjusting default value in convolution test

77a64b2c
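
Fusing the quadrature weights into the Psi tensor relies on the weights multiplying each input point independently, so they can be folded into Psi once instead of being applied to the input on every call. The toy example below is a sketch with made-up values and a dense list-of-lists in place of the real sparse tensor; the names `psi`, `quad_weights`, and `contract` are illustrative only.

```python
# Toy sizes: 2 output points, 3 input points. All values are made up.
psi = [[1.0, 0.0, 2.0],
       [0.0, 3.0, 1.0]]
quad_weights = [0.5, 0.25, 0.25]
x = [4.0, 8.0, 12.0]


def contract(matrix, vector):
    """Dense matrix-vector product written out with plain lists."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


# Unfused: scale the input by the quadrature weights on every call.
unfused = contract(psi, [w * v for w, v in zip(quad_weights, x)])

# Fused: fold the weights into psi once, then contract with the raw input.
psi_fused = [[p * w for p, w in zip(row, quad_weights)] for row in psi]
fused = contract(psi_fused, x)
```

Once the weights live inside `psi_fused`, a separate weights buffer is no longer needed at call time, which matches the removal of the persistent quadrature weights tensor mentioned above.
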

21 Aug, 2024 1 commit

Tkurth/distributed memory reduction (#41) · ab0e66bc

Thorsten Kurth authored Aug 21, 2024

* removing unnecessary contiguous statements

* replacing reduce scatter with independent reduce and scatter

ab0e66bc
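
A reduce-scatter and a reduce followed by a scatter leave each rank with the same data. The single-process simulation below is a sketch of that equivalence only: ranks are modelled as Python lists, the helper names are invented for the example, and nothing here models real communication or memory use.

```python
def reduce_sum(per_rank):
    """Element-wise sum over ranks (the 'reduce' step)."""
    return [sum(vals) for vals in zip(*per_rank)]


def scatter(data, world_size):
    """Split data into equal contiguous chunks, one per rank."""
    chunk = len(data) // world_size
    return [data[r * chunk:(r + 1) * chunk] for r in range(world_size)]


def reduce_scatter(per_rank):
    """Fused variant: rank r ends up with the sum of everyone's r-th chunk."""
    world_size = len(per_rank)
    chunks = [scatter(buf, world_size) for buf in per_rank]
    return [reduce_sum([chunks[src][r] for src in range(world_size)])
            for r in range(world_size)]


# Two simulated ranks, each holding a buffer of four values.
per_rank = [[1, 2, 3, 4], [10, 20, 30, 40]]
two_step = scatter(reduce_sum(per_rank), len(per_rank))
fused = reduce_scatter(per_rank)
```
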

20 Aug, 2024 1 commit
- adding reduce_scatter (#40) · 89fb38df
  Thorsten Kurth authored Aug 20, 2024
  
  89fb38df
19 Aug, 2024 10 commits

more explicit · 3a3480b8
Boris Bonev authored Aug 19, 2024

3a3480b8
making python3 explicit in workflow · b2b1b683
Boris Bonev authored Aug 19, 2024

b2b1b683
changing deploy workflow · 8f054dcb
Boris Bonev authored Aug 19, 2024

8f054dcb
debugging workflow run · 74d2a27f
Boris Bonev authored Aug 19, 2024

74d2a27f
making pypi workflow dispatchable · abf2beda
Boris Bonev authored Aug 19, 2024

abf2beda
trying different python version · ab1285ae
Boris Bonev authored Aug 19, 2024

ab1285ae
making sure python3 is used · 20563f6e
Boris Bonev authored Aug 19, 2024

20563f6e
updating the pip install argument · 0f408ee7
Boris Bonev authored Aug 19, 2024

0f408ee7
updating github workflow · 1121877d
Boris Bonev authored Aug 19, 2024

1121877d

Tkurth/cuda disco (#38) · 29e7fb68

Boris Bonev authored Aug 19, 2024



* adding cuda kernels for disco conv

* making psi_idx an attribute

* adding license headers

* adding author files

* reorganizing files

* draft implementation

* added conditional installation to setup.py

* formatting changes

* removing triton kernel in DISCO convolution

* updated github actions

* updated Readme and changelog

* adding another guard for the cuda installation

* renaming the cuda extension

* simplifying setup.py

* minor bugfix

* Bbonev/cuda disco cleanup (#32)

* cleanup of disco convolutions based on CUDA extension

* fixing unittest

* changing version to experimental 0.7.0a

* initial rewrite of the distributed convolution with CUDA

* fixing streams

* need to fix install options

* fixing streams

* undid setup.py changes

* reset setup.py

* including CUDAStream

* adjusted the precomputation of theta_cutoff. If you rely on this, your models will not be backwards-compatible.

* adjusting theta_cutoff in the unittest

* adding newly refactored kernels for faster compile

* Tkurth/cuda disco distributed fix (#34)

* attempt to make disco distributed

* working distributed convolutions

* fixing distributed conv

* working distributed disco

* removing irrelevant extra argument

* using stream functions from at instead of c10

* using stream functions from at instead of c10, small fix

* Bbonev/disc even filters (#35)

* initial working commit with new convention of counting collocation points across the diameter instead of across the radius

* fixed a bug in the computation of the even kernels

* changing heuristic for computing theta_cutoff

* Fixing unittest

* Readability improvements

* reworked normalization of filter basis functions

* implemented discrete normalization of disco filters

* relaxing tolerances in convolution unit test

* bugfix to correctly support unequal scale factors in latitudes and longitudes

* hotfix to a bug in the imports

* Bbonev/distributed disco refactor (#37)

* cleaned up normalization code in convolution

* formatting changes in distributed convolution

* Fixing default theta_cutoff to be the same in distributed and local case

* fixed distributed convolution to support the same normalization as non-distributed one

* readability improvements

* fixed initial scale of convolution parameter weights and fixed naming of the normalization routine

* Updated Readme.md

* added comment in Dockerfile regarding older architectures

---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>
Co-authored-by: Boris Bonev <bbonev@nvidia.com>

29e7fb68

05 Feb, 2024 1 commit

Tkurth/distributed disco (#30) · 214fa40a

Thorsten Kurth authored Feb 05, 2024



* initial implementation of distributed DISCO layer

* working distributed convolution

* working refactored serial conv transpose with torch kernel

* working distributed conv and transposed conv when using the python kernel

* working distributed convolution with torch kernel

* fixed triton kernel tests

* adding print statement to debug CI

* adjusting tolerances in local convolution unittest

---------
Co-authored-by: Boris Bonev <bbonev@nvidia.com>

214fa40a

30 Jan, 2024 1 commit
- Bbonev/disco refactor (#29) · 54502a17
  Boris Bonev authored Jan 30, 2024
```
* Cleaned up DISCO convolutions
```
  54502a17
22 Dec, 2023 1 commit

Bbonev/disco refactor (#28) · c971d458

Boris Bonev authored Dec 22, 2023

Changed the code to only implicitly use sparse tensors in the modules, in order to enable compatibility with DDP

c971d458

20 Dec, 2023 1 commit

Bbonev/disco refactor (#27) · 942aa4ea

Boris Bonev authored Dec 20, 2023

* Moved convolutions and exposed them directly

* Added transposition to the unit test

* Minor bugfix in CPU version of DISCO transpose code

* Adding convolution tests to CI

* Added gradient check

* Checking the weight grad as well

* Added test for anisotropic kernels

942aa4ea

15 Dec, 2023 1 commit
- adjusting initialization · 1e5f7a2f
  Boris Bonev authored Dec 16, 2023
  
  1e5f7a2f
14 Dec, 2023 2 commits

Bbonev/coverage report (#26) · 9577cc8f

Boris Bonev authored Dec 14, 2023

* adding github action for coverage report

* removing github action

* report coverage in terminal

* restricting to files directly inside torch-harmonics

* adding exclusion list for coverage

* instructing pytest-cov to use coveragerc

* instructing pytest-cov to use coveragerc

* changing exclusion list in coveragerc

9577cc8f

Bbonev/discrete continuous convolutions (#25) · a0227d4c

Boris Bonev authored Dec 14, 2023



* Fixing the precomputation of the Psi matrix

* moving tests to a parent folder

* removing tqdm

* deleting deprecated distributed tests

* Moving distributed test

* adapting workflow

* Added some more comments to make the code more understandable

* more detailed explanation for the derivation of the rotation angles

* added another comment

---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>

a0227d4c

07 Dec, 2023 2 commits

Bbonev/discrete continuous convolutions (#24) · ad927429

Boris Bonev authored Dec 07, 2023



* fixing banding issues with corrected computation of kernel size
* fixing Parameters
* changed normalization of isotropic kernels

---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>

ad927429

Tkurth/flexible sharding (#22) · eab72f04

Thorsten Kurth authored Dec 07, 2023

* working distributed fwd SHT with flexible sharding and without padding

* fixing distributed sht with new logic

* fixed distributed SHT and ISHT with flexible padding

* working unittest for distributed fwd SHT

* distributed tests converted to unit tests

* bumping version number up to 0.6.4

* fixing small splitting bug in th distributed

* updated changelog, removed tests folder

eab72f04
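
Sharding without padding means the per-rank chunks may differ in size when the grid dimension is not divisible by the number of ranks. The sketch below shows one possible splitting rule, where chunk sizes differ by at most one, next to a padded split for comparison. The commit message does not spell out the rule actually used, so both helper functions and the example size of 361 are assumptions.

```python
def split_sizes(size, num_chunks):
    """Chunk sizes that sum to `size` and differ by at most one."""
    base, extra = divmod(size, num_chunks)
    return [base + 1 if i < extra else base for i in range(num_chunks)]


def padded_sizes(size, num_chunks):
    """For comparison: pad `size` up to a multiple of `num_chunks`."""
    chunk = -(-size // num_chunks)  # ceiling division
    return [chunk] * num_chunks


# 361 points split over 4 ranks.
flexible = split_sizes(361, 4)
padded = padded_sizes(361, 4)
```

In this example the flexible split covers exactly 361 points, while the padded split allocates 364 and would need three padding entries.
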

05 Dec, 2023 1 commit

Bbonev/discrete continuous convolutions (#23) · 31a33579

Boris Bonev authored Dec 05, 2023



* Adding prototype implementation of disco convolutions

* adding s2convolutions.py

* Somewhat functional first implementation in Triton

* Somewhat functional first implementation in Triton

* Made batched execution work on GPU and refactored precomputation of kernel values and supports.

* Adding the Lobatto grid to the grid selection method in quadrature

* fixing bug in triton kernel when iterating over non-zeros

* Updating with working custom triton implementation of DISCO convolution

* Merged both forward and backward DISCO contraction into a single kernel

* Some cleanup and minor bugfix

* bugfixes to the reference implementation

* adding weights to s2conv

* removing unnecessary imports

* suggestion for torch harmonics, math should be checked thoroughly

* Intermediate working reference implementation for the transpose DISCO convolution. Fixed normalization of kernels

* adjusting cutoff frequency in disco convolution

* fixing transpose DISCO contraction

* moving triton to install_requires

* enabling triton by default

---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>

31a33579

02 Dec, 2023 1 commit

Bbonev/discrete continuous convolutions (#21) · acbbf8f7

Boris Bonev authored Dec 02, 2023



* Adding prototype implementation of disco convolutions

* adding s2convolutions.py

* Somewhat functional first implementation in Triton

* Somewhat functional first implementation in Triton

* Made batched execution work on GPU and refactored precomputation of kernel values and supports.

* Adding the Lobatto grid to the grid selection method in quadrature

* fixing bug in triton kernel when iterating over non-zeros

* Updating with working custom triton implementation of DISCO convolution

* Merged both forward and backward DISCO contraction into a single kernel

* Some cleanup and minor bugfix

* bugfixes to the reference implementation

* adding weights to s2conv

* removing unnecessary imports

* suggestion for torch harmonics, math should be checked thoroughly

* Intermediate working reference implementation for the transpose DISCO convolution. Fixed normalization of kernels

* adjusting cutoff frequency in disco convolution

* fixing transpose DISCO contraction

* moving triton to install_requires

---------
Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>

acbbf8f7

23 Nov, 2023 1 commit

Update README.md (#20) · 13aa4927

Thorsten Kurth authored Nov 23, 2023

* Update README.md

adding autocast remark to readme

* Update README.md

small typo fix

13aa4927