- 05 Feb, 2024 1 commit
-
-
Thorsten Kurth authored
* initial implementation of distributed DISCO layer * working distributed convolution * working refactored serial conv transpose with torch kernel * working distributed conv and transposed conv when using the python kernel * working distributed convolution with torch kernel * fixed triton kernel tests * adding print statement to debug CI * adjusting tolerances in local convolution unittest --------- Co-authored-by:Boris Bonev <bbonev@nvidia.com>
-
- 30 Jan, 2024 1 commit
-
-
Boris Bonev authored
* Cleaned up DISCO convolutions
-
- 22 Dec, 2023 1 commit
-
-
Boris Bonev authored
Changed the code to only implicitly use sparse tensors in the modules, in order to enable compatibility with DDP
-
- 20 Dec, 2023 1 commit
-
-
Boris Bonev authored
* Moved convolutions and exposed them directly * Added transposition to the unit test * Minor bugfix in CPU version of DISCO transpose code * Adding convolution tests to CI * Added gradient check * Checking the weight grad as well * Added test for anisotropic kernels
-
- 15 Dec, 2023 1 commit
-
-
Boris Bonev authored
-
- 14 Dec, 2023 1 commit
-
-
Boris Bonev authored
* Fixing the precomputation of the Psi matrix * moving tests to a parent folder * removing tqdm * deleting deprecated distributed tests * Moving distributed test * adapting workfolow * Added some more comments to make the code more understandable * more detailed explanation for the derivation of the rotation angles * added another comment --------- Co-authored-by:Thorsten Kurth <tkurth@nvidia.com>
-
- 07 Dec, 2023 2 commits
-
-
Boris Bonev authored
* fixing banding issues with corrected computation of kernel size * fixing Parameters * changed normalization of isotropic kernels --------- Co-authored-by:Thorsten Kurth <tkurth@nvidia.com>
-
Thorsten Kurth authored
* working distributed fwd SHT with flexible sharding and without padding * fixing distributed sht with new logic * fixed distributed SHT and ISHT with flexible padding * working unittest for distributed fwd SHT * distributed tests converted to unit tests * bumping version number up to 0.6.4 * fixing small splitting bug in th distributed * updated changeloc, removed tests folder
-
- 05 Dec, 2023 1 commit
-
-
Boris Bonev authored
* Adding prototype implementation of disco convolutions * adding s2convolutions.py * Somewhat functional first implementation in Triton * Somewhat functional first implementation in Triton * Made batched execution work on GPU and refactored precomputation of kernel values and supports. * Adding the Lobatto grid to the grid selection method in quadrature * fixing bug in triton kernel when iterating over non-zeros * Updating with working custom triton implementation of DISCO convolution * Merged both forward and backward DISCO contraction into a single kernel * Some cleanup and minor bugfix * bugfixes to the reference implementation * adding weights to s2conv * removing unnecessary imports * suggestion for torch harmonics, math should be checked thoroughly * Intermediate working reference implementation for the transpose DISCO convolution. Fixed normalization of kernels * adjusting cutoff frequency in disco convolution * fixing transpose DISCO contraction * moving triton to install_requires * enabling triton by default --------- Co-authored-by:Thorsten Kurth <tkurth@nvidia.com>
-
- 02 Dec, 2023 1 commit
-
-
Boris Bonev authored
* Adding prototype implementation of disco convolutions * adding s2convolutions.py * Somewhat functional first implementation in Triton * Somewhat functional first implementation in Triton * Made batched execution work on GPU and refactored precomputation of kernel values and supports. * Adding the Lobatto grid to the grid selection method in quadrature * fixing bug in triton kernel when iterating over non-zeros * Updating with working custom triton implementation of DISCO convolution * Merged both forward and backward DISCO contraction into a single kernel * Some cleanup and minor bugfix * bugfixes to the reference implementation * adding weights to s2conv * removing unnecessary imports * suggestion for torch harmonics, math should be checked thoroughly * Intermediate working reference implementation for the transpose DISCO convolution. Fixed normalization of kernels * adjusting cutoff frequency in disco convolution * fixing transpose DISCO contraction * moving triton to install_requires --------- Co-authored-by:Thorsten Kurth <tkurth@nvidia.com>
-
- 13 Nov, 2023 1 commit
-
-
Boris Bonev authored
Major Cleanup of SFNO. Retiring non-linear architecture and fixing initialization. Adding scripts for training and validation.
-
- 23 Oct, 2023 1 commit
-
-
Boris Bonev authored
* Updated Changelog * improved readability of legendre.py * bugfix in computation of dlegpoly * Moving conversion from numpy to torch from the legendre module to the sht module * renaming to vdm in tests.py
-
- 18 Oct, 2023 1 commit
-
-
Thorsten Kurth authored
* fixing contig in distributed sht * make it normally contiguous before all to all * commented the code to be a WAR
-
- 04 Oct, 2023 1 commit
-
-
Boris Bonev authored
* reworked SFNO example * updated changelog
-
- 06 Sep, 2023 1 commit
-
-
Boris Bonev authored
* Added gradient check to test suite * reduced size of the unit test * switched to parametrized for unittests
-
- 08 Aug, 2023 3 commits
-
-
Srikumar Sastry authored
-
Boris Bonev authored
-
Boris Bonev authored
-
- 02 Aug, 2023 2 commits
-
-
Thorsten Kurth authored
-
Thorsten Kurth authored
-
- 26 Jul, 2023 1 commit
-
-
Mogab Elleithy authored
Without this fix, importing torch_harmonics had run into a module import error when trying to execute: ```import examples```
-
- 09 Jun, 2023 2 commits
-
-
Boris Bonev authored
-
Boris Bonev authored
-
- 08 Jun, 2023 4 commits
-
-
Boris Bonev authored
-
Boris Bonev authored
-
Boris Bonev authored
-
Jean Kossaifi authored
Don't import sfno by default (or try/except importing apex)
-
- 07 Jun, 2023 2 commits
-
-
Boris Bonev authored
-
Boris Bonev authored
-
- 26 Apr, 2023 1 commit
-
-
Boris Bonev authored
-
- 17 Feb, 2023 1 commit
-
-
Boris Bonev authored
-