1. 21 Jul, 2025 2 commits
  2. 16 Jul, 2025 1 commit
  3. 08 Jul, 2025 1 commit
    • Thorsten Kurth's avatar
      Tkurth/remove sparse coo tensor (#89) · bd92cdf7
      Thorsten Kurth authored
      * refactoring disco backend code
      
      * removed get_psi as member function and instead put it in _disco_convolution
      
      * setting seeds in tests more consistently
      
      * parametrized test classes to ensure that tests are always run on both CPU and GPU (if available)
      
      * cleaning up
      bd92cdf7
  4. 02 Jul, 2025 1 commit
  5. 13 Jun, 2025 3 commits
  6. 11 Jun, 2025 2 commits
  7. 06 Jun, 2025 1 commit
  8. 04 Jun, 2025 1 commit
  9. 26 May, 2025 1 commit
  10. 24 May, 2025 2 commits
  11. 26 Feb, 2025 1 commit
  12. 21 Feb, 2025 1 commit
  13. 14 Jan, 2025 6 commits
  14. 20 Sep, 2024 1 commit
    • Boris Bonev's avatar
      Bbonev/gradient test fix (#53) · 4fea88bf
      Boris Bonev authored
      * added analytic gradients to the gradient_analysis notebook
      
      * fixing sht unittest to not check the roundtrip gradient but sht and isht individually
      
      * Updated changelog
      4fea88bf
  15. 09 Sep, 2024 1 commit
  16. 29 Aug, 2024 1 commit
  17. 22 Aug, 2024 1 commit
    • Boris Bonev's avatar
      Bbonev/disco fused quadrature (#42) · 77a64b2c
      Boris Bonev authored
      * missed one instance of python3
      
      * fused multiplication of quadrature into multiplication of the Psi tensor
      
      * Added quadrature change to changelog
      
      * removing persistent quad weights tensor as it isn't needed anymore
      
      * added pretty-prenting for convolution modules
      
      * adjusting default value in convolution test
      77a64b2c
  18. 20 Aug, 2024 1 commit
  19. 19 Aug, 2024 1 commit
    • Boris Bonev's avatar
      Tkurth/cuda disco (#38) · 29e7fb68
      Boris Bonev authored
      
      
      * adding cuda kernels for disco conv
      
      * making psi_idx an attribute
      
      * adding license headers
      
      * adding author files
      
      * reorganizing files
      
      * draft implementation
      
      * added conditional installation to setup.py
      
      * formatting changes
      
      * removing triton kernel in DISCO convolution
      
      * updated github actions
      
      * updated Readme and changelog
      
      * adding another guard for the cuda installation
      
      * renaming the  cuda extension
      
      * simplifying setup.py
      
      * minor bugfix
      
      * Bbonev/cuda disco cleanup (#32)
      
      * cleanup of disco convolutions based on CUDA extension
      
      * fixing unittest
      
      * changing version to experimental 0.7.0a
      
      * initial rewrite of the distributed convolution with CUDA
      
      * fixing streams
      
      * need to fix install options
      
      * fixing streams
      
      * undid setup.py changes
      
      * reset setup.py
      
      * including CUDAStream
      
      * adjusted the precomputation of theta_cutoff. If you rely on this, your models will not be backwards-compatible.
      
      * adjusting theta_cutoff in the unittest
      
      * adding newly refactored kernels for faster compile
      
      * Tkurth/cuda disco distributed fix (#34)
      
      * attempt to make disco distributed
      
      * working distributed convolutions
      
      * fixing distributed conv
      
      * working distributed disco
      
      * removing irrelevant extra argument
      
      * using stream functions from at instead of c10
      
      * using stream functions from at instead of c10, small fix
      
      * Bbonev/disc even filters (#35)
      
      * initial working commit with new convention of counting collocation points across the diameter instead of across the radius
      
      * fixed a bug in the computation of the even kernels
      
      * changing heuristic for computing theta_cutoff
      
      * Fixing unittest
      
      * Readability improvements
      
      * reworked normalization of filter basis functions
      
      * implemented discrete normalization of disco filters
      
      * relaxing tolerances in convolution unit test
      
      * bugfix to correctly support unequal scale factors in latitudes and longitudes
      
      * hotfix to a bug in the imports
      
      * Bbonev/distributed disco refactor (#37)
      
      * cleaned up normalization code in convolution
      
      * formatting changes in distributed convolution
      
      * Fixing default theta_cutoff to be the same in distributed and local case
      
      * fixed distributed convolution to support the same normalization as non-distributed one
      
      * readability improvements
      
      * fixed initial scale of convolution parameter weights and fixed naming of the normalization routine
      
      * Updated Readme.md
      
      * added comment in Dockerfile regarding older architectures
      
      ---------
      Co-authored-by: default avatarThorsten Kurth <tkurth@nvidia.com>
      Co-authored-by: default avatarBoris Bonev <bbonev@nvidia.com>
      29e7fb68
  20. 05 Feb, 2024 1 commit
    • Thorsten Kurth's avatar
      Tkurth/distributed disco (#30) · 214fa40a
      Thorsten Kurth authored
      
      
      * initial implementation of distributed DISCO layer
      
      * working distributed convolution
      
      * working refactored serial conv transpose with torch kernel
      
      * working distributed conv and transposed conv when using the python kernel
      
      * working distributed convolution with torch kernel
      
      * fixed triton kernel tests
      
      * adding print statement to debug CI
      
      * adjusting tolerances in local convolution unittest
      
      ---------
      Co-authored-by: default avatarBoris Bonev <bbonev@nvidia.com>
      214fa40a
  21. 22 Dec, 2023 1 commit
    • Boris Bonev's avatar
      Bbonev/disco refactor (#28) · c971d458
      Boris Bonev authored
      Changed the code to only implicitly use sparse tensors in the modules, in order to enable compatibility with DDP
      c971d458
  22. 20 Dec, 2023 1 commit
    • Boris Bonev's avatar
      Bbonev/disco refactor (#27) · 942aa4ea
      Boris Bonev authored
      * Moved convolutions and exposed them directly
      
      * Added transposition to the unit test
      
      * Minor bugfix in CPU version of DISCO transpose code
      
      * Adding convolution tests to CI
      
      * Added gradient check
      
      * Checking the weight grad as well
      
      * Added test for anisotropic kernels
      942aa4ea
  23. 14 Dec, 2023 1 commit
    • Boris Bonev's avatar
      Bbonev/discrete continuous convolutions (#25) · a0227d4c
      Boris Bonev authored
      
      
      * Fixing the precomputation of the Psi matrix
      
      * moving tests to a parent folder
      
      * removing tqdm
      
      * deleting deprecated distributed tests
      
      * Moving distributed test
      
      * adapting workfolow
      
      * Added some more comments to make the code more understandable
      
      * more detailed explanation for the derivation of the rotation angles
      
      * added another comment
      
      ---------
      Co-authored-by: default avatarThorsten Kurth <tkurth@nvidia.com>
      a0227d4c
  24. 07 Dec, 2023 1 commit
    • Thorsten Kurth's avatar
      Tkurth/flexible sharding (#22) · eab72f04
      Thorsten Kurth authored
      * working distributed fwd SHT with flexible sharding and without padding
      
      * fixing distributed sht with new logic
      
      * fixed distributed SHT and ISHT with flexible padding
      
      * working unittest for distributed fwd SHT
      
      * distributed tests converted to unit tests
      
      * bumping version number up to 0.6.4
      
      * fixing small splitting bug in th distributed
      
      * updated changeloc, removed tests folder
      eab72f04
  25. 26 Apr, 2023 2 commits
  26. 17 Feb, 2023 1 commit