1. 21 Jul, 2025 4 commits
  2. 24 May, 2025 1 commit
  3. 09 Sep, 2024 1 commit
  4. 29 Aug, 2024 1 commit
  5. 05 Feb, 2024 1 commit
    • Tkurth/distributed disco (#30) · 214fa40a
      Thorsten Kurth authored
      
      
      * initial implementation of distributed DISCO layer
      
      * working distributed convolution
      
      * working refactored serial conv transpose with torch kernel
      
      * working distributed conv and transposed conv when using the python kernel
      
      * working distributed convolution with torch kernel
      
      * fixed triton kernel tests
      
      * adding print statement to debug CI
      
      * adjusting tolerances in local convolution unittest
      
      ---------
      Co-authored-by: Boris Bonev <bbonev@nvidia.com>
  6. 14 Dec, 2023 1 commit
    • Bbonev/discrete continuous convolutions (#25) · a0227d4c
      Boris Bonev authored
      
      
      * Fixing the precomputation of the Psi matrix
      
      * moving tests to a parent folder
      
      * removing tqdm
      
      * deleting deprecated distributed tests
      
      * Moving distributed test
      
      * adapting workflow
      
      * Added some more comments to make the code more understandable
      
      * more detailed explanation for the derivation of the rotation angles
      
      * added another comment
      
      ---------
      Co-authored-by: Thorsten Kurth <tkurth@nvidia.com>
  7. 07 Dec, 2023 1 commit
    • Tkurth/flexible sharding (#22) · eab72f04
      Thorsten Kurth authored
      * working distributed fwd SHT with flexible sharding and without padding
      
      * fixing distributed sht with new logic
      
      * fixed distributed SHT and ISHT with flexible padding
      
      * working unittest for distributed fwd SHT
      
      * distributed tests converted to unit tests
      
      * bumping version number up to 0.6.4
      
      * fixing small splitting bug in th distributed
      
      * updated changelog, removed tests folder