1. 05 Feb, 2024 1 commit
    • Thorsten Kurth's avatar
      Tkurth/distributed disco (#30) · 214fa40a
      Thorsten Kurth authored
      
      
      * initial implementation of distributed DISCO layer
      
      * working distributed convolution
      
      * working refactored serial conv transpose with torch kernel
      
      * working distributed conv and transposed conv when using the python kernel
      
      * working distributed convolution with torch kernel
      
      * fixed triton kernel tests
      
      * adding print statement to debug CI
      
      * adjusting tolerances in local convolution unittest
      
      ---------
      Co-authored-by: default avatarBoris Bonev <bbonev@nvidia.com>
      214fa40a
  2. 22 Dec, 2023 1 commit
    • Boris Bonev's avatar
      Bbonev/disco refactor (#28) · c971d458
      Boris Bonev authored
      Changed the code to only implicitly use sparse tensors in the modules, in order to enable compatibility with DDP
      c971d458
  3. 20 Dec, 2023 1 commit
    • Boris Bonev's avatar
      Bbonev/disco refactor (#27) · 942aa4ea
      Boris Bonev authored
      * Moved convolutions and exposed them directly
      
      * Added transposition to the unit test
      
      * Minor bugfix in CPU version of DISCO transpose code
      
      * Adding convolution tests to CI
      
      * Added gradient check
      
      * Checking the weight grad as well
      
      * Added test for anisotropic kernels
      942aa4ea