• Thorsten Kurth's avatar
    Tkurth/distributed disco (#30) · 214fa40a
    Thorsten Kurth authored
    
    
    * initial implementation of distributed DISCO layer
    
    * working distributed convolution
    
    * working refactored serial conv transpose with torch kernel
    
    * working distributed conv and transposed conv when using the python kernel
    
    * working distributed convolution with torch kernel
    
    * fixed triton kernel tests
    
    * adding print statement to debug CI
    
    * adjusting tolerances in local convolution unittest
    
    ---------
    Co-authored-by: default avatarBoris Bonev <bbonev@nvidia.com>
    214fa40a
convolution.py 17.6 KB