• Boris Bonev's avatar
    Tkurth/cuda disco (#38) · 29e7fb68
    Boris Bonev authored
    
    
    * adding cuda kernels for disco conv
    
    * making psi_idx an attribute
    
    * adding license headers
    
    * adding author files
    
    * reorganizing files
    
    * draft implementation
    
    * added conditional installation to setup.py
    
    * formatting changes
    
    * removing triton kernel in DISCO convolution
    
    * updated github actions
    
    * updated Readme and changelog
    
    * adding another guard for the cuda installation
    
    * renaming the  cuda extension
    
    * simplifying setup.py
    
    * minor bugfix
    
    * Bbonev/cuda disco cleanup (#32)
    
    * cleanup of disco convolutions based on CUDA extension
    
    * fixing unittest
    
    * changing version to experimental 0.7.0a
    
    * initial rewrite of the distributed convolution with CUDA
    
    * fixing streams
    
    * need to fix install options
    
    * fixing streams
    
    * undid setup.py changes
    
    * reset setup.py
    
    * including CUDAStream
    
    * adjusted the precomputation of theta_cutoff. If you rely on this, your models will not be backwards-compatible.
    
    * adjusting theta_cutoff in the unittest
    
    * adding newly refactored kernels for faster compile
    
    * Tkurth/cuda disco distributed fix (#34)
    
    * attempt to make disco distributed
    
    * working distributed convolutions
    
    * fixing distributed conv
    
    * working distributed disco
    
    * removing irrelevant extra argument
    
    * using stream functions from at instead of c10
    
    * using stream functions from at instead of c10, small fix
    
    * Bbonev/disc even filters (#35)
    
    * initial working commit with new convention of counting collocation points across the diameter instead of across the radius
    
    * fixed a bug in the computation of the even kernels
    
    * changing heuristic for computing theta_cutoff
    
    * Fixing unittest
    
    * Readability improvements
    
    * reworked normalization of filter basis functions
    
    * implemented discrete normalization of disco filters
    
    * relaxing tolerances in convolution unit test
    
    * bugfix to correctly support unequal scale factors in latitudes and longitudes
    
    * hotfix to a bug in the imports
    
    * Bbonev/distributed disco refactor (#37)
    
    * cleaned up normalization code in convolution
    
    * formatting changes in distributed convolution
    
    * Fixing default theta_cutoff to be the same in distributed and local case
    
    * fixed distributed convolution to support the same normalization as non-distributed one
    
    * readability improvements
    
    * fixed initial scale of convolution parameter weights and fixed naming of the normalization routine
    
    * Updated Readme.md
    
    * added comment in Dockerfile regarding older architectures
    
    ---------
    Co-authored-by: default avatarThorsten Kurth <tkurth@nvidia.com>
    Co-authored-by: default avatarBoris Bonev <bbonev@nvidia.com>
    29e7fb68
plotting.py 3.66 KB