    Add CUDA Focal Loss Implementation (#1337) · 28f8539c
    Masaki Kozuki authored
    
    
    Take-over of #1097
    
    * Add fast CUDA focal loss implementation.
    
    * Enable fast math for CUDA focal loss.
    
    * Correct typo.
    
    * replace deprecated macros
    
    * TORCH_CUDA_CHECK -> AT_CUDA_CHECK
    
    The former is defined in torch/csrc/profiler/cuda.cpp, so it is usually not available.
    The latter, however, is defined in ATen/cuda/Exceptions.h as an alias of C10_CUDA_CHECK.
    
    * add test
    
    * clean up
    
    * guard for torchvision

    Co-authored-by: Wil Kong <alpha0422@gmail.com>
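
For orientation, the quantity a focal loss kernel computes can be sketched in plain Python. This is a minimal reference for the standard binary (sigmoid) focal loss, FL = -alpha_t * (1 - p_t)^gamma * log(p_t), and not the CUDA code added by this commit. The function names, the default `alpha` and `gamma` values, and the per-element (unreduced) form are assumptions made for illustration; the kernel in this commit may differ in details such as normalization or label handling.

```python
import math


def _log_sigmoid(x: float) -> float:
    """Return log(sigmoid(x)) without overflowing for large |x|."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def sigmoid_focal_loss(logit: float, target: int,
                       alpha: float = 0.25, gamma: float = 2.0) -> float:
    """Hypothetical reference: binary focal loss for a single logit.

    target is 1 for a positive example and 0 for a negative one.
    """
    # p_t is the probability assigned to the true class. For target == 0,
    # 1 - sigmoid(logit) equals sigmoid(-logit), so flipping the sign suffices.
    z = logit if target == 1 else -logit
    log_p_t = _log_sigmoid(z)
    p_t = math.exp(log_p_t)
    # alpha_t weights positives by alpha and negatives by (1 - alpha).
    alpha_t = alpha if target == 1 else 1.0 - alpha
    # (1 - p_t) ** gamma down-weights examples that are already well classified.
    return -alpha_t * (1.0 - p_t) ** gamma * log_p_t
```

A reference like this is mainly useful in tests, where the output of a fused kernel can be compared element by element against a slow but readable implementation.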
setup.py 26.4 KB