"vscode:/vscode.git/clone" did not exist on "33a5435ca3aa0097b4a8dd96e4156db5164a4baf"
Add CUDA Focal Loss Implementation (#1337)
Take-over of #1097
* Add fast CUDA focal loss implementation.
* Enable fast math for CUDA focal loss.
* Correct typo.
* replace deprecated macros
* TORCH_CUDA_CHECK -> AT_CUDA_CHECK
The former is defined in torch/csrc/profiler/cuda.cpp, so it is usually not available.
The latter, however, is defined in ATen/cuda/Exceptions.h as an alias of C10_CUDA_CHECK.
* add test
* clean up
* guard for torchvision
Co-authored-by: Wil Kong <alpha0422@gmail.com>