- 02 Sep, 2020 1 commit
-
-
Ashish Farmer authored
* add autocasting on ROCm * enable ROIAlign autocasting on ROCm * enable NMS autocasting on ROCm * fix to use correct torch CUDA APIs
-
- 09 Jul, 2020 1 commit
-
-
mcarilli authored
* Fixes Xiao's repro * Ports nms to use full dispatcher * Move HIPGuard to nms_cuda * clang-format * run models in test_models.py on GPU if available * Francisco's comment, also disable cuda model tests to see if CPU alone still passes * cuda tests now pass locally, although still not comparing to saved numerics * add note for thing to ask francisco * Allow cuda and cpu tests to share a data file * ignore suffix if unneeded * Skip autocast numerics checks for a few models * Add roi_align test Co-authored-by:Michael Carilli <mcarilli@nvidia.com>
-