[WIP] Allow autocast for 1.6 (#2384)
* Fixes Xiao's repro
* Ports nms to use full dispatcher
* Move HIPGuard to nms_cuda
* clang-format
* run models in test_models.py on GPU if available
* Francisco's comment, also disable cuda model tests to see if CPU alone still passes
* cuda tests now pass locally, although still not comparing to saved numerics
* add note for thing to ask francisco
* Allow cuda and cpu tests to share a data file
* ignore suffix if unneeded
* Skip autocast numerics checks for a few models
* Add roi_align test
Co-authored-by:
Michael Carilli <mcarilli@nvidia.com>
Showing
torchvision/csrc/autocast.h
0 → 100644
Please register or sign in to comment