- 02 Sep, 2020 1 commit
-
-
Ashish Farmer authored
* add autocasting on ROCm * enable ROIAlign autocasting on ROCm * enable NMS autocasting on ROCm * fix to use correct torch CUDA APIs
-
- 09 Jul, 2020 1 commit
-
-
mcarilli authored
* Fixes Xiao's repro * Ports nms to use full dispatcher * Move HIPGuard to nms_cuda * clang-format * run models in test_models.py on GPU if available * Francisco's comment, also disable cuda model tests to see if CPU alone still passes * cuda tests now pass locally, although still not comparing to saved numerics * add note for thing to ask francisco * Allow cuda and cpu tests to share a data file * ignore suffix if unneeded * Skip autocast numerics checks for a few models * Add roi_align test Co-authored-by:Michael Carilli <mcarilli@nvidia.com>
-
- 30 Jun, 2020 1 commit
-
-
Edward Z. Yang authored
* Switch torchvision registrations to new operator registration API. This is still registering everything as catchalls, so we're really just moving deck chairs around, but payoff is coming soon. Signed-off-by:
Edward Z. Yang <ezyang@fb.com> * Port roi_align to actually use dispatcher Signed-off-by:
Edward Z. Yang <ezyang@fb.com>
-
- 26 May, 2020 1 commit
-
-
Shawn Zhong authored
* Avoid `using` in header files * Fix clang_format * use clang-format-7 to reformat code
-
- 07 Apr, 2020 1 commit
-
-
AhnDW authored
* Replace **.is_cuda() to just is_cuda() * Replace type to scalar_type * Fix lint, clang-format * Fix lint, clang-format
-
- 12 Mar, 2020 1 commit
-
-
Francisco Massa authored
-
- 11 Mar, 2020 1 commit
-
-
Ashish Farmer authored
* Added code to support creating extension on ROCm * max -> fmaxf conversion for hipification * added WITH_HIP flag for hipExtension * added appropriate headers for HIP build * use USE_ROCM in condition to build * change fmaxf and fminf calls * fminf -> min * fix the check for ROCM_HOME * more robust checking for rocm pytorch * add check for pytorch version before using HIP extensions * conditional reading of ROCM_HOME
-
- 04 Mar, 2020 1 commit
-
-
AhnDW authored
* Aligned flag in the interfaces * Aligned flag in the impl, and remove unused comments * Handling empty bin in forward * Remove raise error in roi_width * Aligned flag in the Testcodes
-
- 18 Sep, 2019 1 commit
-
-
Francisco Massa authored
* Remove C++ extensions in favor of custom ops * Remove unused custom_ops.cpp file * Rename _custom_ops.py * Reorganize functions * Minor improvements and fixes * Fix lint * Fully scriptable ops * Import types used by annotations
-
- 10 Sep, 2019 1 commit
-
-
Lara Haidar authored
* Revert "Revert "Register Torchvision Ops as Cutom Ops (#1267)" (#1316)" This reverts commit fe234fc8. * Make import of C++ extensions lazy * define python initialization functions for extension * Fix lint
-
- 09 Sep, 2019 2 commits
-
-
Edward Z. Yang authored
This reverts commit 78f169b5.
-
Lara Haidar authored
* Register torchvision ops * install ORT only with python 3 * remane lib + address other comments * fix lint * fix lib copy * find file with pattern instead of suffix * use relative path * revert rename and use imp to find lib * fix typo
-
- 25 Jun, 2019 1 commit
-
-
Shahriar authored
-
- 07 May, 2019 1 commit
-
-
Francisco Massa authored
* Initial layout for layers with cpp extensions * Move files around * Fix import after move * Add support for multiple types to ROIAlign * Different organization CUDA extensions work now * Cleanups * Reduce memory requirements for backwards * Replace runtime_error by AT_ERROR * Add nms test * Add support for compilation using CPP extensions * Change folder structure * Add ROIPool cuda * Cleanups * Add roi_pool.py * Fix lint * Add initial structures folder for bounding boxes * Assertion macros compatible with pytorch master (#540) * Support for ROI Pooling (#592) * ROI Pooling with tests. Fix for cuda context in ROI Align. * renamed bottom and top to follow torch conventions * remove .type().tensor() calls in favor of the new approach to tensor initialization (#626) * Consistent naming for rois variable (#627) * remove .type().tensor() calls in favor of the new approach to tensor initialization * Consistent naming for rois variable in ROIPool * ROIPool: Support for all datatypes (#632) * Use of torch7 naming scheme for ROIAlign forward and backward * use common cuda helpers in ROIAlign * use .options() in favor of .type() where applicable * Added tests for forward pass of ROIAlign, as well as more consistent naming scheme for CPU vs CUDA * working ROIAlign cuda backwards pass * working ROIAlign backwards pass for CPU * added relevant headers for ROIAlign backwards * tests for ROIAlign layer * replace .type() with .options() for tensor initialization in ROIAlign layers * support for Half types in ROIAlign * gradcheck tests for ROIAlign * updated ROIPool on CPU to work with all datatypes * updated and cleaned tests for ROI Pooling * Fix rebase problem * Remove structures folder * Improve cleanup and bugfix in test_layers * Update C++ headers * Add CUDAGuard to cu files * Add more checks to layers * Add CUDA NMS and tests * Add multi-type support for NMS CUDA * Avoid using THCudaMalloc * Add clang-format and reformat c++ code * Remove THC includes * Rename layers to ops * Add documentation and rename functions * Improve the documentation a bit * Fix some lint errors * Fix remaining lint inssues * Area computation doesn't add +1 in NMS * Update CI to use PyTorch nightly * Make NMS return indices sorted according to the score * Address reviewer comments * Lint fixes * Improve doc for roi_align and roi_pool * move to xenial * Fix bug pointed by @lopuhin * Fix RoIPool reference implementation in Python 2 Also fixes a bug in the clip_boxes_to_image -- this function needs a test! * Remove change in .travis
-