- 07 Apr, 2020 2 commits
-
-
Brian Hart authored
Torchvision includes at least three pieces of code that calculate box Intersection over Union (IoU) values (and usually compare them to a threshold):
- box_iou in torchvision/ops/boxes.py
- devIoU in torchvision/csrc/cuda/nms_cuda.cu
- nms_cpu_kernel in torchvision/csrc/cpu/nms_cpu.cpp

The calculations were performed slightly differently in each, leading to occasional differences in results. Update devIoU to use the same method as the others for better consistency. This change improves agreement between the CPU and CUDA calculations, but the results can still differ slightly. Setting NVCC_FLAGS to include "--fmad=false" (nvcc enables fused multiply-add by default) would provide still better agreement, but at a likely cost to performance.
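The commit message does not say which arithmetic each of the three sites used. As a hedged illustration only, the pure-Python sketch below shows two algebraically equivalent ways to test a pair of boxes against an IoU threshold. In exact arithmetic they always agree; in floating point they can round differently for pairs that sit very close to the threshold, which is the kind of discrepancy the message describes. The function names are hypothetical and are not torchvision APIs; boxes are assumed to be `(x1, y1, x2, y2)` tuples with positive area.

```python
def box_area(box):
    # Assumes x2 > x1 and y2 > y1 (no +1 added to width or height).
    x1, y1, x2, y2 = box
    return (x2 - x1) * (y2 - y1)


def intersection_area(a, b):
    # Width and height of the overlap, clamped at zero for disjoint boxes.
    w = max(min(a[2], b[2]) - max(a[0], b[0]), 0.0)
    h = max(min(a[3], b[3]) - max(a[1], b[1]), 0.0)
    return w * h


def iou_exceeds_by_division(a, b, threshold):
    # Compute the IoU ratio explicitly, then compare it to the threshold.
    inter = intersection_area(a, b)
    union = box_area(a) + box_area(b) - inter
    return inter / union > threshold


def iou_exceeds_by_multiplication(a, b, threshold):
    # Same test with the division moved to the other side of the inequality.
    inter = intersection_area(a, b)
    union = box_area(a) + box_area(b) - inter
    return inter > threshold * union
```

For pairs well away from the threshold both helpers return the same answer; keeping every call site on one formulation removes the borderline cases where they might not.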
-
AhnDW authored
* Replace **.is_cuda() with just is_cuda()
* Replace type with scalar_type
* Fix lint, clang-format
* Fix lint, clang-format
-
- 02 Jan, 2020 1 commit
-
-
Yuxin Wu authored
1. Let the IoU function compare with the threshold. This avoids a division. A similar strategy is also used in https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/kernels/non_max_suppression_op.cu.cc
2. Only compute the upper triangle of the mask. This speeds up the kernel by about 20% (tested on a GTX 1080Ti, with 20 input cases dumped from a Mask R-CNN inference job).
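The upper-triangle idea can be sketched in plain Python, under the assumption that boxes are already sorted by descending score, so box `i` can only ever suppress boxes `j > i`. This is a sequential illustration with hypothetical names, not the CUDA kernel itself, which works on bit masks in parallel blocks.

```python
def iou_above(a, b, threshold):
    # Threshold test without a division: inter > threshold * union.
    w = max(min(a[2], b[2]) - max(a[0], b[0]), 0.0)
    h = max(min(a[3], b[3]) - max(a[1], b[1]), 0.0)
    inter = w * h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter > threshold * (area_a + area_b - inter)


def nms_upper_triangle(sorted_boxes, threshold):
    # sorted_boxes: (x1, y1, x2, y2) tuples, assumed sorted by descending score.
    n = len(sorted_boxes)
    # mask[i] lists only the j > i that box i overlaps too much.
    # The lower triangle (j < i) is never computed.
    mask = [
        [j for j in range(i + 1, n)
         if iou_above(sorted_boxes[i], sorted_boxes[j], threshold)]
        for i in range(n)
    ]
    removed = [False] * n
    keep = []
    for i in range(n):
        if removed[i]:
            continue  # suppressed by an earlier, higher-scoring box
        keep.append(i)
        for j in mask[i]:
            removed[j] = True
    return keep
```

Skipping the lower triangle roughly halves the number of pairwise tests, because a lower-scoring box never suppresses a higher-scoring one in greedy NMS.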
-
- 05 Nov, 2019 1 commit
-
-
Francisco Massa authored
* Fix inconsistent NMS implementation
* Improve tests for NMS
* Remove unnecessary using statement
-
- 29 Aug, 2019 1 commit
-
-
Yuxin Wu authored
* Use Tensor.data_ptr instead of .data
* Use pytorch-nightly in CI
-
- 23 May, 2019 2 commits
-
-
Francisco Massa authored
* #944 MSBuild Compile time casting Error
* #944 MSBuild Error static_cast<Long> to static_cast<int64_t>
* Add eval.py Not Work find_contours
* Remove unnecessary file
* Lint
-
Varun Agrawal authored
Updated nms_cuda signature to accept detections and scores as separate tensors. This also required updating the indexing in the NMS CUDA kernel. Also made the iou_threshold parameter name consistent across implementations.
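To illustrate the calling convention this message describes (boxes and scores passed separately, plus an `iou_threshold`), here is a minimal pure-Python reference sketch of greedy NMS. The name `nms_reference` and the list-based inputs are assumptions made for illustration; the real kernels operate on tensors and are not reproduced here.

```python
def iou(a, b):
    # Plain IoU of two (x1, y1, x2, y2) boxes; returns 0.0 for an empty union.
    w = max(min(a[2], b[2]) - max(a[0], b[0]), 0.0)
    h = max(min(a[3], b[3]) - max(a[1], b[1]), 0.0)
    inter = w * h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms_reference(boxes, scores, iou_threshold):
    # Visit boxes from highest to lowest score.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep box i only if it does not overlap any kept box too much.
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep  # indices into the original inputs, in descending score order
```

Because the scores arrive as a separate argument, the returned indices refer to the caller's original box order rather than to a pre-sorted copy.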
-
- 07 May, 2019 1 commit
-
-
Francisco Massa authored
* Initial layout for layers with cpp extensions
* Move files around
* Fix import after move
* Add support for multiple types to ROIAlign
* Different organization CUDA extensions work now
* Cleanups
* Reduce memory requirements for backwards
* Replace runtime_error by AT_ERROR
* Add nms test
* Add support for compilation using CPP extensions
* Change folder structure
* Add ROIPool cuda
* Cleanups
* Add roi_pool.py
* Fix lint
* Add initial structures folder for bounding boxes
* Assertion macros compatible with pytorch master (#540)
* Support for ROI Pooling (#592)
* ROI Pooling with tests. Fix for cuda context in ROI Align.
* renamed bottom and top to follow torch conventions
* remove .type().tensor() calls in favor of the new approach to tensor initialization (#626)
* Consistent naming for rois variable (#627)
* remove .type().tensor() calls in favor of the new approach to tensor initialization
* Consistent naming for rois variable in ROIPool
* ROIPool: Support for all datatypes (#632)
* Use of torch7 naming scheme for ROIAlign forward and backward
* use common cuda helpers in ROIAlign
* use .options() in favor of .type() where applicable
* Added tests for forward pass of ROIAlign, as well as more consistent naming scheme for CPU vs CUDA
* working ROIAlign cuda backwards pass
* working ROIAlign backwards pass for CPU
* added relevant headers for ROIAlign backwards
* tests for ROIAlign layer
* replace .type() with .options() for tensor initialization in ROIAlign layers
* support for Half types in ROIAlign
* gradcheck tests for ROIAlign
* updated ROIPool on CPU to work with all datatypes
* updated and cleaned tests for ROI Pooling
* Fix rebase problem
* Remove structures folder
* Improve cleanup and bugfix in test_layers
* Update C++ headers
* Add CUDAGuard to cu files
* Add more checks to layers
* Add CUDA NMS and tests
* Add multi-type support for NMS CUDA
* Avoid using THCudaMalloc
* Add clang-format and reformat c++ code
* Remove THC includes
* Rename layers to ops
* Add documentation and rename functions
* Improve the documentation a bit
* Fix some lint errors
* Fix remaining lint issues
* Area computation doesn't add +1 in NMS
* Update CI to use PyTorch nightly
* Make NMS return indices sorted according to the score
* Address reviewer comments
* Lint fixes
* Improve doc for roi_align and roi_pool
* move to xenial
* Fix bug pointed by @lopuhin
* Fix RoIPool reference implementation in Python 2. Also fixes a bug in the clip_boxes_to_image -- this function needs a test!
* Remove change in .travis
-