Commits · 8b9859d3aeebcd37e6a284fc751c58569857f7be · OpenDAS / vision

04 Mar, 2020 1 commit

`aligned` flag in ROIAlign (#1908) · e1e975f9

AhnDW authored Mar 04, 2020

* Aligned flag in the interfaces

* Aligned flag in the impl, and remove unused comments

* Handling empty bin in forward

* Remove raise error in roi_width

* Aligned flag in the Testcodes

e1e975f9

29 Jan, 2020 1 commit
- Revert "Base decoder for video. (#1747) (#1793)" (#1833) · f2600c2e
  Francisco Massa authored Jan 29, 2020
```
This reverts commit 28b7f8ae.
```
  f2600c2e
27 Jan, 2020 1 commit

Base decoder for video. (#1747) (#1793) · 28b7f8ae

Francisco Massa authored Jan 27, 2020

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1747

Pull Request resolved: https://github.com/pytorch/vision/pull/1746



Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.

Reviewed By: fmassa

Differential Revision: D19358914

fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f
Co-authored-by: Yuri Putivsky <yuri@fb.com>

28b7f8ae

16 Dec, 2019 1 commit
- Improve error message and avoid segfault in DeformConv2d (#1660) · 17ea1482
  Francisco Massa authored Dec 16, 2019
  
  17ea1482
06 Dec, 2019 1 commit
- Fix header includes for cpu (#1644) · 598b61d9
  gslotman authored Dec 06, 2019
  
  598b61d9
04 Dec, 2019 1 commit

Add Deformable Convolution operation. (#1586) · 52b8685b

pedrofreire authored Dec 04, 2019

* Add Deformable Convolution operation.

This adds the deformable convolution operation, as described in Deformable Convolutional Networks (https://arxiv.org/abs/1703.06211).

- The code is based on https://github.com/open-mmlab/mmdetection/blob/master/mmdet/ops/dcn/src/deform_conv_cuda.cpp ; the whole code was modified and refactored to remove redundancies and increase clarity, and to adapt it to torchvision.

- The CPU part is a direct copy of the CUDA code; it might make sense to do follow-up adjustments in the CPU code to simplify it / optimize it, or to reuse functionality between CPU and CUDA..

- We also add tests (with a non-trivial set of parameters); they can be made more robust by randomizing the parameters and executing multiple times.

* Update DeformConv to be more consistent w/ Conv2d

* rename some variables and arguments to match Conv2d;
* add optional bias;
* add weight, offset and bias as module parameters;
* remove the n_parallel_imgs parameter;
* Fix __repr__;
* etc..

Initialization of weight and bias is the same as in Conv2d, and
initialization of offsets to zero is the same as in the paper.

This also includes some other small unrelated fixes/improvements.

* Apply clang-format in DeformConv files.

* Import Optional type annotation

* Remove offset param from DeformConv2d module

- We pass the offset in the forward of DeformConv2d, instead of having
an internal parameter. This adds some complexity to creating the module
(e.g. now you have to worry about the output size, to create the
offset), but it gives more flexibility.
- We also use make_tuple for tuple creation, in an attempt to fix error
w/ older compilers.

* Replace abs by std::abs

Old gcc versions were giving wrong results here, because they would
resolve abs as int -> int, thus causing undesired truncation. Replacing
abs by std::abs should allow for correct overloading of abs as float -> float.

* Reorder declarations for clarity

* Reorder weight and offset args in deform_conv2d

We place offset arg before the weight arg, to be more
consistent with DeformConv2d.forward(input, offset)

* Replace abs by std::abs in DeformConv_cuda

52b8685b

05 Nov, 2019 1 commit
- Fix inconsistent NMS implementation between CPU and CUDA (#1556) · 4897402a
  Francisco Massa authored Nov 05, 2019
```
* Fix inconsistent NMS implementation

* Improve tests for NMS

* Remove unnecessary using statement
```
  4897402a
16 Oct, 2019 1 commit

Implementation for Position-sensitive ROI Pool/Align [updated] (#1410) · 896d7ec7

Lukas Bommes authored Oct 16, 2019

* added PSRoiAlign and PSRoiPool with C++ autograd and torch ops

* fixed linter errors

* fixed linter errors 2

* fixed linter errors 3

896d7ec7

12 Oct, 2019 1 commit

extend video reader to support fast video probing (#1437) · ed5b2dc3

Zhicheng Yan authored Oct 12, 2019

* extend video reader to support fast video probing

* fix c++ lint

* small fix

* allow to accept input video of type torch.Tensor

ed5b2dc3

20 Sep, 2019 1 commit

[video reader] inception commit (#1303) · 31fad34f

Zhicheng Yan authored Sep 20, 2019

* [video reader] inception commit

* add method save_metadata to class VideoClips in video_utils.py

* add load_metadata() method to VideoClips class

* add Exception to not catch unexpected events such as memory erros, interrupt

* fix bugs in video_plus.py

* [video reader]remove logging. update setup.py

* remove time measurement in test_video_reader.py

* Remove glog and try making ffmpeg finding more robust

* Add ffmpeg to conda build

* Add ffmpeg to conda build [again]

* Make library path finding more robust

* Missing import

* One more missing fix for import

* Py2 compatibility and change package to av to avoid version conflict with ffmpeg

* Fix for python2

* [video reader] support to decode one stream only (e.g. video/audio stream)

* remove argument _precomputed_metadata_filepath

* remove save_metadata method

* add get_metadata method

* expose _precomputed_metadata and frame_rate arguments in video dataset __init__ method

* remove ssize_t

* remove size_t to pass CI check on Windows

* add PyInit__video_reader function to pass CI check on Windows

* minor fix to define PyInit_video_reader symbol

* Make c++ video reader optional

* Temporarily revert changes to test_io

* Revert changes to python files

* Rename files to make it private

* Fix python lint

* Fix C++ lint

* add a functor object EnumClassHash to make Enum class instances usable as key type of std::unordered_map

* fix cpp format check

31fad34f

29 Aug, 2019 1 commit
- Use Tensor.data_ptr instead of .data (#1262) · c6e41b5d
  Yuxin Wu authored Aug 29, 2019
```
* Use Tensor.data_ptr instead of .data

* use pytorch-nightly in CI
```
  c6e41b5d
25 Jun, 2019 1 commit
- Renamed vision.h files to vision_cpu.h and vision_cuda.h (#1051) · 44c66270
  Shahriar authored Jun 26, 2019
  
  44c66270
23 May, 2019 1 commit

nms_cuda signature update (#945) · 249cfbf5

Varun Agrawal authored May 23, 2019

Updated nms_cuda signature to accept detections and scores as separate tensors.
This also required updating the indexing in the NMS CUDA kernel.

Also made the iou_threshold parameter name consistent across implementations.

249cfbf5

19 May, 2019 1 commit
- Fix RoIAlign and RoIPool for non-contiguous gradients (#920) · 2e1e0b63
  Francisco Massa authored May 19, 2019
  
  2e1e0b63
07 May, 2019 1 commit

Add C++ ops to torchvision (#826) · dc3ac290

Francisco Massa authored May 07, 2019

* Initial layout for layers with cpp extensions

* Move files around

* Fix import after move

* Add support for multiple types to ROIAlign

* Different organization

CUDA extensions work now

* Cleanups

* Reduce memory requirements for backwards

* Replace runtime_error by AT_ERROR

* Add nms test

* Add support for compilation using CPP extensions

* Change folder structure

* Add ROIPool cuda

* Cleanups

* Add roi_pool.py

* Fix lint

* Add initial structures folder for bounding boxes

* Assertion macros compatible with pytorch master (#540)

* Support for ROI Pooling (#592)

* ROI Pooling with tests. Fix for cuda context in ROI Align.

* renamed bottom and top to follow torch conventions

* remove .type().tensor() calls in favor of the new approach to tensor initialization (#626)

* Consistent naming for rois variable (#627)

* remove .type().tensor() calls in favor of the new approach to tensor initialization

* Consistent naming for rois variable in ROIPool

* ROIPool: Support for all datatypes (#632)

* Use of torch7 naming scheme for ROIAlign forward and backward

* use common cuda helpers in ROIAlign

* use .options() in favor of .type() where applicable

* Added tests for forward pass of ROIAlign, as well as more consistent naming scheme for CPU vs CUDA

* working ROIAlign cuda backwards pass

* working ROIAlign backwards pass for CPU

* added relevant headers for ROIAlign backwards

* tests for ROIAlign layer

* replace .type() with .options() for tensor initialization in ROIAlign layers

* support for Half types in ROIAlign

* gradcheck tests for ROIAlign

* updated ROIPool on CPU to work with all datatypes

* updated and cleaned tests for ROI Pooling

* Fix rebase problem

* Remove structures folder

* Improve cleanup and bugfix in test_layers

* Update C++ headers

* Add CUDAGuard to cu files

* Add more checks to layers

* Add CUDA NMS and tests

* Add multi-type support for NMS CUDA

* Avoid using THCudaMalloc

* Add clang-format and reformat c++ code

* Remove THC includes

* Rename layers to ops

* Add documentation and rename functions

* Improve the documentation a bit

* Fix some lint errors

* Fix remaining lint inssues

* Area computation doesn't add +1 in NMS

* Update CI to use PyTorch nightly

* Make NMS return indices sorted according to the score

* Address reviewer comments

* Lint fixes

* Improve doc for roi_align and roi_pool

* move to xenial

* Fix bug pointed by @lopuhin

* Fix RoIPool reference implementation in Python 2

Also fixes a bug in the clip_boxes_to_image -- this function needs a test!

* Remove change in .travis

dc3ac290