Commits · 3e06bc6f2ddd56cd713229a7f0404ff56cabda47 · OpenDAS / vision

01 Jun, 2020 1 commit
- Add more tests to NMS (#2279) · 34810c0c
  Francisco Massa authored Jun 01, 2020
```
* Add more tests to NMS

* Fix lint
```
  34810c0c
26 May, 2020 1 commit

Avoid `using` in header files (#2257) · e89c4c01

Shawn Zhong authored May 26, 2020

* Avoid `using` in header files

* Fix clang_format

* use clang-format-7 to reformat code

e89c4c01

18 May, 2020 1 commit
- Fix missing include for OSX in video decoder (#2224) · 7d5b601b
  Francisco Massa authored May 18, 2020
```
* Fix missing include for OSX in video decoder

* clang-format
```
  7d5b601b
14 May, 2020 1 commit

Make ceil_div __host__ __device__ (#2217) · a6073f07

Gao, Xiang authored May 14, 2020

Fixes https://github.com/pytorch/vision/issues/2214#issuecomment-628636663

I don't know why the building is not working with `--expt-relaxed-constexpr` flag set, but it is generally a good idea to declare this as `__host__ __device__`

a6073f07

12 May, 2020 1 commit
- Add namespace to avoid conflict with ATen version of channel_shuffle(). (#2206) · 4e138bd5
  xkszltl authored May 12, 2020
```
Fix https://github.com/pytorch/vision/issues/2193.
```
  4e138bd5
04 May, 2020 1 commit
- Don't include CUDAApplyUtils.cuh (#2127) · 5066d715
  Gao, Xiang authored May 04, 2020
```
* Don't include CUDAApplyUtils.cuh

* fix format

* fix atomic
```
  5066d715
23 Apr, 2020 1 commit

fix the use of contiguous() in kernels (#2131) · c031e287

Yuxin Wu authored Apr 23, 2020



* fix the use of contiguous() in kernels

* clang-format

* add a contiguous in nms
Co-authored-by: Yuxin Wu <ppwwyyxx@users.noreply.github.com>

c031e287

07 Apr, 2020 2 commits

improve consistency among box IoU calculations (#2072) · f6a3e0c3

Brian Hart authored Apr 07, 2020

Torchvision includes at least 3 bits of code that calculate
box Intersection over Union values (and usually compare to
a threshold):

- box_iou in torchvision/ops/boxes.py
- devIoU in torchvision/csrc/cuda/nms_cuda.cu
- nms_cpu_kernel in torchvision/csrc/cpu/nms_cpu.cpp

The calculations were performed slightly differently between
those, leading to occasional differences in results.

Update devIoU to use the same method as the others for better
consistency.

This change improves agreement between the CPU and CUDA
calculations but the results can still differ slightly.
Setting NVCC_FLAGS to include "--fmad=true" would provide
still better agreement, but with likely cost to performance.

f6a3e0c3

Remove warning about deprecated (#2064) · 57c789f8

AhnDW authored Apr 07, 2020

* Replace **.is_cuda() to just is_cuda()

* Replace type to scalar_type

* Fix lint, clang-format

* Fix lint, clang-format

57c789f8

03 Apr, 2020 3 commits
- Fix C++ lint (#2059) · 9ed2fa3c
  Francisco Massa authored Apr 03, 2020
  
  9ed2fa3c
- Fix some deprecated warnings (#2055) · 3c2c0022
  gslotman authored Apr 03, 2020
  
  3c2c0022
- Add clang-format to CircleCI (#2057) · d0b32a11
  Francisco Massa authored Apr 03, 2020
```
* Add clang-format to CircleCI

* Fix for clang-format version

* Fix lint and remove Travis CI

* Seeing if lost commit comes back

* Fix lint

* Re-enable all tests
```
  d0b32a11
02 Apr, 2020 1 commit

Add test for large batches in DeformConv2d (#2040) · ccd797dd

Francisco Massa authored Apr 02, 2020

* Add test for large batches in DeformConv2d

* Clean-up and (try) fix DeformConv2d

* Simplifications and bugfixes

* Try fix CUDA now

ccd797dd

01 Apr, 2020 1 commit
- Fix C++ lint (#2041) · f2f085bf
  Francisco Massa authored Apr 01, 2020
  
  f2f085bf
30 Mar, 2020 2 commits

Fix shape error for deform conv (#2027) · 7ee5a8b7

Yuwen Xiong authored Mar 30, 2020

* fix shape error for deform conv gpu op

recover shape of columns for next iteration in for loops, previous version will cause error when batch_sz / n_parallel_imgs > 1

* fix shape error for deform conv cpu op

recover shape of columns for next iteration in for loops, previous version will cause error when batch_sz / n_parallel_imgs > 1

7ee5a8b7

Fix Tensor::data<> deprecation. (#2028) · 561a014b
Mikhail Lobanov authored Mar 30, 2020

561a014b

24 Mar, 2020 1 commit
- Fix C++ lint (#2009) · 3c254fb7
  Francisco Massa authored Mar 24, 2020
```
* Fix C++ lint

* More fixes
```
  3c254fb7
17 Mar, 2020 1 commit

Update video reader to use new decoder (#1978) · 32e16805

Francisco Massa authored Mar 17, 2020

* Base decoder for video. (#1747)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1747

Pull Request resolved: https://github.com/pytorch/vision/pull/1746

Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.

Reviewed By: fmassa

Differential Revision: D19358914

fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f

* Integrated base decoder into VideoReader class and video_utils.py (#1766)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1766

Replaced FfmpegDecoder (incompativle with VUE) by base decoder (compatible with VUE).
Modified python utilities video_utils.py for internal simplification. Public interface got preserved.

Reviewed By: fmassa

Differential Revision: D19415903

fbshipit-source-id: 4d7a0158bd77bac0a18732fe4183fdd9a57f6402

* Optimizating base decoder performance. (#1852)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1852

Changed base decoder internals for a faster clip processing.

Reviewed By: stephenyan1231

Differential Revision: D19748379

fbshipit-source-id: 58a435f0a0b25545e7bd1a3edb0b1d558176a806

* Minor fix and decoder class members access.

Summary:
Found and fix a bug in cropping algorithm (simple mistyping).
Also derived classes need access to some decoder class members, like initialization parameters - make it protected.

Reviewed By: stephenyan1231, fmassa

Differential Revision: D19895076

fbshipit-source-id: 691336c8e18526b085ae5792ac3546bc387a6db9

* Added missing header for less dependencies. (#1898)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1898

Include streams/samplers shouldn't depend on decoder headers. Add dependencies directly to the place where they are required.

Reviewed By: stephenyan1231

Differential Revision: D19911404

fbshipit-source-id: ef322a053708405c02cee4562b456b1602fb12fc

* Implemented VUE Asynchronous Decoder

Summary: For Mothership we have found that asynchronous decoder provides a better performance.

Differential Revision: D20026194

fbshipit-source-id: 627b91844b4e3f917002031dd32cb19c239f4ba8

* fix a bug in API read_video_from_memory (#1942)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1942

In D18720474, it introduces a bug in `read_video_from_memory` API. Thank weiyaowang for reporting it.

Reviewed By: weiyaowang

Differential Revision: D20270179

fbshipit-source-id: 66348c99a5ad1f9129b90e934524ddfaad59de03

* extend decoder to support new video_max_dimension argument (#1924)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1924

Extend `video reader` decoder python API in Torchvision to support a new argument `video_max_dimension`. This enables the new video decoding use cases. When setting `video_width=0`, `video_height=0`, `video_min_dimension != 0`, and `video_max_dimension != 0`, we can rescale the video clips so that its spatial resolution (height, width) becomes
- (video_min_dimension, video_max_dimension) if original height < original width
- (video_max_dimension, video_min_dimension) if original height >= original width

This is useful at video model testing stage, where we perform fully convolution evaluation and take entire video frames without cropping as input. Previously, for instance we can only set `video_width=0`, `video_height=0`, `video_min_dimension = 128`, which will preserve aspect ratio. In production dataset, there are a small number of videos where aspect ratio is either extremely large or small, and when the shorter edge is rescaled to 128, the longer edge is still large. This will easily cause GPU memory OOM when we sample multiple video clips, and put them in a single minibatch.

Now, we can set (for instance) `video_width=0`, `video_height=0`, `video_min_dimension = 128` and `video_max_dimension = 171` so that the rescale resolution is either (128, 171) or (171, 128) depending on whether original height is larger than original width. Thus, we are less likely to have gpu OOM because the spatial size of video clips is determined.

Reviewed By: putivsky

Differential Revision: D20182529

fbshipit-source-id: f9c40afb7590e7c45e6908946597141efa35f57c

* Fixing samplers initialization (#1967)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1967

No-ops for torchvision diff, which fixes samplers.

Differential Revision: D20397218

fbshipit-source-id: 6dc4d04364f305fbda7ca4f67a25ceecd73d0f20

* Exclude C++ test files
Co-authored-by: Yuri Putivsky <yuri@fb.com>
Co-authored-by: Zhicheng Yan <zyan3@fb.com>

32e16805

12 Mar, 2020 1 commit
- Fix C++ linnt (#1971) · 601ce5fc
  Francisco Massa authored Mar 12, 2020
  
  601ce5fc
11 Mar, 2020 1 commit

[ROCm] Create torchvision as a HIP Extension (#1928) · 43e94b39

Ashish Farmer authored Mar 11, 2020

* Added code to support creating extension on ROCm

* max -> fmaxf conversion for hipification

* added WITH_HIP flag for hipExtension

* added appropriate headers for HIP build

* use USE_ROCM in condition to build

* change fmaxf and fminf calls

* fminf -> min

* fix the check for ROCM_HOME

* more robust checking for rocm pytorch

* add check for pytorch version before using HIP extensions

* conditional reading of ROCM_HOME

43e94b39

04 Mar, 2020 2 commits
- replace torch 1.5.0 items flagged with deprecation warnings (fix #1906) (#1918) · b6f28ec1
  Francis Charette Migneault authored Mar 04, 2020
  
  b6f28ec1
- `aligned` flag in ROIAlign (#1908) · e1e975f9
  AhnDW authored Mar 04, 2020
```
* Aligned flag in the interfaces

* Aligned flag in the impl, and remove unused comments

* Handling empty bin in forward

* Remove raise error in roi_width

* Aligned flag in the Testcodes
```
  e1e975f9
29 Jan, 2020 1 commit
- Revert "Base decoder for video. (#1747) (#1793)" (#1833) · f2600c2e
  Francisco Massa authored Jan 29, 2020
```
This reverts commit 28b7f8ae.
```
  f2600c2e
27 Jan, 2020 1 commit

Base decoder for video. (#1747) (#1793) · 28b7f8ae

Francisco Massa authored Jan 27, 2020

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1747

Pull Request resolved: https://github.com/pytorch/vision/pull/1746



Added the implementation of ffmpeg based decoder with functionality that can be used in VUE and TorchVision.

Reviewed By: fmassa

Differential Revision: D19358914

fbshipit-source-id: abb672f89bfaca6351dda2354f0d35cf8e47fa0f
Co-authored-by: Yuri Putivsky <yuri@fb.com>

28b7f8ae

22 Jan, 2020 1 commit
- Fix Windows build by renaming Python init functions (#1779) · e2a8b418
  peterjc123 authored Jan 23, 2020
  
  e2a8b418
02 Jan, 2020 1 commit

Speed up nms_cuda (#1704) · 06cbdb5b

Yuxin Wu authored Jan 02, 2020

1. Let the IOU function compare with threshold. This avoid a division. Similar strategy is also used in https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/kernels/non_max_suppression_op.cu.cc
2. Only compute the upper triangle of the mask.

This speeds up the kernel about 20% (tested on GTX 1080Ti, with 20 input cases dumped from a Mask R-CNN inference job).

06cbdb5b

16 Dec, 2019 1 commit
- Improve error message and avoid segfault in DeformConv2d (#1660) · 17ea1482
  Francisco Massa authored Dec 16, 2019
  
  17ea1482
06 Dec, 2019 1 commit
- Fix header includes for cpu (#1644) · 598b61d9
  gslotman authored Dec 06, 2019
  
  598b61d9
04 Dec, 2019 1 commit

Add Deformable Convolution operation. (#1586) · 52b8685b

pedrofreire authored Dec 04, 2019

* Add Deformable Convolution operation.

This adds the deformable convolution operation, as described in Deformable Convolutional Networks (https://arxiv.org/abs/1703.06211).

- The code is based on https://github.com/open-mmlab/mmdetection/blob/master/mmdet/ops/dcn/src/deform_conv_cuda.cpp ; the whole code was modified and refactored to remove redundancies and increase clarity, and to adapt it to torchvision.

- The CPU part is a direct copy of the CUDA code; it might make sense to do follow-up adjustments in the CPU code to simplify it / optimize it, or to reuse functionality between CPU and CUDA..

- We also add tests (with a non-trivial set of parameters); they can be made more robust by randomizing the parameters and executing multiple times.

* Update DeformConv to be more consistent w/ Conv2d

* rename some variables and arguments to match Conv2d;
* add optional bias;
* add weight, offset and bias as module parameters;
* remove the n_parallel_imgs parameter;
* Fix __repr__;
* etc..

Initialization of weight and bias is the same as in Conv2d, and
initialization of offsets to zero is the same as in the paper.

This also includes some other small unrelated fixes/improvements.

* Apply clang-format in DeformConv files.

* Import Optional type annotation

* Remove offset param from DeformConv2d module

- We pass the offset in the forward of DeformConv2d, instead of having
an internal parameter. This adds some complexity to creating the module
(e.g. now you have to worry about the output size, to create the
offset), but it gives more flexibility.
- We also use make_tuple for tuple creation, in an attempt to fix error
w/ older compilers.

* Replace abs by std::abs

Old gcc versions were giving wrong results here, because they would
resolve abs as int -> int, thus causing undesired truncation. Replacing
abs by std::abs should allow for correct overloading of abs as float -> float.

* Reorder declarations for clarity

* Reorder weight and offset args in deform_conv2d

We place offset arg before the weight arg, to be more
consistent with DeformConv2d.forward(input, offset)

* Replace abs by std::abs in DeformConv_cuda

52b8685b

25 Nov, 2019 1 commit

Make maskrcnn scriptable (#1407) · d88d8961

eellison authored Nov 25, 2019

* almost working...

* respond to comments

* add empty tensor op, handle different output types in generalized rcnn

* clean ups

* address comments

* more changes

* it's working!

* torchscript bugs

* add script/ eager test

* eval script model

* fix flake

* division import

* py2 compat

* update test, fix arange bug

* import division statement

* fix linter

* fixes

* changes needed for JIT master

* cleanups

* remove imagelist_to

* requested changes

* Make FPN backwards-compatible and torchscript compatible

We remove support for feature channels=0, but support for it was already a bit limited

* Fix ONNX regression

d88d8961

15 Nov, 2019 1 commit
- Fix C++ lint (#1584) · 4b2f8dab
  Francisco Massa authored Nov 15, 2019
  
  4b2f8dab
14 Nov, 2019 1 commit
- Rename with_bias() to bias(), and output_channels() to out_channels() in C++... · 44a5bae9
  Will Feng authored Nov 14, 2019
```
Rename with_bias() to bias(), and output_channels() to out_channels() in C++ conv layer options usage (#1576)
```
  44a5bae9
05 Nov, 2019 1 commit
- Fix inconsistent NMS implementation between CPU and CUDA (#1556) · 4897402a
  Francisco Massa authored Nov 05, 2019
```
* Fix inconsistent NMS implementation

* Improve tests for NMS

* Remove unnecessary using statement
```
  4897402a
17 Oct, 2019 1 commit
- Fix CUDA builds on Windows (#1485) · 7b5075fc
  Francisco Massa authored Oct 17, 2019
  
  7b5075fc
16 Oct, 2019 1 commit

Implementation for Position-sensitive ROI Pool/Align [updated] (#1410) · 896d7ec7

Lukas Bommes authored Oct 16, 2019

* added PSRoiAlign and PSRoiPool with C++ autograd and torch ops

* fixed linter errors

* fixed linter errors 2

* fixed linter errors 3

896d7ec7

12 Oct, 2019 1 commit

extend video reader to support fast video probing (#1437) · ed5b2dc3

Zhicheng Yan authored Oct 12, 2019

* extend video reader to support fast video probing

* fix c++ lint

* small fix

* allow to accept input video of type torch.Tensor

ed5b2dc3

08 Oct, 2019 1 commit

Revert "Change all torch::nn::init::Nonlinearity::{name} and... · ef0ffb80

Francisco Massa authored Oct 08, 2019

Revert "Change all torch::nn::init::Nonlinearity::{name} and torch::nn::init::FanMode::{name} to torch::k{name} (#1394)" (#1428)

This reverts commit 8c3cea7f.

ef0ffb80

02 Oct, 2019 1 commit

Change all torch::nn::init::Nonlinearity::{name} and... · 8c3cea7f

Will Feng authored Oct 02, 2019

Change all torch::nn::init::Nonlinearity::{name} and torch::nn::init::FanMode::{name} to torch::k{name} (#1394)

* Change all torch::nn::init::Nonlinearity::{name} and torch::nn::init::FanMode::{name} to torch::k{name}

* empty commit

* fix lint

* fix lint

* fix lint

8c3cea7f

30 Sep, 2019 2 commits

Revert "Change all torch::nn::init::Nonlinearity::{name} and... · def95df2

Will Feng authored Sep 30, 2019

Revert "Change all torch::nn::init::Nonlinearity::{name} and torch::nn::init::FanMode::{name} usage to torch::k{name}"

This reverts commit e22c105c.

def95df2

Change all torch::nn::init::Nonlinearity::{name} and... · e22c105c

Will Feng authored Sep 30, 2019

Change all torch::nn::init::Nonlinearity::{name} and torch::nn::init::FanMode::{name} usage to torch::k{name}

e22c105c