Commits · 3d60f498e71ba63b428edb184c9ac38fa3737fa6 · OpenDAS / vision

22 Dec, 2020 1 commit
- [*.py] Rename "Arguments:" to "Args:" (#3203) · 3d60f498
  Samuel Marks authored Dec 23, 2020
```
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
```
  3d60f498
20 Dec, 2020 1 commit

Clean up and Document io.image enhancements (#3193) · af5cb00c

Siddhant Bansal authored Dec 21, 2020

* Update ImageReadMode error messages, add newline at the end of image_read_mode.h, replace define with const in image_read_mode.h, add documentation to ImageReadMode enum

* Update readpng_cpu and readjpeg_cpu error messages

* Update image.py documentation

af5cb00c

17 Dec, 2020 1 commit

Refactoring and moving MobileNetV2 to make it reusable (#3177) · 4cbe7140

Vasilis Vryniotis authored Dec 17, 2020

* Moving mobilenet.py to mobilenetv2.py

* Adding mobilenet.py for BC.

* Extending ConvBNReLU for reuse.

* Reduce import scope on mobilenet to only the public and versioned classes and methods.

4cbe7140

16 Dec, 2020 1 commit
- Fixing incorrect doc example in MNASNet. (#3180) · 91e03b91
  Vasilis Vryniotis authored Dec 16, 2020
```
* Fixing incorrect doc example in MNASNet.

* Fixing incorrect output.
```
  91e03b91
15 Dec, 2020 2 commits

Cleanup functional_tensor.py (#3159) (#3171) · 1a300d84

Avijit Dasgupta authored Dec 15, 2020



* added the helper method for dimension checks

* unit tests for dimension check function in functional_tensor

* code formatting and typing

* moved torch image check after tensor check

* unit testcases for test_assert_image_tensor added and refactored

* separate unit testcase file deleted

* assert_image_tensor added to newly created 6 methods

* test cases added for the 6 new methods

* removed wrongly pasted posterize method and added solarize method for testing
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

1a300d84

Replacing all torch.jit.annotations with typing (#3174) · 90645ccd
Zhiqiang Wang authored Dec 15, 2020
```
* Replacing all torch.jit.annotations with typing

* Replacing remaining typing
```
90645ccd

14 Dec, 2020 2 commits

Implement all AutoAugment transforms + Policies (#3123) · 83171d6a

Vasilis Vryniotis authored Dec 14, 2020



* Invert Transform (#3104)

* Adding invert operator.

* Make use of the _assert_channels().

* Update upper bound value.

* Remove private doc from invert, create or reuse generic testing methods to avoid duplication of code in the tests. (#3106)

* Create posterize transformation and refactor common methods to assist reuse. (#3108)

* Implement the solarize transform. (#3112)

* Implement the adjust_sharpness transform (#3114)

* Adding functional operator for sharpness.

* Adding transforms for sharpness.

* Handling tiny images and adding a test.

* Implement the autocontrast transform. (#3117)

* Implement the equalize transform (#3119)

* Implement the equalize transform.

* Turn off deterministic for histogram.

* Fixing test. (#3126)

* Force ratio to be float to avoid numeric overflows on blend. (#3127)

* Separate the tests of Adjust Sharpness from ColorJitter. (#3128)

* Add AutoAugment Policies and main Transform (#3142)

* Separate the tests of Adjust Sharpness from ColorJitter.

* Initial implementation, not-jitable.

* AutoAugment passing JIT.

* Adding tests/docs, changing formatting.

* Update test.

* Fix formats

* Fix documentation and imports.

* Apply changes from code review:
- Move the transformations outside of AutoAugment on a separate method.
- Renamed degenerate method for sharpness for better clarity.

* Update torchvision/transforms/functional.py
Co-authored-by: vfdev <vfdev.5@gmail.com>

* Apply more changes from code review:
- Add InterpolationMode parameter.
- Move all declarations away from AutoAugment constructor and into the private method.

* Update documentation.

* Apply suggestions from code review
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

* Apply changes from code review:
- Refactor code to eliminate as many to() and clamp() calls as possible.
- Reuse methods where possible.
- Apply speed ups.

* Replacing pad.
Co-authored-by: vfdev <vfdev.5@gmail.com>
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>

83171d6a

Removing VISION_API from backward() methods and adding an ops.h (#3163) · 4eab7a67
Vasilis Vryniotis authored Dec 14, 2020
```
* Removing VISION_API from backward() methods and adding an ops.h

* Fixing clang format.
```
4eab7a67

12 Dec, 2020 1 commit

[ONNX] Fix ShuffleNetV2 model export issue. (#3158) · 45d9a304

Jay Zhang authored Dec 12, 2020



* Fix an issue where the ShuffleNetV2 model was exported to an incorrect ONNX file if the dynamic_axes field was provided.

* Add a unit test for the bug fix.

* Fix flake8 issue.

* Don't access each element in x.shape, use x.size() instead.
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

45d9a304

11 Dec, 2020 2 commits
- Move autograd implementations on separate files. (#3154) · 0963ff71
  Vasilis Vryniotis authored Dec 11, 2020
  
  0963ff71
- Remove _new_empty_tensor. (#3156) · 07a9c956
  Vasilis Vryniotis authored Dec 11, 2020
```
Co-authored-by: Francisco Massa <fvsmassa@gmail.com>
```
  07a9c956
10 Dec, 2020 1 commit

Restructuring C++ project: (#3146) · 7d831a2f

Vasilis Vryniotis authored Dec 10, 2020

Summary:
* Reduce unnecessary header inclusions in models and io.

* Move autocast to separate folder and hide autograd implementation in an anonymous namespace.

* Moving files in subfolders.

Reviewed By: fmassa

Differential Revision: D25461523

fbshipit-source-id: 756eeb6848aacaa474de4825ed4c1045d17e2cea

7d831a2f

09 Dec, 2020 1 commit
- Removed all backward methods from header files. (#3143) · ce342580
  Vasilis Vryniotis authored Dec 09, 2020
  
  ce342580
08 Dec, 2020 1 commit

Per file C++ Operator registration (#3135) · 3c33f367

Vasilis Vryniotis authored Dec 08, 2020

* Moving deform_conv2d op registration.

* Moving nms op registration.

* Moving new_empty_tensor op registration.

* Moving ps_roi_align op registration.

* Moving ps_roi_pool op registration.

* Moving roi_align op registration.

* Moving roi_pool op registration.

* Restoring headers for forward/backward and fixing styles.

* Restoring the test hack on windows.

* Stricter header inclusion.

3c33f367

07 Dec, 2020 2 commits
- Remove "#pragma once" from cpp files. (#3134) · f80b83ea
  Vasilis Vryniotis authored Dec 07, 2020
  
  f80b83ea
- DatasetFolder: change documentation to include files in subfolders. (#3131) · 70ed29d0
  Robert-Jan Bruintjes authored Dec 07, 2020
  
  70ed29d0
04 Dec, 2020 2 commits

Remove torchscript workaround for center_crop (#3118) · aa753263
Francisco Massa authored Dec 04, 2020
```
This has been fixed in PyTorch in https://github.com/pytorch/pytorch/pull/40897
```
aa753263

pil_to_tensor accimage backend return uint8 (#3109) · 2780c889

Francisco Massa authored Dec 04, 2020

accimage always stores images as uint8, so let's be compatible with the internal representation. Tests were failing without this, as it would return a float32 image normalized to the range 0-1.

2780c889

03 Dec, 2020 2 commits
- Fix potential overflow in convert_image_dtype (#3107) · dab47572
  Francisco Massa authored Dec 03, 2020
```
We could have errors such as aten/src/ATen/native/cpu/PowKernel.cpp:41:5:  runtime error: 5.7896e+76 is outside the range of representable values of type 'float'
```
  dab47572
- Use new TORCH_LIBRARY_FRAGMENT to register video_reader (#3105) · 0a75a0c1
  Francisco Massa authored Dec 03, 2020
  
  0a75a0c1
02 Dec, 2020 5 commits

Check num of channels on adjust_* transformations (#3069) · 7f1a05a3
Vasilis Vryniotis authored Dec 02, 2020
```
* Fixing upperbound value on tests and documentation.

* Limit the number of channels on adjust_* transforms.
```
7f1a05a3

Encapsulate and Standardise C++ Ops (#3097) · 0ebbb0ab

Vasilis Vryniotis authored Dec 02, 2020

* Encapsulate and standardize deform_conv2d (#3074)

* Rename files.

* Standardizing method names.

* Adding anonymous namespaces.

* Applying C++ naming rules and aligning variable names across headers and cpp files.

* Syncing names across implementations.

* Rename deform_conv2d.h to deform_conv2d.cpp

* Use header files:
- Create header files for kernel implementation and remove definitions from vision_*.h files.
- Eliminate unnecessary headers and ensure all cpp include their headers.

* Change the naming convention for kernel implementations.

* Remove the _param postfix from the variables and standardizing names.

* Exposing public forward/backward methods to the C++ API and moving methods around to minimize git blame changes.

* Encapsulate and standardize nms (#3081)

* Syncing, where possible, the names of functions across devices.

* Adding all internal functions in anonymous namespaces.

* Renaming C++/CUDA kernel files and moving operator code from header to cpp file.

* Create a separate header file with "public" functions for each cpp file.

* Removing unnecessary repeated includes.

* Update CMakeLists.txt to include all headers.

* Encapsulate and standardize ps_roi_align (#3082)

* Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.
Syncing, where possible, the names of functions across devices.

* Adding all internal functions in anonymous namespaces.

* Renaming C++/CUDA kernel files and moving operator code from header to cpp file.

* Create a separate header file with "public" functions for each cpp file.

* Removing unnecessary repeated includes.

* Encapsulate and standardize ps_roi_pool (#3084)

* Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.

* Adding all internal functions in anonymous namespaces.

* Renaming C++/CUDA kernel files and moving operator code from header to cpp file.

* Create a separate header file with "public" functions for each cpp file.

* Removing unnecessary repeated includes.

* Encapsulate and standardize roi_align (#3085)

* Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.

* Adding all internal functions in anonymous namespaces.

* Renaming C++/CUDA kernel files and moving operator code from header to cpp file.

* Create a separate header file with "public" functions for each cpp file.

* Removing unnecessary repeated includes.

* Encapsulate and standardize roi_pool (#3088)

* Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.

* Adding all internal functions in anonymous namespaces.

* Syncing variable names between the cpp files and their header files.

* Renaming C++/CUDA kernel files and moving operator code from header to cpp file.

* Create a separate header file with "public" functions for each cpp file.

* Removing unnecessary repeated includes.

* Encapsulate and standardize new_empty_tensor_op (#3089)

* Renaming C++ files & methods according to recommended naming conventions and aligning them with Python's API.

* Create a separate header file with "public" functions for each cpp file.

* Adding all internal functions in anonymous namespaces.

* Convert to const ref all possible parameters.

* Removing unnecessary repeated includes.

* Encapsulate and standardize C++ Ops - Clean up (#3094)

* Removing unnecessary repeated includes.

* Remove unnecessary vision_cpu.h, vision_cuda.h, autocast.h.

* Fixing naming convention and correcting method names on macros.

* Turn on clang formatter for cu files and fixing broken styles.

* Replace "#ifndef ... #define ... #endif" with "#pragma once" on header files.

* Adding operator methods in vision::ops namespace. (#3096)

* Adding operator methods in vision::ops namespace.

* Replace general.h with macros.h

* Adding vision.h to the necessary cpp files.

0ebbb0ab

Miscellaneous linter fixes (#3095) · 8520f0be
Francisco Massa authored Dec 02, 2020
```
Replace tabs with spaces, add newlines to files and replace whitelist with allowlist
```
8520f0be

Remove noexcept from cuda_version (#3091) · ac288eaf

Francisco Massa authored Dec 02, 2020

Operator registration of functions with noexcept doesn't work on some compilers, see https://github.com/pytorch/pytorch/issues/48667

ac288eaf

Fill color support for tensor affine transforms (#2904) · 21deb4d0

Zhengyang Feng authored Dec 02, 2020



* Fill color support for tensor affine transforms

* PEP fix

* Docstring changes and float support

* Docstring update for transforms and float type cast

* Cast only for Tensor

* Temporary patch for lack of Union type support, plus an extra unit test

* More plausible bilinear filling for tensors

* Keep things simple & New docstrings

* Fix lint and other issues after merge

* make it in one line

* Docstring and some code modifications

* More tests and corresponding changes for transforms and docstring changes

* Simplify test configs

* Update test_functional_tensor.py

* Update test_functional_tensor.py

* Move assertions
Co-authored-by: vfdev <vfdev.5@gmail.com>

21deb4d0

01 Dec, 2020 6 commits

add UUID in LOG() in decoder (#3080) · df4003fd
Francisco Massa authored Dec 01, 2020
```
* add UUID in LOG() in decoder

* Fix lint

* More lint
```
df4003fd

concatenate small tensors into big ones to reduce the use of shared f… (#1795) · 9fc6522d

Francisco Massa authored Dec 01, 2020

* concatenate small tensors into big ones to reduce the use of shared file descriptor (#1694)

Summary:
Pull Request resolved: https://github.com/pytorch/vision/pull/1694

- The PT dataloader forks worker processes to speed up fetching dataset examples. The recommended multiprocessing context is `forkserver` rather than `fork`.

- The main process and the worker processes share the dataset class instance, which avoids duplicating the dataset and saves memory. To do this, `ForkPickler(..).dumps(...)` is called to serialize the objects, including, recursively, the objects within the dataset instance. A `VideoClips` instance internally uses O(N) `torch.Tensor` objects to store per-video information, such as pts and possible clips, where N is the number of videos.

- During dumping, each `torch.Tensor` uses one file descriptor (FD). The OS default maximum number of FDs is 65K, which can be queried with `ulimit -n`. The number of tensors in `VideoClips` often exceeds this limit.

- To resolve this issue, we use a few big tensors by concatenating small tensors in the `__getstate__()` method, which will be called during pickling. This will only require O(1) tensors.

- Once this diff lands, we can abandon D19173248

In D19173397, in ClassyVision, we changed the mp context from `fork` to `forkserver`, and can finally run the PT dataloader without hanging issues.
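
The `__getstate__()` idea described in this summary can be sketched roughly as follows. This is a minimal illustration and not the torchvision code: `ClipIndex` and its attribute names are hypothetical, and the standard-library `array` type stands in for `torch.Tensor` so the example runs without PyTorch. With real tensors, the packing and unpacking steps would presumably use `torch.cat` and `torch.split`.

```python
import pickle
from array import array


class ClipIndex:
    """Hypothetical container holding one small buffer per video."""

    def __init__(self, per_video_pts):
        # One small array per video: O(N) separate objects.
        self.per_video_pts = [array("q", pts) for pts in per_video_pts]

    def __getstate__(self):
        # Pack everything into one flat buffer plus a list of lengths,
        # so pickling handles O(1) buffers instead of O(N).
        flat = array("q")
        sizes = []
        for pts in self.per_video_pts:
            flat.extend(pts)
            sizes.append(len(pts))
        return {"flat": flat, "sizes": sizes}

    def __setstate__(self, state):
        # Split the flat buffer back into the per-video arrays.
        flat, sizes = state["flat"], state["sizes"]
        self.per_video_pts = []
        start = 0
        for n in sizes:
            self.per_video_pts.append(flat[start:start + n])
            start += n


index = ClipIndex([[0, 1, 2], [], [10, 20]])
restored = pickle.loads(pickle.dumps(index))
print([list(pts) for pts in restored.per_video_pts])
```

Storing the lengths next to the flat buffer is what makes the round trip lossless, including for videos with no entries at all.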

Reviewed By: fmassa

Differential Revision: D19179991

fbshipit-source-id: c8716775c7c154aa33d93b25d112d2a59ea688a9

* Try to fix Windows

* Try fix Windows v2

* Disable tests on Windows

* Add back necessary part

* Try fix OSX (and maybe Windows)

* Fix

* Try enabling Windows
Co-authored-by: Zhicheng Yan <zyan3@fb.com>

9fc6522d

Fix: Improve the bounding boxes implementation (#3075) · d97825ea

AdityaKhursale authored Dec 01, 2020



* Fix: Improve the bounding boxes implementation

Use write_png instead of PIL in test_draw_boxes()
Initialize txt_font only once

* Remove channels permutation in test_draw_boxes
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Aditya Khursale <akhursale@nvidia.com>

d97825ea

Add option to write audio to video file (#2304) · 1b00af38

Francisco Massa authored Dec 01, 2020



* Add option to write audio to video file

Summary:
I was trying to use torchvision's `write_video` function and realized there was no option to add in the audio.

Thus, this diff contains the changes necessary such that this is possible. This is my first time trying to contribute to this project, so be as harsh as you need!

Reviewed By: fmassa

Differential Revision: D21480083

fbshipit-source-id: 2e11f2c8728d42f86c94068f75b843793d5a94aa

* Fix typo

* Try fix Windows

* Disable test on Windows
Co-authored-by: Joanna Bitton <jbitton@fb.com>

1b00af38

Enable rtmp timeout in decoder (#3076) · 4eb9f660

Francisco Massa authored Dec 01, 2020



Summary:
* Link libav change into fbcode
* Set rw_timeout value

Differential Revision: D23412524

fbshipit-source-id: 5755950be1b1b4c37cb0c3a69a8c875f8862a92c
Co-authored-by: Keyun Tong <ktong@fb.com>

4eb9f660

Fix spelling mistake: orignal -> original (#3062) · 1b83f46c

ProGamerGov authored Dec 01, 2020



* Fix spelling mistake: orignal -> original

* Spelling fix: orignal -> original
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

1b83f46c

30 Nov, 2020 2 commits

Fix spelling mistakes: dimenions -> dimensions (#3061) · 6e7ed49a
ProGamerGov authored Nov 30, 2020
```
* Fix spelling: dimenions -> dimensions

* Fix spelling mistake: dimenions -> dimensions
```
6e7ed49a

Fixes types annotation (#3059) · 181e81ce

Vasilis Vryniotis authored Nov 30, 2020



* Correcting incorrect types

* Add missing type statement

* Fix type annotations in unittest

* Fix TypeError

* Fix TypeError

* Fix type equality judgment

* Fix recursive compile

* Use string for class name annotation.
Co-authored-by: zhiqiang <zhiqwang@outlook.com>

181e81ce

27 Nov, 2020 5 commits

Support for image with no annotations in RetinaNet (#3032) · 4ab46e5f

Vasilis Vryniotis authored Nov 27, 2020

* Enable support for images without annotations

* Ensuring gradient propagates to RegressionHead.

* Rewriting losses to remove branching.

* Fix the seed on DeformConv autocast test.

4ab46e5f

Plural to singular name change. (#3055) · 9e71fdaf
Vasilis Vryniotis authored Nov 27, 2020

9e71fdaf

[BC-breaking] Introduced InterpolationModes and deprecated arguments: resample... · 0c445130

vfdev authored Nov 27, 2020

[BC-breaking] Introduced InterpolationModes and deprecated arguments: resample and fillcolor (#2952)

* Deprecated arguments: resample and fillcolor
Replaced by interpolation and fill

* Updates according to the review

* Added tests to check warnings and asserted BC

* [WIP] Interpolation modes

* Added InterpolationModes enum

* Added support for int values for interpolation for BC

* Removed useless test code

* Fix flake8
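
The backward-compatibility handling listed in this commit can be illustrated with a small sketch. It is not the torchvision implementation: `resolve_interpolation` and `_INT_TO_MODE` are hypothetical names, the enum is reduced to three members, and the warning texts are made up. The integer codes are PIL's resampling constants (0 for nearest, 2 for bilinear, 3 for bicubic).

```python
import warnings
from enum import Enum


class InterpolationMode(Enum):
    # Illustrative subset of modes; member values are assumptions.
    NEAREST = "nearest"
    BILINEAR = "bilinear"
    BICUBIC = "bicubic"


# PIL's integer resampling codes for the three modes above.
_INT_TO_MODE = {
    0: InterpolationMode.NEAREST,
    2: InterpolationMode.BILINEAR,
    3: InterpolationMode.BICUBIC,
}


def resolve_interpolation(interpolation=InterpolationMode.BILINEAR, resample=None):
    """Hypothetical helper: return an InterpolationMode from new or legacy arguments."""
    if resample is not None:
        # The deprecated keyword still works, but it warns and takes precedence.
        warnings.warn("'resample' is deprecated, use 'interpolation' instead",
                      DeprecationWarning)
        interpolation = resample
    if isinstance(interpolation, int):
        # Backward compatibility: accept legacy integer codes.
        warnings.warn("int interpolation values are deprecated, use InterpolationMode",
                      DeprecationWarning)
        if interpolation not in _INT_TO_MODE:
            raise ValueError(f"unsupported interpolation code: {interpolation}")
        interpolation = _INT_TO_MODE[interpolation]
    if not isinstance(interpolation, InterpolationMode):
        raise TypeError("interpolation must be an InterpolationMode or a legacy int")
    return interpolation


print(resolve_interpolation(InterpolationMode.BICUBIC))
```

Callers that still pass `resample=...` or an integer keep working under this scheme and only see a deprecation warning.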

0c445130

Add utility to draw bounding boxes (#2785) · 240210c9

Aditya Oke authored Nov 27, 2020



* initial prototype

* flake

* Adds documentation

* minimal working bboxes

* Adds label display

* adds colors :-)

* adds suggestions and fixes CI

* handles image of dim 4

* fixes image handling

* removes dev file

* adds suggested changes

* Updating the API.

* Update test.

* Implementing code review improvements.

* Further refactoring and adding test.

* Replace random to white to reduce size and change font on tests.
Co-authored-by: Vasilis Vryniotis <vvryniotis@fb.com>

240210c9

Adding Python type hints, correcting incorrect types, removing unnecessary... · b3adace6
Vasilis Vryniotis authored Nov 27, 2020
```
Adding Python type hints, correcting incorrect types, removing unnecessary vars and simplifying code. (#3045)
```
b3adace6

26 Nov, 2020 1 commit

Add a warning if any clip can't be obtained from a video in VideoClips. (#2513) · 1bdda8cb

Santiago Castro authored Nov 26, 2020



* Add a warning if a clip can't be obtained from a video in VideoClips

* Update torchvision/datasets/video_utils.py
Co-authored-by: Philip Meier <github.pmeier@posteo.de>

* Add a test
Co-authored-by: Philip Meier <github.pmeier@posteo.de>

1bdda8cb

20 Nov, 2020 1 commit

Add explicit check for number of channels (#3013) · a51c49e4

Alexey Demyanchuk authored Nov 20, 2020



* Add explicit check for number of channels

Example why you need to check it:
`M = torch.randint(low=0, high=2, size=(6, 64, 64), dtype = torch.float)`
When you put this input through to_pil_image without mode argument, it converts to uint8 here:
```
if pic.is_floating_point() and mode != 'F':
            pic = pic.mul(255).byte()
```
and changes the mode to RGB here:
```
if mode is None and npimg.dtype == np.uint8:
            mode = 'RGB'
```
Image.fromarray doesn't raise if provided with mode RGB; it just cuts the number of channels down to 3.

* Check number of channels before processing

* Add test for invalid number of channels

* Put check after channel dim unsqueeze

* Add test if error message is matching

* Delete redundant code

* Bug fix in checking for bad types
Co-authored-by: Demyanchuk <demyanca@mh-hannover.local>
Co-authored-by: vfdev <vfdev.5@gmail.com>
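
A check of the kind this commit describes might look like the sketch below. It is only an illustration under stated assumptions: `check_channel_count` is a hypothetical helper that works on a plain shape tuple rather than a tensor, and the limit of 4 channels is an assumption (common PIL modes such as RGBA and CMYK have at most four), not something stated in the commit message.

```python
def check_channel_count(shape, max_channels=4):
    """Hypothetical helper: validate a (C, H, W) or (H, W) image shape.

    The default limit of 4 is an assumption made for this sketch.
    """
    if len(shape) == 2:
        # A 2D image is treated as having a single channel.
        channels = 1
    elif len(shape) == 3:
        channels = shape[0]
    else:
        raise ValueError(f"expected a 2D or 3D image, got {len(shape)} dimensions")
    if channels > max_channels:
        # Fail loudly instead of letting extra channels be dropped silently.
        raise ValueError(f"expected at most {max_channels} channels, got {channels}")
    return channels


print(check_channel_count((3, 64, 64)))
try:
    check_channel_count((6, 64, 64))
except ValueError as err:
    print(err)
```

With the 6-channel shape from the example in the commit message, the helper raises before any conversion happens, rather than producing a silently truncated RGB image.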

a51c49e4