Commits · 9effc4cdc940952500c72b3ea5a0d7348f33b004 · OpenDAS / vision

11 Jul, 2022 1 commit

[proto] Added some transformations and fixed type hints (#6245) · 9effc4cd

vfdev authored Jul 11, 2022

* Another attempt to add transforms

* Fixed padding type hint

* Fixed fill arg for pad and rotate, affine

* code formatting and type hints for affine transformation

* Fixed flake8

* Updated tests to save and load transforms

* Fixed code formatting issue

* Fixed jit loading issue

* Restored fill default value to None
Updated code according to the review

* Added tests for rotation, affine and zoom transforms

* Put back commented code

* Random erase bypass boxes and masks
Go back with if-return/elif-return/else-return

* Fixed acceptable and non-acceptable types for Cutmix/Mixup

* Updated conditions for _BaseMixupCutmix

9effc4cd

08 Jul, 2022 1 commit
- [jit] Updated tests checking save and load transforms (#6248) · 54160313
  vfdev authored Jul 08, 2022
```
* Updated tests to save and load transforms

* Fixed code formatting issue
```
  54160313
06 Jul, 2022 1 commit

[proto] Added mid-level ops and feature-based ops (#6219) · bd19fb8e

vfdev authored Jul 06, 2022

* Added mid-level ops and feature-based ops

* Fixing deadlock in dataloader with circular imports

* Added non-scalar fill support workaround for pad

* Removed comments

* int/float support for fill in pad op

* Updated type hints and removed bypass option from mid-level methods

* Minor nit fixes

bd19fb8e

01 Jul, 2022 1 commit
- Fill arg supports float values, scripted pad op (#6226) · fea1f733
  vfdev authored Jul 01, 2022
  
  fea1f733
24 Jun, 2022 1 commit

Add MViT architecture in TorchVision (#6198) · fb7f9a16

Vasilis Vryniotis authored Jun 24, 2022

* Adding MViT v2 architecture (#6105)

* Adding mvitv2 architecture

* Fixing memory issues on tests and minor refactorings.

* Adding input validation

* Adding docs and minor refactoring

* Add `min_temporal_size` in the supported meta-data.

* Switch Tuple[int, int, int] with List[int] to support easier the 2D case

* Adding more docs and references

* Change naming conventions of classes to follow the same pattern as MobileNetV3

* Fix test breakage.

* Update todos

* Performance optimizations.

* Add support to MViT v1 (#6179)

* Switch implementation to v1 variant.

* Fix docs

* Adding back a v2 pseudovariant

* Changing the way the network are configured.

* Temporarily removing v2

* Adding weights.

* Expand _squeeze/_unsqueeze to support arbitrary dims.

* Update references script.

* Fix tests.

* Fixing frames and preprocessing.

* Fix std/mean values in transforms.

* Add permanent Dropout and update the weights.

* Update accuracies.

* Fix documentation

* Remove unnecessary expected file.

* Skip big model test

* Rewrite the configuration logic to reduce LOC.

* Fix mypy

fb7f9a16

23 Jun, 2022 5 commits

Skip big models on both cpu and gpu test to fix CI(#6197) · 32e63417
YosuaMichael authored Jun 23, 2022

32e63417

[proto] Improvements for functional API and tests (#6187) · 6155808f

vfdev authored Jun 23, 2022

* Added base tests for rotate_image_tensor

* Updated resize_image_tensor API and tests and fixed a bug with max_size

* Refactored and modified private api for resize functional op

* Fixed failures

* More updates

* Updated proto functional op: resize_image_*

* Added max_size arg to resize_bounding_box and updated basic tests

* Update functional.py

* Reverted fill/center order for rotate
Other nits

6155808f

Refactored and modified private api for resize functional op (#6191) · aeafa912

vfdev authored Jun 23, 2022

* Refactored and modified private api for resize functional op

* Fixed failures

* More updates

* Fixed flake8

aeafa912

Added antialias arg to resized crop transform and op (#6193) · a5536de9
vfdev authored Jun 23, 2022

a5536de9

Add raft-stereo model to prototype/models (#6107) · 11caf37a

YosuaMichael authored Jun 23, 2022

* Add rough raft-stereo implementation on prototype/models

* Add standard raft_stereo builder, and modify context_encoder to be more similar with original implementation

* Follow original implementation on pre-convolve context

* Fix to make sure we can load original implementation weight and got same output

* reusing component from raft

* Make the raft_stereo_fast able to load original weight implementation

* Format with ufmt and update some comment

* Use raft FlowHead

* clean up comments

* Remove unnecessary import and use ufmt format

* Add __all__ and more docs for RaftStereo class

* Only accept param and not module for raft stereo builder

* Cleanup comment

* Adding typing to raft_stereo

* Update some of raft code and reuse on raft stereo

* Use bool instead of int

* Make standard raft_stereo model jit scriptable

* Make the function _make_out_layer using boolean with_block and init the block_layer with identity

* Separate corr_block into two modules for pyramid and building corr features

* Use tuple if input is not variable size, also remove default value if using List

* Format using ufmt and update ConvGRU to not inherit from raft in order to satisfy both jit script and mypy

* Change RaftStereo docs input type

* Ufmt format raft

* revert back convgru to see mypy errors, add test for jit and fx, make the model fx compatible

* ufmt format

* Specify device for new tensor, dont init module then overwrite and put if-else instead

* Ignore mypy problem on override, put back num_iters on forward

* Revert some effort to make it fx compatible but unnecessary now

* refactor code and remove num_iters from RaftStereo constructor

* Change to raft_stereo_realtime, and specify device directly for tensor creation

* Add description for raft_stereo_realtime

* Update the test for raft_stereo

* Fix raft stereo prototype test to properly test jit script

* Ufmt format

* Test against expected file, change name from raft_stereo to raft_stereo_builder to prevent import error

* Revert __init__.py changes

* Add default value for non-list param on model builder

* Add checking on out_with_block length, add more docs on the encoder

* Use base instead of basic since it is more commonly used

* rename expect file to base as well

* rename on test

* Revert the revert of __init__.py, also revert the adding default value to _raft_stereo to follow the standard pattern

* ufmt format __init__.py

11caf37a

13 Jun, 2022 1 commit

Added elastic transform in torchvision.transforms (#4938) · 9430be76

Lenz authored Jun 13, 2022



* Added elastic augment

* ufmt formatting

* updated comments

* fixed circular dependency issue and bare except error

* Fixed three type checking errors in functional_tensor.py

* ufmt formatted

* changed elastic_deformation to a more common implementation

Implementation uses alpha and sigma to control strength and smoothness of the displacement vectors in elastic_deformation instead of control_point_spacings and sigma.

* ufmt formatting

* Some performance updates

Put random offset vectors to device before gaussian_blur is applied speeds it up 3-fold.

* fixed type error

* fixed again a type error

* Update torchvision/transforms/functional_tensor.py
Co-authored-by: vfdev <vfdev.5@gmail.com>

* Added some requested changes

- pil image support similar to GaussianBlur
- changed interpolation arg to InterpolationMode
- added a wrapper in torchvision.transforms.functional.py that gets called by the class in transforms.py
-renamed it to ElasticTransform
- handled sigma = 0 case

* added img docstring

* added some tests

* Updated tests and the code

* Added the requested changes to the arguments of F.elastic_transform

Added random_state and displacement as arguments to F.elastic_transform

* fixed the type error

* Fixed tests and docs

* implemented requested changes

Changes:
1) alpha AND sigma OR displacement must be given as arguments to transforms.functional_tensor.elastic_transform instead of alpha AND sigma AND displacement
2) displacements are accepted in transforms.functional.elastic_transform as np.array and torch.Tensor instead of only accepting torch.Tensor

* ufmt formatting

* trochscript error resolved

replaced torch.from_numpy() to torch.Tensor() to make it compatible to torchscript

* revert to torch.from_numpy()

* updated argument checks and errors

- In F.elastic_transform added check to see if both user inputs img and displacement are either of type PIL Image and ndarray or both of type tensor.
- In F_t.elastic_transform added check if alpha and sigma are None if displacement is given or vice versa.

* fixed seed error

changed torch.seed to torch.manual_seed in F_t.elastic_transform

* Reverted displacement type and other cosmetics

* Other minor improvements

* changed gaussian_blur filter size

changed gaussian_blur filter size
from
4 * int(sigma) + 1
to
int(8 * sigma + 1)
to make it consistent with ernestums implementation

* resolved merge error

* Revert "resolved merge error"

This reverts commit 6a4a4e74ff4d078e2c2753d359185f9a81c415d0.

* resolve merge error

* ufmt formatted

* ufmt formated once again..

* fixed unsupported operand error

* Update API and removed random_state from functional part

* Added default values

* Added ElasticTransform to gallery and updated the docstring

* Updated gallery and added _log_api_usage_once
BTW, matplotlib.pylab is deprecated

* Updated gallery transforms code

* Updates according to review
Co-authored-by: vfdev <vfdev.5@gmail.com>

9430be76

11 Jun, 2022 1 commit

Update _pil_constants.py (#6154) · c02d6ce1

vfdev authored Jun 11, 2022



* Update _pil_constants.py

* Update _pil_constants.py

* Fix flake8

* Fixed two related warnings in tests

* switch dir with hasattr
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

c02d6ce1

06 Jun, 2022 1 commit
- Throw ValueError in draw bounding boxes for invalid boxes (#6123) · 738fa133
  Aditya Oke authored Jun 06, 2022
```
* Fix the issue :)

* Intellij vs ufmt battle

* remove .item()
```
  738fa133
30 May, 2022 1 commit

[proto] Added tests for other padding modes (#6104) · fa37d9bb

vfdev authored May 30, 2022

* Added tests for other padding modes

* Fixed expected mask dtype

* Applied comments from review

fa37d9bb

26 May, 2022 1 commit

add tests for F.pad_bounding_box (#6038) · 1d50dfa0

Philip Meier authored May 26, 2022



* add tests for F.pad_bounding_box

* Added correctness tests for pad and reimplemented bbox op to keep dtype

* Update _geometry.py
Co-authored-by: vfdev <vfdev.5@gmail.com>

1d50dfa0

25 May, 2022 1 commit
- Add .float() before .mean() on test_backbone_utils.py because .mean() dont... · 665b8355
  YosuaMichael authored May 25, 2022
```
Add .float() before .mean() on test_backbone_utils.py because .mean() dont accept integer dtype (#6090)
```
  665b8355
24 May, 2022 1 commit
- Validate against expected files on videos (#6077) · 69ce4523
  Vasilis Vryniotis authored May 24, 2022
```
* Validate against expected files on videos

* Plus tests for autocast
```
  69ce4523
23 May, 2022 3 commits

feat: add functional center crop on mask (#5961) · 3a2631ba

Federico Pozzi authored May 24, 2022



* feat: add functional center crop on mask

* test: add correctness center crop with random segmentation mask

* test: improvements

* test: improvements

* Apply suggestions from code review
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: Federico Pozzi <federico.pozzi@argo.vision>
Co-authored-by: vfdev <vfdev.5@gmail.com>
Co-authored-by: Philip Meier <github.pmeier@posteo.de>

3a2631ba

Remove `(N, T, H, W, C) => (N, T, C, H, W)` from presets (#6058) · 60ce5bf4

Vasilis Vryniotis authored May 23, 2022

* Remove `(N, T, H, W, C) => (N, T, C, H, W)` conversion on presets

* Update docs.

* Fix the tests

* Use `output_format` for `read_video()`

* Use `output_format` for `Kinetics()`

* Adding input descriptions on presets

60ce5bf4

Throw warning for empty masks or box tensors on draw_segmentation_masks and... · 5486b768

oxabz authored May 23, 2022


Throw warning for empty masks or box tensors on draw_segmentation_masks and draw_bounding_boxes (#5857)

* Fixing the IndexError in draw_segmentation_masks

* fixing the bug on draw_bounding_boxes

* Changing fstring to normal string

* Removing unecessary conversion

* Adding test for the change

* Adding a test for draw seqmentation mask

* Fixing small mistake

* Fixing an error in the tests

* removing useless imports

* ufmt
Co-authored-by: LEGRAND Matthieu <legrand.ma@chu-toulouse.fr>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

5486b768

19 May, 2022 2 commits

Minor Swin Transformer fixes (#6054) · e65372e1
Vasilis Vryniotis authored May 19, 2022
```
* Add swin on hubconfig.

* Add swin b/s in the `slow_models` list.
```
e65372e1

add swin_s and swin_b variants and improved swin_t (#6048) · 9d9cfab2

Joao Gomes authored May 19, 2022



* add swin_s and swin_b variants

* fix swin_b params

* fix n parameters and acc numbers

* adding missing acc numbers

* apply ufmt

* Updating `_docs` to reflect training recipe

* Fix exted for swin_b
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

9d9cfab2

18 May, 2022 2 commits

New schema for metrics in weights meta-data (#6047) · 2ec0e847

Nicolas Hug authored May 18, 2022

* Classif models

* Detection

* Segmentation

* quantization

* Video

* optical flow

* tests

* Fix docs

* Fix Video dataset

* Consistency for RAFT dataset names

* use ImageNet-1K

* Use COCO-val2017-VOC-labels for segmentation

* formatting

2ec0e847

Document all remaining pre-trained weights (#6039) · b52f2331

Vasilis Vryniotis authored May 18, 2022

* Adding docs for quantized models.

* Adding docs for video models.

* Adding docs for segmentation models.

* Adding docs for optical flow models.

* Adding docs for detection models.

* Fix typo.

* Make changes from code-review.

b52f2331

17 May, 2022 3 commits

Document all pre-trained Classification weights (#6036) · edb7bbbd

Vasilis Vryniotis authored May 17, 2022

* Improving the auto-gen doc.

* Adding details for AlexNet, ConvNext, DenseNet, EfficientNets, GoogLeNet and InceptionV3.

* Fixing location of `_docs`

* Adding docs in the remaining classification models.

* Fix linter

edb7bbbd

simplify OnlineResource.load (#5990) · b430ba68

Philip Meier authored May 17, 2022

* simplify OnlineResource.load

* [PoC] merge mock data preparation and loading

* Revert "cache mock data based on config"

This reverts commit 5ed6eedef74865e0baa746a375d5ec1f0ab1bde7.

* Revert "[PoC] merge mock data preparation and loading"

This reverts commit d62747962f9ed6a7b0b80849e7c971efabb5d3da.

* remove preprocess returning a new path in favor of querying twice

* address test comments

* clarify comment

* mypy

* use builtin decompress utility

b430ba68

Merge mock data preparation and dataset logic in prototype tests (#6010) · 08c8f0e0

Philip Meier authored May 17, 2022

* merge mock data preparation and loading

* address comments

* fix extra file creation

* remove tmp folder

* inline images meta creation in coco mock data

08c8f0e0

16 May, 2022 1 commit

Deprecate int as interpolation argument type (#5974) · 44252c81

kylematoba authored May 16, 2022

* Requested here https://github.com/pytorch/vision/pull/5898#discussion_r864765799

.

* Fix tests

* ufmt, not black
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

44252c81

12 May, 2022 1 commit
- rely on patched datasets home rather than passing it around (#5998) · 90a729a1
  Philip Meier authored May 12, 2022
```
* rely on patched datasets home rather than passing it around

* add comment
```
  90a729a1
11 May, 2022 1 commit

Allow custom docs for Weight enums and Weights fields (#5988) · e6edcef4

Nicolas Hug authored May 11, 2022



* POC

* Update torchvision/models/resnet.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Fix tests

* ufmt

* Remove useless docstring
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

e6edcef4

09 May, 2022 5 commits

Distance IoU (#5786) · 1ae38297

Yassine Alouini authored May 09, 2022



* [FEAT] Add distance IoU and distance IoU loss + some tests (WIP for tests).

* [FIX] Remove URL from docstring + remove assert since it causes a big performance drop.

* [FIX] eps isn't None.

* [TEST] Update existing box dIoU test + add dIoU loss tests (inspired from cIoU ones).

* [ENH] Some pre-commit fixes + remove print + mypy.

* [ENH] Pass the device in the assertion for the dIoU loss test.

* [FIX] Remove type hints from the dIoU box test.

* [ENH] Refactor box and loss for dIoU functions + fix half tests.

* [FIX] Precommits fix.

* [ENH] Some improvement for the distance IoU tests thanks to code review.

* [ENH] Upcast in distance boxes computation to avoid overflow.

* [ENH] Revert the refactor of distance IoU loss back since it introduced a bug and can be slow.

* Precommit fix.

* [FIX] Few changes introduced by merge conflict.

* Add code reference

* Fix test
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

1ae38297

[proto] Added `center_crop_bounding_box` functional op (#5972) · 7d0d7fd7

vfdev authored May 09, 2022

* [proto] Added `center_crop_bounding_box` functional op

* Fixed mypy issue

* Added one more test case

* More test cases

7d0d7fd7

[proto] Added functional `perspective_bounding_box/segmentation_mask` ops (#5888) · f079f5a5

vfdev authored May 09, 2022

* Added functional `perspective_bounding_box`/`perspective_segmentation_mask` ops

* Added more comments and added a code to assert denom != 0

* Put larger r/a tolerence when matching bboxes

f079f5a5

Update transforms for PIL deprecation (#5898) · 423ddcd0

kylematoba authored May 09, 2022



* Update transforms for PIL deprecation

* Changes agreed at pytorch/vision#5898

* black, sort constants, version check

* Format tests

* Square brackets

* Update torchvision/transforms/_pil_constants.py
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

423ddcd0

Adding resnext101 64x4d model (#5935) · 4c02f103

YosuaMichael authored May 09, 2022

* Add resnext101_64x4d model definition

* Add test for resnext101_64x4d

* Add resnext101_64x4d weight

* Update checkpoint to use EMA weigth

* Add quantization model signature for resnext101_64x4d

* Fix class name and update accuracy using 1 gpu and batch_size=1

* Apply ufmt

* Update the quantized weight and accuracy that we still keep the training log

* Add quantized expect file

* Update docs and fix acc1

* Add recipe for quantized to PR

* Update models.rst

4c02f103

02 May, 2022 1 commit

feat: add functional pad on segmentation mask (#5866) · 104073cc

Federico Pozzi authored May 02, 2022



* feat: add functional pad on segmentation mask

* test: add basic correctness test with random masks

* test: add all padding options

* fix: pr comments

* fix: tests

* refactor: reshape tensor in 4d, then pad
Co-authored-by: Federico Pozzi <federico.pozzi@argo.vision>

104073cc

28 Apr, 2022 4 commits

Added CIOU loss function (#5776) · ecbff88a

Abhijit Deo authored Apr 28, 2022



* added ciou loss

* "formatting with flake8 and ufmt"

* formatting with ufmt and flake8

* minor changes

* changes as per the suggestions

* added reference in torchvision/ops/__init__.py

* sample test

* tests formatted

* added description

* formatting

* edited tests

* changes in tests

* added tests for multiple boxes

* minor edits

* minor edit

* doc added

* minor edits

* Update test_ops.py

* formatting test file

* changes as per the suggestions

* formatting and adding some more tests

* bounding box added

* removed unnecessary comment

* added docstring

* added type annotations

* removed potential bug

* Update torchvision/ops/boxes.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Update torchvision/ops/boxes.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Update test/test_ops.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Philip Meier <github.pmeier@posteo.de>

ecbff88a

fix test_data_loader on Windows and macOS (#5912) · 734ee253

Philip Meier authored Apr 28, 2022



* fix test_data_loader on Windows and macOS

* Update test/test_prototype_builtin_datasets.py
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

734ee253

Skip big model in test to reduce memory usage in CI (#5903) · e0467c64

YosuaMichael authored Apr 28, 2022



* Skip big model in test

* Let get_models_from_module func to read skipped_models from global directly instead of function param to reduce changes

* Also skip regnet_y_128gf

* Only skip test for test_classification_model and create toggle using env var

* Remove unnecessary comment

* Fix comparison of device to use str

* Add logic to test_classification_model directly

* Add kprcnn in autocast flaky list
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

e0467c64

Fix keypointrcnn_resnet50_fpn flaky test (#5911) · a46a323e
Vasilis Vryniotis authored Apr 28, 2022

a46a323e