Commits · b9b7cfc602d68e71b4e4039d15dddfe578df9db2 · OpenDAS / vision

27 Jul, 2023 2 commits
- Add --backend and --use-v2 support for segmentation references (#7743) · b9b7cfc6
  Nicolas Hug authored Jul 27, 2023
  
  b9b7cfc6
- Properly handle maskrcnn and keypoints w.r.t. V2 in detection references (#7742) · 8233c9cd
  Nicolas Hug authored Jul 27, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
```
  8233c9cd
13 Jul, 2023 1 commit
- Add --backend and --use-v2 support to detection refs (#7732) · bb3aae7b
  Nicolas Hug authored Jul 13, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
```
  bb3aae7b
07 Jul, 2023 1 commit
- Add --use-v2 support to classification references (#7724) · 08c9938f
  Nicolas Hug authored Jul 07, 2023
  
  08c9938f
12 Jun, 2023 1 commit
- fix bug when using PIL backend in references/classification (#7665) · cff78aa6
  Max Chuprov authored Jun 12, 2023
```
Co-authored-by: Max Chuprov <m.chuprov@expasoft.tech>
```
  cff78aa6
31 May, 2023 1 commit
- Allow classification references to use the tensor backend (#7629) · 0ab7d05c
  Nicolas Hug authored May 31, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
```
  0ab7d05c
13 Feb, 2023 1 commit
- Change default of antialias parameter from None to 'warn' (#7160) · b030e936
  Nicolas Hug authored Feb 13, 2023
```
Co-authored-by: Philip Meier <github.pmeier@posteo.de>
Co-authored-by: vfdev <vfdev.5@gmail.com>
```
  b030e936
01 Feb, 2023 1 commit
- Fix quantized classif reference - missing args (#7072) · a23f0158
  Nicolas Hug authored Feb 01, 2023
  
  a23f0158
11 Jan, 2023 1 commit

Fix typos and grammar errors (#7065) · 7dc5e5bd

Philip Meier authored Jan 11, 2023

* fix typos throughout the code base

* fix grammar

* revert formatting changes to gallery

* revert 'an uXX'

* remove 'number of the best'

7dc5e5bd

13 Dec, 2022 1 commit
- Fix non-existing parameters in docstrings (#7025) · 0dceac02
  Sergii Dymchenko authored Dec 13, 2022
  
  0dceac02
29 Sep, 2022 1 commit
- [FBcode->GH] Rename asset files to remove spaces. (#6666) · 9cece405
  Vasilis Vryniotis authored Sep 29, 2022
  
  9cece405
26 Sep, 2022 1 commit
- Remove the unused/buggy `--train-center-crop` flag from Classification preset (#6642) · a35be97a
  Vasilis Vryniotis authored Sep 26, 2022
```
* Fixing inverted center_crop check on Classification preset

* Remove the `--train-center-crop` flag.
```
  a35be97a
23 Sep, 2022 2 commits

Add stereo train loop (#6605) · 10dafd9b

Ponku authored Sep 23, 2022



* crestereo draft implementation

* minor model fixes. positional embedding changes.

* aligned base configuration with paper

* Adressing comments

* Broke down Adaptive Correlation Layer. Adressed some other commets.

* adressed some nits

* changed search size, added output channels to model attrs

* changed weights naming

* changed from iterations to num_iters

* removed _make_coords, adressed comments

* fixed jit test

* added script files

* added cascaded inference evaluation

* added optimizer option

* minor changes

* Update references/depth/stereo/train.py
Co-authored-by: vfdev <vfdev.5@gmail.com>

* adressed some comments

* change if-else to dict

* added manual resizing for masks and disparities during evaluation

* minor fixes after previous changes

* changed dataloader to be initialised once

* added distributed changes

* changed loader logic

* updated eval script to generate weight API like logs

* improved support for fine-tuning / training resume

* minor changes for finetuning

* updated with transforms from main

* logging distributed deadlock fix

* lint fix

* updated metrics

* weights API log support

* lint fix

* added readme

* updated readme

* updated readme

* read-me update

* remove hardcoded paths. improved valid dataset selection and sync

* removed extras from gitignore
Co-authored-by: Joao Gomes <jdsgomes@fb.com>
Co-authored-by: vfdev <vfdev.5@gmail.com>
Co-authored-by: YosuaMichael <yosuamichaelm@gmail.com>

10dafd9b

MaxVit model (#6342) · 6b1646ca

Ponku authored Sep 23, 2022



* Added maxvit architecture and tests

* rebased + addresed comments

* Revert "rebased + addresed comments"

This reverts commit c5b28398cd48d2f3403c7c8eeefbaba9df05fcfe.

* Re-added model changes after revert

* aligned with partial original implementation

* removed submitit script fixed lint

* mypy fix for too many arguments

* updated old tests

* removed per batch lr scheduler and seed setting

* removed ontap

* added docs, validated weights

* fixed test expect, moved shape assertions in the begging for torch.fx compatibility

* mypy fix

* lint fix

* added legacy interface

* added weight link

* updated docs

* Update references/classification/train.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Update torchvision/models/maxvit.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* adressed comments

* update ra_maginuted and augmix_severity default values

* adressed some comments

* remove input_channels parameter
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

6b1646ca

22 Sep, 2022 1 commit

Add stereo preset transforms (#6549) · 0fcfaa13

Ponku authored Sep 22, 2022



* Added transforms for Stereo Matching

* changed implicit Y scaling to 0.

* Adressed some comments

* addressed type hint

* Added interpolation random interpolation strategy

* Aligned crop get params

* fixed bug in RandomErase

* Adressed scaling and typos

* Adressed occlusion typo

* Changed parameter order in F.erase

* fixed random erase

* Added inference preset transform for stereo matching

* added contiguous reshape to output tensors

* Adressed comments

* Modified the transform preset to use Tuple[int, int]

* adressed NITs

* added grayscale transform, align resize -> mask

* changed max disparity default behaviour

* added fixed resize, changed masking in sparse flow masking

* update to align with argparse

* changed default mask in asymetric pairs

* moved grayscale order

* changed grayscale api to accept to tensor variant

* mypy fix

* changed resize specs

* adressed nits

* added type hints

* mypy fix

* mypy fix

* mypy fix
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

0fcfaa13

21 Sep, 2022 1 commit

Add stereo matching losses (#6554) · 2c1022e3

Ponku authored Sep 21, 2022



* Moved more losses into classes

* Added photometric loss

* quick fix for ssim loss return value

* added references

* replaced with unsqueeze

* renaming variables

* add ref to consistency loss

* made mask optional everywhere. generalised photometric displacement

* smoothness typo

* fixed flow channel selection bug

* aligned with training script
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

2c1022e3

05 Sep, 2022 2 commits
- Update S3D weights (#6537) · 9b432d07
  Vasilis Vryniotis authored Sep 05, 2022
```
* S3D weight deployment

* Update accuracies.

* Address review comments.
```
  9b432d07
- Fix incorrect recipe for SSDlite320 (#6536) · 8e0d1b95
  Vasilis Vryniotis authored Sep 05, 2022
  
  8e0d1b95
18 Aug, 2022 1 commit
- Introduce resize params, fix lr estimation, update docs. (#6444) · 97bb6cb1
  Vasilis Vryniotis authored Aug 18, 2022
  
  97bb6cb1
17 Aug, 2022 1 commit

Fix minor bug on `PolynomialLR` invocation (#6436) · 78d680fe

Vasilis Vryniotis authored Aug 17, 2022

Resolves issue reported at https://github.com/pytorch/vision/commit/6e535db255cee3ce878dd7a54dda01d4ec8932c1#commitcomment-81409388

There seems to be a misspelling on the name of the parameter. This PR updates `total_steps` to `total_iters` which is the correct argument.

78d680fe

12 Aug, 2022 1 commit
- refactor: replace LambdaLR with PolynomialLR in segmentation training script (#6405) · 6e535db2
  Federico Pozzi authored Aug 12, 2022
  
  6e535db2
10 Aug, 2022 1 commit

Add SwinV2 (#6246) · 5521e9d0

Local State authored Aug 10, 2022



* init submit

* fix typo

* support ufmt and mypy

* fix 2 unittest errors

* fix ufmt issue

* Apply suggestions from code review
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* unify codes

* fix meshgrid indexing

* fix a bug

* fix type check

* add type_annotation

* add slow model

* fix device issue

* fix ufmt issue

* add expect pickle file

* fix jit script issue

* fix type check

* keep consistent argument order

* add support for pretrained_window_size

* avoid code duplication

* a better code reuse

* update window_size argument

* make permute and flatten operations modular

* add PatchMergingV2

* modify expect.pkl

* use None as default argument value

* fix type check

* fix indent

* fix window_size (temporarily)

* remove "v2_" related prefix and add v2 builder

* remove v2 builder

* keep default value consistent with official repo

* deprecate dropout

* deprecate pretrained_window_size

* fix dynamic padding edge case

* remove unused imports

* remove doc modification

* Revert "deprecate dropout"

This reverts commit 8a13f932815ae25655c07430d52929f86b1ca479.

* Revert "fix dynamic padding edge case"

This reverts commit 1c7579cb1bd7bf2f0f94907f39bee6ed707a97a8.

* remove unused kwargs

* add downsample docs

* revert block default value

* revert argument order change

* explicitly specify start_dim

* add small and base variants

* add expect files and slow_models

* Add model weights and documentation for swin v2

* fix lint

* fix end of files line
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

5521e9d0

08 Aug, 2022 2 commits

Update references to use the new Model Registration API (#6369) · 1d0786b0

Vasilis Vryniotis authored Aug 08, 2022

* Expose on Hub the public methods of the registration API

* Limit methods and update docs.

* Update references to use the new Model Registration API

1d0786b0

Type fix in transformers.py (#6376) · 96dbada4
vcwai authored Aug 08, 2022
```
Update `RandomPhotometricDistort` `__init__` argument to correct types.
```
96dbada4

22 Jul, 2022 1 commit

Upgrade usort to `1.0.2` and black to 22.3.0 (#5106) · 6ca9c76a

Philip Meier authored Jul 22, 2022



* upgrade usort to

* Also update black

* Actually use 1.0.2

* Apply pre-commit
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

6ca9c76a

07 Jul, 2022 1 commit

Adding video accuracy for video_classification reference script (#6241) · 8a45147f

YosuaMichael authored Jul 07, 2022

* Add ensembled video accuracy on video reference script

* Change the parser func to be similar with classification reference

* Fix typo type->dtype

* Use custom kinetics

* Fix dataset to not getting start_pts

* Change dataset name, and put video_idx at the back

* Ufmt format

* Use functional softmax, updating meta and use it to overwrite eval param

* Fix typo

* Put the eval parameters on the docs for now

* Change meta for video resnet to use frame-rate 15, also change wording on docs

8a45147f

05 Jul, 2022 1 commit
- Update the dataset cache to factor input parameters (#6234) · 8f98aee5
  Vasilis Vryniotis authored Jul 05, 2022
```
* Update the dataset cache to factor in parameters from the args.

* Fix linter
```
  8f98aee5
24 Jun, 2022 1 commit

Add MViT architecture in TorchVision (#6198) · fb7f9a16

Vasilis Vryniotis authored Jun 24, 2022

* Adding MViT v2 architecture (#6105)

* Adding mvitv2 architecture

* Fixing memory issues on tests and minor refactorings.

* Adding input validation

* Adding docs and minor refactoring

* Add `min_temporal_size` in the supported meta-data.

* Switch Tuple[int, int, int] with List[int] to support easier the 2D case

* Adding more docs and references

* Change naming conventions of classes to follow the same pattern as MobileNetV3

* Fix test breakage.

* Update todos

* Performance optimizations.

* Add support to MViT v1 (#6179)

* Switch implementation to v1 variant.

* Fix docs

* Adding back a v2 pseudovariant

* Changing the way the network are configured.

* Temporarily removing v2

* Adding weights.

* Expand _squeeze/_unsqueeze to support arbitrary dims.

* Update references script.

* Fix tests.

* Fixing frames and preprocessing.

* Fix std/mean values in transforms.

* Add permanent Dropout and update the weights.

* Update accuracies.

* Fix documentation

* Remove unnecessary expected file.

* Skip big model test

* Rewrite the configuration logic to reduce LOC.

* Fix mypy

fb7f9a16

21 Jun, 2022 1 commit
- Fix copypaste collate pickle issues (#6181) · 28557e0c
  Vasilis Vryniotis authored Jun 21, 2022
  
  28557e0c
15 Jun, 2022 1 commit

Add SimpleCopyPaste augmentation (#5825) · bbc1aac8

Lezwon Castelino authored Jun 15, 2022



* added simple POC

* added jitter and crop options

* added references

* moved simplecopypaste to detection module

* working POC for simple copy paste in detection

* added comments

* remove transforms from class
updated the labels
added gaussian blur

* removed loop for mask calculation

* replaced Gaussian blur with functional api

* added inplace operations

* added changes to accept tuples instead of tensors

* - make copy paste functional
- make only one copy of batch and target

* add inplace support within copy paste functional

* Updated code for copy-paste transform

* Fixed code formatting

* [skip ci] removed manual thresholding

* Replaced cropping by resizing data to paste

* Removed inplace arg (as useless) and put a check on iscrowd target

* code-formatting

* Updated copypaste op to make it torch scriptable
Added fallbacks to support LSJ

* Fixed flake8

* Updates according to the review
Co-authored-by: vfdev-5 <vfdev.5@gmail.com>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

bbc1aac8

23 May, 2022 1 commit

Remove `(N, T, H, W, C) => (N, T, C, H, W)` from presets (#6058) · 60ce5bf4

Vasilis Vryniotis authored May 23, 2022

* Remove `(N, T, H, W, C) => (N, T, C, H, W)` conversion on presets

* Update docs.

* Fix the tests

* Use `output_format` for `read_video()`

* Use `output_format` for `Kinetics()`

* Adding input descriptions on presets

60ce5bf4

20 May, 2022 1 commit

Use Kinetics instead of Kinetics400 in references (#5787) (#5952) · b969cca7

Bruno Korbar authored May 20, 2022



* Dataset creation now supports "new" version of Kinetics dataset

* remove unnecessary warning for now

* provide kinetics option

* new reading somehow doesn't need BHWC to BCHW transform

* Addressing minor comments

* Adding kinetics deprication warning for the old Kinetics400 class

* lint error

* Update torchvision/datasets/kinetics.py
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

* Updating README

* Remove BHWC to BCHW

* Put warning back

* formatting
Co-authored-by: Bruno Korbar <bkorbar@quansight.com>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

b969cca7

19 May, 2022 1 commit

add swin_s and swin_b variants and improved swin_t (#6048) · 9d9cfab2

Joao Gomes authored May 19, 2022



* add swin_s and swin_b variants

* fix swin_b params

* fix n parameters and acc numbers

* adding missing acc numbers

* apply ufmt

* Updating `_docs` to reflect training recipe

* Fix exted for swin_b
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

9d9cfab2

10 May, 2022 1 commit
- Fix regression on Detection training script (#5985) · 3ec4b949
  Vasilis Vryniotis authored May 10, 2022
  
  3ec4b949
03 May, 2022 1 commit

Reduce variance of evaluation in reference (#5819) · e556640b

YosuaMichael authored May 03, 2022

* Change code to reduce variance in eval

* Remove unnecessary new line

* Fix missing import warnings

* Fix the warning on video_classification

* Fix bug to get len of UniformClipSampler

e556640b

28 Apr, 2022 1 commit

Add shufflenetv2 1.5 and 2.0 weights (#5906) · 5fc36b4f

YosuaMichael authored Apr 28, 2022

* Add shufflenetv2 1.5 and 2.0 weights

* Update recipe

* Add to docs

* Use resize_size=232 for eval and update the result

* Add quantized shufflenetv2 large

* Update docs and readme

* Format with ufmt

* Add to hubconf.py

* Update readme for classification reference

* Fix reference classification readme

* Fix typo on readme

* Update reference/classification/readme

5fc36b4f

27 Apr, 2022 1 commit

Adding Swin Transformer architecture (#5491) · e288f6ca

Hu Ye authored Apr 27, 2022



* add swin transformer

* Update swin_transformer.py

* Update swin_transformer.py

* fix lint

* fix lint

* refactor code

* add swin_transformer

* Update swin_transformer.py

* fix bug

* refactor code

* fix lint

* update init_weights

* move shift_window into attention

* refactor code

* fix bug

* Update swin_transformer.py

* Update swin_transformer.py

* fix lint

* add patch_merge

* fix bug

* Update swin_transformer.py

* Update swin_transformer.py

* Update swin_transformer.py

* refactor code

* Update swin_transformer.py

* refactor code

* fix lint

* refactor code

* add swin_tiny

* add swin_tiny.pkl

* fix lint

* Delete ModelTester.test_swin_tiny_expect.pkl

* add swin_tiny

* add

* add Optional to bias

* update init weights

* update init_weights and add no weight decay

* add no weight decay

* add set_weight_decay

* add set_weight_decay

* fix lint

* fix lint

* add lr_cos_min

* add other swin models

* Update torchvision/models/swin_transformer.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* refactor doc

* Update utils.py

* Update train.py

* Update train.py

* Update swin_transformer.py

* update model builder

* fix lint

* add

* Update torchvision/models/swin_transformer.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* Update torchvision/models/swin_transformer.py
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* update other model

* simplify the model name just like ViT

* add lr_cos_min

* fix lint

* fix lint

* Update swin_transformer.py

* Update swin_transformer.py

* Update swin_transformer.py

* Delete ModelTester.test_swin_tiny_expect.pkl

* add swin_t

* refactor code

* Update train.py

* add swin_s

* ignore a error of mypy

* Update swin_transformer.py

* fix lint

* add swin_b

* add swin_l

* refactor code

* Update train.py

* move relative_position_bias to __init__

* fix formatting

* Revert "fix formatting"

This reverts commit 41faba232668f7ac4273a0cf632c0d0130c7ce9c.

* Revert "move relative_position_bias to __init__"

This reverts commit f0615440bf18617dc0e5dc4839bd5ed27e5ed010.

* refactor code

* Remove deprecated meta-data from `_COMMON_META`

* fix linter

* add pretrained weights for swin_t

* fix format

* apply ufmt

* add documentation

* update references README

* adding new style docs

* update pre-trained weights values

* remove other variants

* fix typo

* Remove expect for the variants not yet supported
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

e288f6ca

20 Apr, 2022 1 commit
- Minor updates to optical flow ref for consistency (#5654) · 92eb12d6
  Nicolas Hug authored Apr 20, 2022
```
* Minor updates to optical flow ref for consistency

* Actually put back name

* linting
```
  92eb12d6
01 Apr, 2022 2 commits

add set_weight_decay to support custom weight decay setting (#5671) · 3925946f

Hu Ye authored Apr 01, 2022



* add set_weight_decay

* Update _utils.py

* refactor code

* fix import

* add set_weight_decay

* fix lint

* fix lint

* replace split_normalization_params with set_weight_decay

* simplfy the code

* refactor code

* refactor code

* fix lint

* remove unused

* Update test_ops.py

* Update train.py

* Update _utils.py

* Update train.py

* add set_weight_decay

* add set_weight_decay

* Update _utils.py

* Update test_ops.py

* Change `--transformer-weight-decay` to `--transformer-embedding-decay`
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

3925946f

Detection recipe enhancements (#5715) · d59398b5
Vasilis Vryniotis authored Apr 01, 2022
```
* Detection recipe enhancements

* Add back nesterov momentum
```
d59398b5