Commits · 57ae04b4100909a741cda95daa34ef06ee4302eb · OpenDAS / vision

15 Sep, 2022 1 commit
- Fix the error message of `_ovewrite_value_param` (#6585) · 57ae04b4
  Vasilis Vryniotis authored Sep 15, 2022
  
  57ae04b4
14 Sep, 2022 1 commit
- Make the assert message more verbose (#6583) · 321f6552
  YosuaMichael authored Sep 14, 2022
  
  321f6552
13 Sep, 2022 1 commit
- Adding classifications on Video models (#6572) · 3894c353
  Vasilis Vryniotis authored Sep 13, 2022
  
  3894c353
12 Sep, 2022 2 commits
- Add missing `handle_legacy_interface()` calls (#6565) · a89b1957
  Vasilis Vryniotis authored Sep 12, 2022
```
* Add `handle_legacy_interface()` to all new models.

* Fix imports

* Addressing review comments.

* Fix linter

* Addressing further comments.
```
  a89b1957
- Make get_model_builder public (#6560) · cac4e228
  Vasilis Vryniotis authored Sep 12, 2022
  
  cac4e228
09 Sep, 2022 1 commit

Nicolas Granger authored Sep 09, 2022

Changes image size convention to (h, w). I don't think this is used by
models other than SSD, which assumes this convention.

Co-authored-by: Nicolas Granger <nicolas.granger@cea.fr>
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

f36f3514

05 Sep, 2022 1 commit
- Update S3D weights (#6537) · 9b432d07
  Vasilis Vryniotis authored Sep 05, 2022
```
* S3D weight deployment

* Update accuracies.

* Address review comments.
```
  9b432d07
19 Aug, 2022 1 commit

Add the S3D architecture to TorchVision (#6412) · 6de7021e

Sophia Zhi authored Aug 19, 2022



* S3D initial commit

* add model builder code and docstrings

* change classifier submodule, populate weights enum

* fix change of block args from List[List[int]] to ints

* add VideoClassification to transforms

* edit weights url for testing, add s3d to models.video init

* norm_layer changes

* norm_layer and args fix

* Overwrite default dropout

* Remove docs from internal submodules.

* Fix tests

* Adding documentation.

* Link doc from main models.rst

* Fix min_temporal_size

* Adding crop/resize parameters in references script

* Release weights.

* Refactor dropout.

* Adding the weights table in the doc

Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Vasilis Vryniotis <vvryniotis@fb.com>

6de7021e

18 Aug, 2022 1 commit
- Introduce resize params, fix lr estimation, update docs. (#6444) · 97bb6cb1
  Vasilis Vryniotis authored Aug 18, 2022
  
  97bb6cb1
10 Aug, 2022 2 commits

Add SwinV2 (#6246) · 5521e9d0

Local State authored Aug 10, 2022



* init submit

* fix typo

* support ufmt and mypy

* fix 2 unittest errors

* fix ufmt issue

* Apply suggestions from code review
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

* unify codes

* fix meshgrid indexing

* fix a bug

* fix type check

* add type_annotation

* add slow model

* fix device issue

* fix ufmt issue

* add expect pickle file

* fix jit script issue

* fix type check

* keep consistent argument order

* add support for pretrained_window_size

* avoid code duplication

* a better code reuse

* update window_size argument

* make permute and flatten operations modular

* add PatchMergingV2

* modify expect.pkl

* use None as default argument value

* fix type check

* fix indent

* fix window_size (temporarily)

* remove "v2_" related prefix and add v2 builder

* remove v2 builder

* keep default value consistent with official repo

* deprecate dropout

* deprecate pretrained_window_size

* fix dynamic padding edge case

* remove unused imports

* remove doc modification

* Revert "deprecate dropout"

This reverts commit 8a13f932815ae25655c07430d52929f86b1ca479.

* Revert "fix dynamic padding edge case"

This reverts commit 1c7579cb1bd7bf2f0f94907f39bee6ed707a97a8.

* remove unused kwargs

* add downsample docs

* revert block default value

* revert argument order change

* explicitly specify start_dim

* add small and base variants

* add expect files and slow_models

* Add model weights and documentation for swin v2

* fix lint

* fix end of files line

Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Joao Gomes <jdsgomes@fb.com>

5521e9d0

Add support of MViTv2 video variants (#6373) · 7e8186e0

Vasilis Vryniotis authored Aug 10, 2022

* Extending to support MViTv2

* Fix docs, mypy and linter

* Refactor the relative positional code.

* Code refactoring.

* Rename vars.

* Update docs.

* Replace assert with exception.

* Update docs.

* Minor refactoring.

* Remove the square input limitation.

* Moving methods around.

* Modify the shortcut in the attention layer.

* Add ported weights.

* Introduce a `residual_cls` config on the attention layer.

* Make the patch_embed kernel/padding/stride configurable.

* Apply changes from code-review.

* Remove stale todo.

7e8186e0

08 Aug, 2022 1 commit
- Expose on Hub the public methods of the registration API (#6364) · c72b2843
  Vasilis Vryniotis authored Aug 08, 2022
```
* Expose on Hub the public methods of the registration API

* Limit methods and update docs.
```
  c72b2843
02 Aug, 2022 1 commit
- cleanup for box encoding and decoding in FCOS (#6277) · 96fa8204
  Abhijit Deo authored Aug 02, 2022
```
* cleaning up box decoding

* minor nits

* cleanup for box encoding also added.
```
  96fa8204
01 Aug, 2022 2 commits

Remove duplicate doc args. (#6340) · b30fa5c1
Vasilis Vryniotis authored Aug 01, 2022

b30fa5c1

Add registration mechanism for models (#6333) · 0a919dbb

Vasilis Vryniotis authored Aug 01, 2022

* Model registration mechanism.

* Add overwrite options to the dataset prototype registration mechanism.

* Adding example models.

* Fix module filtering

* Fix linter

* Fix docs

* Make name optional if same as model builder

* Apply updates from code-review.

* fix minor bug

* Adding getter for model weight enum

* Support both strings and callables on get_model_weight.

* linter fixes

* Fixing mypy.

* Renaming `get_model_weight` to `get_model_weights`

* Registering all classification models.

* Registering all video models.

* Registering all detection models.

* Registering all optical flow models.

* Fixing mypy.

* Registering all segmentation models.

* Registering all quantization models.

* Fixing linter

* Registering all prototype depth perception models.

* Adding tests and updating existing tests.

* Fix linters

* Fix tests.

* Add beta annotation on docs.

* Fix tests.

* Apply changes from code-review.

* Adding documentation.

* Fix docs.

0a919dbb
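
The registration mechanism described in this commit can be pictured with a small standalone sketch. This is not torchvision's implementation: the registry dictionary, the toy builders, and the error messages are hypothetical, and the function names only mirror those mentioned in the commit messages (for example `get_model_builder`, and "Make name optional if same as model builder").

```python
from typing import Any, Callable, Dict, List, Optional

# Hypothetical in-memory registry mapping a model name to its builder function.
_REGISTRY: Dict[str, Callable[..., Any]] = {}


def register_model(name: Optional[str] = None) -> Callable[[Callable[..., Any]], Callable[..., Any]]:
    """Decorator that records a builder; the name defaults to the function name."""

    def wrapper(fn: Callable[..., Any]) -> Callable[..., Any]:
        key = name if name is not None else fn.__name__
        if key in _REGISTRY:
            raise ValueError(f"A model named {key!r} is already registered.")
        _REGISTRY[key] = fn
        return fn

    return wrapper


def list_models() -> List[str]:
    """Return the registered model names in sorted order."""
    return sorted(_REGISTRY)


def get_model_builder(name: str) -> Callable[..., Any]:
    """Look up a builder by name, raising ValueError for unknown names."""
    try:
        return _REGISTRY[name]
    except KeyError:
        raise ValueError(f"Unknown model {name!r}") from None


def get_model(name: str, **config: Any) -> Any:
    """Build a model by name, forwarding keyword arguments to its builder."""
    return get_model_builder(name)(**config)


# Toy builders standing in for real model constructors (assumption for illustration).
@register_model()
def tiny_net(num_classes: int = 10) -> Dict[str, Any]:
    return {"arch": "tiny_net", "num_classes": num_classes}


@register_model(name="tiny_net_wide")
def _build_wide(num_classes: int = 10) -> Dict[str, Any]:
    return {"arch": "tiny_net_wide", "num_classes": num_classes}
```

With this sketch, `get_model("tiny_net", num_classes=3)` calls the registered builder by name, which is the convenience a registry of this kind is meant to provide.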

25 Jul, 2022 1 commit

Vectorize box encoding in FCOS (#6278) · ba0d665b

Abhijit Deo authored Jul 25, 2022



* initial structure

* fixed types of few variables

* remove the commented code

* list -> List

* encode method will take input as tensors instead of a list of tensors

Co-authored-by: Joao Gomes <jdsgomes@fb.com>

ba0d665b

22 Jul, 2022 1 commit

Upgrade usort to `1.0.2` and black to 22.3.0 (#5106) · 6ca9c76a

Philip Meier authored Jul 22, 2022



* upgrade usort to

* Also update black

* Actually use 1.0.2

* Apply pre-commit

Co-authored-by: Nicolas Hug <contact@nicolas-hug.com>

6ca9c76a

14 Jul, 2022 1 commit

fix Swin Transformer inplace mutation (#6266) · 418d8a6f

Local State authored Jul 14, 2022



* fix inplace mutation

* Different attn shouldn't share the same attribute

* a simpler solution

Co-authored-by: YosuaMichael <yosuamichaelm@gmail.com>

418d8a6f

08 Jul, 2022 1 commit
- Move out the pad operation from PatchMerging in swin transformer to make it fx compatible (#6252) · e75a3337
  YosuaMichael authored Jul 08, 2022
  
  e75a3337
07 Jul, 2022 1 commit

Adding video accuracy for video_classification reference script (#6241) · 8a45147f

YosuaMichael authored Jul 07, 2022

* Add ensembled video accuracy on video reference script

* Change the parser func to be similar with classification reference

* Fix typo type->dtype

* Use custom kinetics

* Fix dataset to not get start_pts

* Change dataset name, and put video_idx at the back

* Ufmt format

* Use functional softmax, updating meta and use it to overwrite eval param

* Fix typo

* Put the eval parameters on the docs for now

* Change meta for video resnet to use frame-rate 15, also change wording on docs

8a45147f
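
A rough sketch of what an "ensembled" video accuracy might look like, in plain Python. The commit messages mention a functional softmax and a `video_idx`, but not the aggregation rule, so summing per-clip softmax probabilities for each video is an assumption here, as are the function names and the input layout.

```python
import math
from typing import Dict, List, Sequence, Tuple


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a single clip's logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def video_accuracy(clips: Sequence[Tuple[int, Sequence[float], int]]) -> float:
    """Top-1 accuracy per video, with clips given as (video_idx, logits, target).

    Assumption: clip probabilities belonging to the same video are summed,
    and the video-level prediction is the argmax of that sum.
    """
    if not clips:
        raise ValueError("clips must not be empty")
    scores: Dict[int, List[float]] = {}
    targets: Dict[int, int] = {}
    for video_idx, logits, target in clips:
        probs = softmax(logits)
        previous = scores.get(video_idx, [0.0] * len(probs))
        scores[video_idx] = [a + p for a, p in zip(previous, probs)]
        targets[video_idx] = target
    correct = 0
    for video_idx, summed in scores.items():
        predicted = max(range(len(summed)), key=summed.__getitem__)
        if predicted == targets[video_idx]:
            correct += 1
    return correct / len(scores)


# Two videos with two clips each; one clip per video is misclassified on its own.
example_clips = [
    (0, [2.0, 0.0], 0),
    (0, [0.0, 0.5], 0),
    (1, [0.0, 3.0], 1),
    (1, [1.0, 0.0], 1),
]
```

In `example_clips`, half of the individual clips are wrong, yet both videos are classified correctly once their clips are combined, which is why a video-level metric can differ from the clip-level one.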

05 Jul, 2022 1 commit

Vectorize box decoding in FCOS (#6203) · b3b74481

Abhijit Deo authored Jul 05, 2022



* basic structure

* added constraints

* fixed errors

* thanks to vadim!

* addressing the comments and added docstring

* Apply suggestions from code review
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

b3b74481

24 Jun, 2022 1 commit

Add MViT architecture in TorchVision (#6198) · fb7f9a16

Vasilis Vryniotis authored Jun 24, 2022

* Adding MViT v2 architecture (#6105)

* Adding mvitv2 architecture

* Fixing memory issues on tests and minor refactorings.

* Adding input validation

* Adding docs and minor refactoring

* Add `min_temporal_size` in the supported meta-data.

* Switch Tuple[int, int, int] with List[int] to more easily support the 2D case

* Adding more docs and references

* Change naming conventions of classes to follow the same pattern as MobileNetV3

* Fix test breakage.

* Update todos

* Performance optimizations.

* Add support to MViT v1 (#6179)

* Switch implementation to v1 variant.

* Fix docs

* Adding back a v2 pseudovariant

* Changing the way the networks are configured.

* Temporarily removing v2

* Adding weights.

* Expand _squeeze/_unsqueeze to support arbitrary dims.

* Update references script.

* Fix tests.

* Fixing frames and preprocessing.

* Fix std/mean values in transforms.

* Add permanent Dropout and update the weights.

* Update accuracies.

* Fix documentation

* Remove unnecessary expected file.

* Skip big model test

* Rewrite the configuration logic to reduce LOC.

* Fix mypy

fb7f9a16

23 Jun, 2022 1 commit

Add raft-stereo model to prototype/models (#6107) · 11caf37a

YosuaMichael authored Jun 23, 2022

* Add rough raft-stereo implementation on prototype/models

* Add standard raft_stereo builder, and modify context_encoder to be more similar with original implementation

* Follow original implementation on pre-convolve context

* Fix to make sure we can load the original implementation's weights and get the same output

* reusing component from raft

* Make the raft_stereo_fast able to load original weight implementation

* Format with ufmt and update some comment

* Use raft FlowHead

* clean up comments

* Remove unnecessary import and use ufmt format

* Add __all__ and more docs for RaftStereo class

* Only accept param and not module for raft stereo builder

* Cleanup comment

* Adding typing to raft_stereo

* Update some of raft code and reuse on raft stereo

* Use bool instead of int

* Make standard raft_stereo model jit scriptable

* Make the function _make_out_layer using boolean with_block and init the block_layer with identity

* Separate corr_block into two modules for pyramid and building corr features

* Use tuple if input is not variable size, also remove default value if using List

* Format using ufmt and update ConvGRU to not inherit from raft in order to satisfy both jit script and mypy

* Change RaftStereo docs input type

* Ufmt format raft

* revert back convgru to see mypy errors, add test for jit and fx, make the model fx compatible

* ufmt format

* Specify device for new tensor, don't init module then overwrite and put if-else instead

* Ignore mypy problem on override, put back num_iters on forward

* Revert some effort to make it fx compatible but unnecessary now

* refactor code and remove num_iters from RaftStereo constructor

* Change to raft_stereo_realtime, and specify device directly for tensor creation

* Add description for raft_stereo_realtime

* Update the test for raft_stereo

* Fix raft stereo prototype test to properly test jit script

* Ufmt format

* Test against expected file, change name from raft_stereo to raft_stereo_builder to prevent import error

* Revert __init__.py changes

* Add default value for non-list param on model builder

* Add checking on out_with_block length, add more docs on the encoder

* Use base instead of basic since it is more commonly used

* rename expect file to base as well

* rename on test

* Revert the revert of __init__.py, also revert the adding default value to _raft_stereo to follow the standard pattern

* ufmt format __init__.py

11caf37a

16 Jun, 2022 2 commits
- Fix all broken URLs (#6176) · 12bb8873
  Nicolas Hug authored Jun 16, 2022
  
  12bb8873
- Adding `_log_api_usage_once` to Swin's reusable components. (#6174) · ac5dc51a
  Vasilis Vryniotis authored Jun 16, 2022
  
  ac5dc51a
14 Jun, 2022 1 commit

Add new `.. betastatus::` directive and document Beta APIs (#6115) · 0e688ce0

Nicolas Hug authored Jun 14, 2022

* Add new .. betastatus:: directive to document Beta APIs

* Also add it for the fine-grained video API

* Add directive for all builders and pages of Detection module

* Also segmentation and video models

0e688ce0

10 Jun, 2022 1 commit
- Fix ViT and Resnext docs (#6150) · fee6d12c
  Nicolas Hug authored Jun 10, 2022
  
  fee6d12c
31 May, 2022 1 commit
- Add missing `_version` to the MLPBlock (#6113) · ba4b0db5
  Vasilis Vryniotis authored May 31, 2022
```
* Add missing `_version` to the MLPBlock

* fix linter
```
  ba4b0db5
26 May, 2022 2 commits

Change weights return type to Mapping (#6097) · 6aaa2b00
Vasilis Vryniotis authored May 26, 2022

6aaa2b00

Refactor swin transformer so later we can reuse component for 3d version (#6088) · 952f4806

YosuaMichael authored May 26, 2022

* Use List[int] instead of int for window_size and shift_size

* Make PatchMerging and SwinTransformerBlock able to handle 2d and 3d cases

* Separate patch embedding from SwinTransformer and enable to get model without head by specifying num_heads=None

* Don't use `if` before padding so it is fx friendly

* Put the handling on window_size edge cases on separate function and wrap with torch.fx.wrap so it is excluded from tracing

* Update the weight url to the converted weight with new structure

* Update the accuracy of swin_transformer

* Change assert to Exception and nit

* Make num_classes optional

* Add typing output for _fix_window_and_shift_size function

* init head to None to make it jit scriptable

* Revert the change to make num_classes optional

* Revert unnecessary changes that might be risky

* Remove self.head declaration

952f4806

25 May, 2022 1 commit
- Fix bug by checking if norm_layer weight is None before init (#6082) · 0f971f64
  YosuaMichael authored May 25, 2022
  
  0f971f64
20 May, 2022 2 commits

Document ResNet architecture tweak (#5977) · 37665a0b

puhuk authored May 20, 2022



* To resolve issue #5964

Add note for resnet architecture

* Update resnet.py

* Update resnet.py

* Update resnet.rst

* Fix stylings

* Add the same notes on model builders

* Improve description

* Apply the change everywhere

* Remove trailing space

Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

37665a0b

Move Permute layer to ops. (#6055) · d57f929d
Vasilis Vryniotis authored May 20, 2022

d57f929d

19 May, 2022 2 commits

Adding multi-layer perceptron in ops (#6053) · 77cad127

Vasilis Vryniotis authored May 19, 2022

* Adding an MLP block.

* Adding documentation

* Update typos.

* Fix inplace for Dropout.

* Apply recommendations from code review.

* Making changes on pre-trained models.

* Fix linter

77cad127

add swin_s and swin_b variants and improved swin_t (#6048) · 9d9cfab2

Joao Gomes authored May 19, 2022



* add swin_s and swin_b variants

* fix swin_b params

* fix n parameters and acc numbers

* adding missing acc numbers

* apply ufmt

* Updating `_docs` to reflect training recipe

* Fix exted for swin_b

Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

9d9cfab2

18 May, 2022 4 commits

Doc revamp for optical flow models (#5895) · 5985504c
Nicolas Hug authored May 18, 2022
```
* Doc revamp for optical flow models

* Some more
```
5985504c

New schema for metrics in weights meta-data (#6047) · 2ec0e847

Nicolas Hug authored May 18, 2022

* Classif models

* Detection

* Segmentation

* quantization

* Video

* optical flow

* tests

* Fix docs

* Fix Video dataset

* Consistency for RAFT dataset names

* use ImageNet-1K

* Use COCO-val2017-VOC-labels for segmentation

* formatting

2ec0e847

Fix resnext docs (#6044) · ff8fae92

Aditya Oke authored May 18, 2022


Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>

ff8fae92

Document all remaining pre-trained weights (#6039) · b52f2331

Vasilis Vryniotis authored May 18, 2022

* Adding docs for quantized models.

* Adding docs for video models.

* Adding docs for segmentation models.

* Adding docs for optical flow models.

* Adding docs for detection models.

* Fix typo.

* Make changes from code-review.

b52f2331

17 May, 2022 1 commit

Document all pre-trained Classification weights (#6036) · edb7bbbd

Vasilis Vryniotis authored May 17, 2022

* Improving the auto-gen doc.

* Adding details for AlexNet, ConvNext, DenseNet, EfficientNets, GoogLeNet and InceptionV3.

* Fixing location of `_docs`

* Adding docs in the remaining classification models.

* Fix linter

edb7bbbd