Commits · cc26cd8139c672016b6a578ea8d02138b53eb193 · OpenDAS / vision

17 Nov, 2022 1 commit

Add Video SwinTransformer (#6521) · b1054cbb

Aditya Oke authored Nov 17, 2022



* Just start adding mere copy paste

* Replace d with t and D with T

* Align swin transformer video to image a bit

* Rename d -> t

* align with 2d impl

* align with 2d impl

* Add helpful comments and config for 3d

* add docs

* Add docs

* Add configurations

* Add docs

* Fix bugs

* Fix wrong edit

* Fix wrong edit

* Fix bugs

* Fix bugs

* Fix as per fx suggestions

* Update torchvision/models/video/swin_transformer.py

* Fix as per fx suggestions

* Fix expect files and code

* Update the expect files

* Modify video swin

* Add min size and min temporal size, num params

* Add flops and size

* Fix types

* Fix url recipe
Co-authored-by: Yosua Michael Maranatha <yosuamichael@fb.com>

b1054cbb

19 Aug, 2022 1 commit

Add the S3D architecture to TorchVision (#6412) · 6de7021e

Sophia Zhi authored Aug 19, 2022



* S3D initial commit

* add model builder code and docstrings

* change classifier submodule, populate weights enum

* fix change of block args from List[List[int]] to ints

* add VideoClassification to transforms

* edit weights url for testing, add s3d to models.video init

* norm_layer changes

* norm_layer and args fix

* Overwrite default dropout

* Remove docs from internal submodules.

* Fix tests

* Adding documentation.

* Link doc from main models.rst

* Fix min_temporal_size

* Adding crop/resize parameters in references script

* Release weights.

* Refactor dropout.

* Adding the weights table in the doc
Co-authored-by: Vasilis Vryniotis <datumbox@users.noreply.github.com>
Co-authored-by: Vasilis Vryniotis <vvryniotis@fb.com>

6de7021e

24 Jun, 2022 1 commit

Add MViT architecture in TorchVision (#6198) · fb7f9a16

Vasilis Vryniotis authored Jun 24, 2022

* Adding MViT v2 architecture (#6105)

* Adding mvitv2 architecture

* Fixing memory issues on tests and minor refactorings.

* Adding input validation

* Adding docs and minor refactoring

* Add `min_temporal_size` in the supported meta-data.

* Switch Tuple[int, int, int] with List[int] to support easier the 2D case

* Adding more docs and references

* Change naming conventions of classes to follow the same pattern as MobileNetV3

* Fix test breakage.

* Update todos

* Performance optimizations.

* Add support to MViT v1 (#6179)

* Switch implementation to v1 variant.

* Fix docs

* Adding back a v2 pseudovariant

* Changing the way the network are configured.

* Temporarily removing v2

* Adding weights.

* Expand _squeeze/_unsqueeze to support arbitrary dims.

* Update references script.

* Fix tests.

* Fixing frames and preprocessing.

* Fix std/mean values in transforms.

* Add permanent Dropout and update the weights.

* Update accuracies.

* Fix documentation

* Remove unnecessary expected file.

* Skip big model test

* Rewrite the configuration logic to reduce LOC.

* Fix mypy

fb7f9a16

04 Aug, 2019 1 commit

Move resnet video models to single location (#1190) · 6a834e98

Francisco Massa authored Aug 04, 2019

* [WIP] Minor cleanups on R3d

* Move all models to video/resnet.py

* Remove old files

* Make tests less memory intensive

* Lint

* Fix typo and add pretraing arg to training script

6a834e98

26 Jul, 2019 1 commit

Add VideoModelZoo models (#1130) · 7c95f97a

Bruno Korbar authored Jul 26, 2019

* [0.4_video] models - initial commit

* addressing fmassas inline comments

* pep8 and flake8

* simplify "hacks"

* sorting out latest comments

* nitpick

* Updated tests and constructors

* Added docstrings - ready to merge

7c95f97a