- 17 Nov, 2022 1 commit
-
-
Aditya Oke authored
* Just start adding mere copy paste * Replace d with t and D with T * Align swin transformer video to image a bit * Rename d -> t * align with 2d impl * align with 2d impl * Add helpful comments and config for 3d * add docs * Add docs * Add configurations * Add docs * Fix bugs * Fix wrong edit * Fix wrong edit * Fix bugs * Fix bugs * Fix as per fx suggestions * Update torchvision/models/video/swin_transformer.py * Fix as per fx suggestions * Fix expect files and code * Update the expect files * Modify video swin * Add min size and min temporal size, num params * Add flops and size * Fix types * Fix url recipe Co-authored-by:Yosua Michael Maranatha <yosuamichael@fb.com>
-
- 19 Aug, 2022 1 commit
-
-
Sophia Zhi authored
* S3D initial commit * add model builder code and docstrings * change classifier submodule, populate weights enum * fix change of block args from List[List[int]] to ints * add VideoClassification to transforms * edit weights url for testing, add s3d to models.video init * norm_layer changes * norm_layer and args fix * Overwrite default dropout * Remove docs from internal submodules. * Fix tests * Adding documentation. * Link doc from main models.rst * Fix min_temporal_size * Adding crop/resize parameters in references script * Release weights. * Refactor dropout. * Adding the weights table in the doc Co-authored-by:
Vasilis Vryniotis <datumbox@users.noreply.github.com> Co-authored-by:
Vasilis Vryniotis <vvryniotis@fb.com>
-
- 24 Jun, 2022 1 commit
-
-
Vasilis Vryniotis authored
* Adding MViT v2 architecture (#6105) * Adding mvitv2 architecture * Fixing memory issues on tests and minor refactorings. * Adding input validation * Adding docs and minor refactoring * Add `min_temporal_size` in the supported meta-data. * Switch Tuple[int, int, int] with List[int] to support easier the 2D case * Adding more docs and references * Change naming conventions of classes to follow the same pattern as MobileNetV3 * Fix test breakage. * Update todos * Performance optimizations. * Add support to MViT v1 (#6179) * Switch implementation to v1 variant. * Fix docs * Adding back a v2 pseudovariant * Changing the way the network are configured. * Temporarily removing v2 * Adding weights. * Expand _squeeze/_unsqueeze to support arbitrary dims. * Update references script. * Fix tests. * Fixing frames and preprocessing. * Fix std/mean values in transforms. * Add permanent Dropout and update the weights. * Update accuracies. * Fix documentation * Remove unnecessary expected file. * Skip big model test * Rewrite the configuration logic to reduce LOC. * Fix mypy
-
- 04 Aug, 2019 1 commit
-
-
Francisco Massa authored
* [WIP] Minor cleanups on R3d * Move all models to video/resnet.py * Remove old files * Make tests less memory intensive * Lint * Fix typo and add pretraing arg to training script
-
- 26 Jul, 2019 1 commit
-
-
Bruno Korbar authored
* [0.4_video] models - initial commit * addressing fmassas inline comments * pep8 and flake8 * simplify "hacks" * sorting out latest comments * nitpick * Updated tests and constructors * Added docstrings - ready to merge
-