1. 14 Dec, 2023 1 commit
  2. 20 Oct, 2023 1 commit
  3. 25 Sep, 2023 1 commit
  4. 21 Aug, 2023 1 commit
  5. 21 Jun, 2023 1 commit
  6. 02 Jun, 2023 1 commit
  7. 01 Jun, 2023 1 commit
  8. 24 May, 2023 1 commit
  9. 13 Feb, 2023 1 commit
  10. 10 Feb, 2023 1 commit
  11. 09 Feb, 2023 3 commits
  12. 11 Jan, 2023 1 commit
  13. 24 Nov, 2022 1 commit
  14. 10 Oct, 2022 1 commit
  15. 04 Oct, 2022 1 commit
  16. 28 Sep, 2022 1 commit
  17. 20 Sep, 2022 1 commit
  18. 16 Sep, 2022 1 commit
  19. 13 Sep, 2022 1 commit
  20. 12 Sep, 2022 1 commit
  21. 05 Sep, 2022 1 commit
  22. 26 Aug, 2022 1 commit
  23. 19 Aug, 2022 1 commit
    • Sophia Zhi's avatar
      Add the S3D architecture to TorchVision (#6412) · 6de7021e
      Sophia Zhi authored
      
      
      * S3D initial commit
      
      * add model builder code and docstrings
      
      * change classifier submodule, populate weights enum
      
      * fix change of block args from List[List[int]] to ints
      
      * add VideoClassification to transforms
      
      * edit weights url for testing, add s3d to models.video init
      
      * norm_layer changes
      
      * norm_layer and args fix
      
      * Overwrite default dropout
      
      * Remove docs from internal submodules.
      
      * Fix tests
      
      * Adding documentation.
      
      * Link doc from main models.rst
      
      * Fix min_temporal_size
      
      * Adding crop/resize parameters in references script
      
      * Release weights.
      
      * Refactor dropout.
      
      * Adding the weights table in the doc
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarVasilis Vryniotis <vvryniotis@fb.com>
      6de7021e
  24. 10 Aug, 2022 2 commits
    • Local State's avatar
      Add SwinV2 (#6246) · 5521e9d0
      Local State authored
      
      
      * init submit
      
      * fix typo
      
      * support ufmt and mypy
      
      * fix 2 unittest errors
      
      * fix ufmt issue
      
      * Apply suggestions from code review
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * unify codes
      
      * fix meshgrid indexing
      
      * fix a bug
      
      * fix type check
      
      * add type_annotation
      
      * add slow model
      
      * fix device issue
      
      * fix ufmt issue
      
      * add expect pickle file
      
      * fix jit script issue
      
      * fix type check
      
      * keep consistent argument order
      
      * add support for pretrained_window_size
      
      * avoid code duplication
      
      * a better code reuse
      
      * update window_size argument
      
      * make permute and flatten operations modular
      
      * add PatchMergingV2
      
      * modify expect.pkl
      
      * use None as default argument value
      
      * fix type check
      
      * fix indent
      
      * fix window_size (temporarily)
      
      * remove "v2_" related prefix and add v2 builder
      
      * remove v2 builder
      
      * keep default value consistent with official repo
      
      * deprecate dropout
      
      * deprecate pretrained_window_size
      
      * fix dynamic padding edge case
      
      * remove unused imports
      
      * remove doc modification
      
      * Revert "deprecate dropout"
      
      This reverts commit 8a13f932815ae25655c07430d52929f86b1ca479.
      
      * Revert "fix dynamic padding edge case"
      
      This reverts commit 1c7579cb1bd7bf2f0f94907f39bee6ed707a97a8.
      
      * remove unused kwargs
      
      * add downsample docs
      
      * revert block default value
      
      * revert argument order change
      
      * explicitly specify start_dim
      
      * add small and base variants
      
      * add expect files and slow_models
      
      * Add model weights and documentation for swin v2
      
      * fix lint
      
      * fix end of files line
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      5521e9d0
    • Vasilis Vryniotis's avatar
      Add support of MViTv2 video variants (#6373) · 7e8186e0
      Vasilis Vryniotis authored
      * Extending to support MViTv2
      
      * Fix docs, mypy and linter
      
      * Refactor the relative positional code.
      
      * Code refactoring.
      
      * Rename vars.
      
      * Update docs.
      
      * Replace assert with exception.
      
      * Updat docs.
      
      * Minor refactoring.
      
      * Remove the square input limitation.
      
      * Moving methods around.
      
      * Modify the shortcut in the attention layer.
      
      * Add ported weights.
      
      * Introduce a `residual_cls` config on the attention layer.
      
      * Make the patch_embed kernel/padding/stride configurable.
      
      * Apply changes from code-review.
      
      * Remove stale todo.
      7e8186e0
  25. 08 Aug, 2022 1 commit
  26. 01 Aug, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Add registration mechanism for models (#6333) · 0a919dbb
      Vasilis Vryniotis authored
      * Model registration mechanism.
      
      * Add overwrite options to the dataset prototype registration mechanism.
      
      * Adding example models.
      
      * Fix module filtering
      
      * Fix linter
      
      * Fix docs
      
      * Make name optional if same as model builder
      
      * Apply updates from code-review.
      
      * fix minor bug
      
      * Adding getter for model weight enum
      
      * Support both strings and callables on get_model_weight.
      
      * linter fixes
      
      * Fixing mypy.
      
      * Renaming `get_model_weight` to `get_model_weights`
      
      * Registering all classification models.
      
      * Registering all video models.
      
      * Registering all detection models.
      
      * Registering all optical flow models.
      
      * Fixing mypy.
      
      * Registering all segmentation models.
      
      * Registering all quantization models.
      
      * Fixing linter
      
      * Registering all prototype depth perception models.
      
      * Adding tests and updating existing tests.
      
      * Fix linters
      
      * Fix tests.
      
      * Add beta annotation on docs.
      
      * Fix tests.
      
      * Apply changes from code-review.
      
      * Adding documentation.
      
      * Fix docs.
      0a919dbb
  27. 22 Jul, 2022 1 commit
  28. 24 Jun, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Add MViT architecture in TorchVision (#6198) · fb7f9a16
      Vasilis Vryniotis authored
      * Adding MViT v2 architecture (#6105)
      
      * Adding mvitv2 architecture
      
      * Fixing memory issues on tests and minor refactorings.
      
      * Adding input validation
      
      * Adding docs and minor refactoring
      
      * Add `min_temporal_size` in the supported meta-data.
      
      * Switch Tuple[int, int, int] with List[int] to support easier the 2D case
      
      * Adding more docs and references
      
      * Change naming conventions of classes to follow the same pattern as MobileNetV3
      
      * Fix test breakage.
      
      * Update todos
      
      * Performance optimizations.
      
      * Add support to MViT v1 (#6179)
      
      * Switch implementation to v1 variant.
      
      * Fix docs
      
      * Adding back a v2 pseudovariant
      
      * Changing the way the network are configured.
      
      * Temporarily removing v2
      
      * Adding weights.
      
      * Expand _squeeze/_unsqueeze to support arbitrary dims.
      
      * Update references script.
      
      * Fix tests.
      
      * Fixing frames and preprocessing.
      
      * Fix std/mean values in transforms.
      
      * Add permanent Dropout and update the weights.
      
      * Update accuracies.
      
      * Fix documentation
      
      * Remove unnecessary expected file.
      
      * Skip big model test
      
      * Rewrite the configuration logic to reduce LOC.
      
      * Fix mypy
      fb7f9a16
  29. 23 Jun, 2022 1 commit
  30. 24 May, 2022 1 commit
  31. 19 May, 2022 1 commit
  32. 09 May, 2022 1 commit
    • YosuaMichael's avatar
      Adding resnext101 64x4d model (#5935) · 4c02f103
      YosuaMichael authored
      * Add resnext101_64x4d model definition
      
      * Add test for resnext101_64x4d
      
      * Add resnext101_64x4d weight
      
      * Update checkpoint to use EMA weigth
      
      * Add quantization model signature for resnext101_64x4d
      
      * Fix class name and update accuracy using 1 gpu and batch_size=1
      
      * Apply ufmt
      
      * Update the quantized weight and accuracy that we still keep the training log
      
      * Add quantized expect file
      
      * Update docs and fix acc1
      
      * Add recipe for quantized to PR
      
      * Update models.rst
      4c02f103
  33. 28 Apr, 2022 2 commits
  34. 27 Apr, 2022 1 commit
  35. 05 Apr, 2022 2 commits
    • Vasilis Vryniotis's avatar
      Post-paper Detection Optimizations (#5444) · 08cc9a7f
      Vasilis Vryniotis authored
      * Use frozen BN only if pre-trained.
      
      * Add LSJ and ability to from scratch training.
      
      * Fixing formatter
      
      * Adding `--opt` and `--norm-weight-decay` support in Detection.
      
      * Fix error message
      
      * Make ScaleJitter proportional.
      
      * Adding more norm layers in split_normalization_params.
      
      * Add FixedSizeCrop
      
      * Temporary fix for fill values on PIL
      
      * Fix the bug on fill.
      
      * Add RandomShortestSize.
      
      * Skip resize when an augmentation method is used.
      
      * multiscale in [480, 800]
      
      * Add missing star
      
      * Add new RetinaNet variant.
      
      * Add tests.
      
      * Update expected file for old retina
      
      * Fixing tests
      
      * Add FrozenBN to retinav2
      
      * Fix network initialization issues
      
      * Adding BN support in MaskRCNNHeads and FPN
      
      * Adding support of FasterRCNNHeads
      
      * Introduce norm_layers in backbone utils.
      
      * Bigger RPN head + 2x rcnn v2 models.
      
      * Adding gIoU support to retinanet
      
      * Fix assert
      
      * Add back nesterov momentum
      
      * Rename and extend `FastRCNNConvFCHead` to support arbitrary FCs
      
      * Fix linter
      08cc9a7f
    • YosuaMichael's avatar
      Adding the huge vision transformer from SWAG (#5721) · 63576c9f
      YosuaMichael authored
      
      
      * Add vit_b_16_swag
      
      * Better handling idiom for image_size, edit test_extended_model to handle case where number of param differ from default due to different image size input
      
      * Update the accuracy to the experiment result on torchvision model
      
      * Fix typo missing underscore
      
      * raise exception instead of torch._assert, add back publication year (accidentally deleted)
      
      * Add license information on meta and readme
      
      * Improve wording and fix typo for pretrained model license in readme
      
      * Add vit_l_16 weight
      
      * Update README.rst
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * Update the accuracy meta on vit_l_16_swag model to result from our experiment
      
      * Add vit_h_14_swag model
      
      * Add accuracy from experiments
      
      * Add to vit_h_16 model to hubconf.py
      
      * Add docs and expected pkl file for test
      
      * Remove legacy compatibility for ViT_H_14 model
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * Test vit_h_14 with smaller image_size to speedup the test
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      63576c9f