1. 24 Feb, 2023 5 commits
  2. 23 Feb, 2023 3 commits
  3. 21 Feb, 2023 2 commits
  4. 02 Feb, 2023 1 commit
  5. 01 Feb, 2023 2 commits
  6. 11 Jan, 2023 2 commits
  7. 04 Jan, 2023 1 commit
  8. 17 Nov, 2022 1 commit
    • Aditya Oke's avatar
      Add Video SwinTransformer (#6521) · b1054cbb
      Aditya Oke authored
      
      
      * Just start adding mere copy paste
      
      * Replace d with t and D with T
      
      * Align swin transformer video to image a bit
      
      * Rename d -> t
      
      * align with 2d impl
      
      * align with 2d impl
      
      * Add helpful comments and config for 3d
      
      * add docs
      
      * Add docs
      
      * Add configurations
      
      * Add docs
      
      * Fix bugs
      
      * Fix wrong edit
      
      * Fix wrong edit
      
      * Fix bugs
      
      * Fix bugs
      
      * Fix as per fx suggestions
      
      * Update torchvision/models/video/swin_transformer.py
      
      * Fix as per fx suggestions
      
      * Fix expect files and code
      
      * Update the expect files
      
      * Modify video swin
      
      * Add min size and min temporal size, num params
      
      * Add flops and size
      
      * Fix types
      
      * Fix url recipe
      Co-authored-by: default avatarYosua Michael Maranatha <yosuamichael@fb.com>
      b1054cbb
  9. 11 Nov, 2022 1 commit
    • toni057's avatar
      Adding FLOPs and size to model metadata (#6936) · deba0562
      toni057 authored
      
      
      * Adding FLOPs and size to model metadata
      
      * Adding weight size to quantization models
      
      * Small refactor of rich metadata
      
      * Removing unused code
      
      * Fixing wrong entries
      
      * Adding .DS_Store to gitignore
      
      * Renaming _flops to _ops
      
      * Adding number of operations to quantization models
      
      * Reflecting _flops change to _ops
      
      * Renamed ops and weight size in individual model doc pages
      
      * Linter fixes
      
      * Rounding ops to first decimal
      
      * Rounding num ops and sizes to 3 decimals
      
      * Change naming of columns.
      
      * Update tables
      Co-authored-by: default avatarToni Blaslov <tblaslov@fb.com>
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      deba0562
  10. 04 Nov, 2022 1 commit
  11. 04 Oct, 2022 1 commit
  12. 28 Sep, 2022 1 commit
    • Nicolas Hug's avatar
      Remove deprecated APIs for 0.14 (#6258) · 7b8a6db7
      Nicolas Hug authored
      
      
      * Remove Kinetics400 class
      
      * Remove '2007-test' in VOC
      
      * Remove some MobileNet layer classes
      
      * Remove torchvision/models/segmentation/segmentation.py
      
      * Remove some MultiScaleRoIAlign methods
      
      * Remove torchvision/transforms/_functional_video.py
      
      * Remove torchvision/transforms/_transforms_video.py
      
      * Remove resample parameter in transforms
      
      * Remove 'range' parameter
      
      * Remove 'fill_value' parameter in transforms
      
      * Revert to original warning for C++ models - looks like we should still keep them around?
      
      * pre-commit
      
      * Fix docs
      
      * Remove test/test_transforms_video.py
      
      * Some fixes
      
      * Remove more tests
      
      * Revert changes to C++ models
      
      * Add back _transforms_video and change warning message
      
      * Change back the warning message, and will change the warning message on separate PR
      Co-authored-by: default avatarYosuaMichael <yosuamichaelm@gmail.com>
      Co-authored-by: default avatarYosua Michael Maranatha <yosuamichael@fb.com>
      7b8a6db7
  13. 23 Sep, 2022 1 commit
    • Ponku's avatar
      MaxVit model (#6342) · 6b1646ca
      Ponku authored
      
      
      * Added maxvit architecture and tests
      
      * rebased + addresed comments
      
      * Revert "rebased + addresed comments"
      
      This reverts commit c5b28398cd48d2f3403c7c8eeefbaba9df05fcfe.
      
      * Re-added model changes after revert
      
      * aligned with partial original implementation
      
      * removed submitit script fixed lint
      
      * mypy fix for too many arguments
      
      * updated old tests
      
      * removed per batch lr scheduler and seed setting
      
      * removed ontap
      
      * added docs, validated weights
      
      * fixed test expect, moved shape assertions in the begging for torch.fx compatibility
      
      * mypy fix
      
      * lint fix
      
      * added legacy interface
      
      * added weight link
      
      * updated docs
      
      * Update references/classification/train.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * Update torchvision/models/maxvit.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * adressed comments
      
      * update ra_maginuted and augmix_severity default values
      
      * adressed some comments
      
      * remove input_channels parameter
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      6b1646ca
  14. 09 Sep, 2022 1 commit
  15. 08 Sep, 2022 1 commit
  16. 19 Aug, 2022 1 commit
    • Sophia Zhi's avatar
      Add the S3D architecture to TorchVision (#6412) · 6de7021e
      Sophia Zhi authored
      
      
      * S3D initial commit
      
      * add model builder code and docstrings
      
      * change classifier submodule, populate weights enum
      
      * fix change of block args from List[List[int]] to ints
      
      * add VideoClassification to transforms
      
      * edit weights url for testing, add s3d to models.video init
      
      * norm_layer changes
      
      * norm_layer and args fix
      
      * Overwrite default dropout
      
      * Remove docs from internal submodules.
      
      * Fix tests
      
      * Adding documentation.
      
      * Link doc from main models.rst
      
      * Fix min_temporal_size
      
      * Adding crop/resize parameters in references script
      
      * Release weights.
      
      * Refactor dropout.
      
      * Adding the weights table in the doc
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarVasilis Vryniotis <vvryniotis@fb.com>
      6de7021e
  17. 18 Aug, 2022 1 commit
    • Ponku's avatar
      Added ETH3D stereo dataset (#6349) · 96620011
      Ponku authored
      * Added ETH3D stereo dataset
      
      * Small doc-reformating
      
      * Removed assertions with no use, changed np conversion
      
      * Added ETH3D stereo dataset
      
      * Removed assertions with no use, changed np conversion
      
      * rebased on main
      
      * Revert "Removed assertions with no use, changed np conversion"
      
      This reverts commit 1478a8c056bbed59b6f4a67f78a5cfac84cc9fda.
      
      * Update to np.bool instead of np.bool_
      
      * lint and mypy nit fix
      
      * test nit
      96620011
  18. 17 Aug, 2022 3 commits
    • Ponku's avatar
      Add FallingThings dataset (#6346) · 162267ca
      Ponku authored
      
      
      * Added Falling Things datasets
      
      * Renamed split to variant
      
      * Update torchvision/datasets/_stereo_matching.py
      
      Changed constant formatting
      Co-authored-by: default avatarNicolas Hug <contact@nicolas-hug.com>
      Co-authored-by: default avatarNicolas Hug <contact@nicolas-hug.com>
      162267ca
    • Ponku's avatar
      Added InStereo2k dataset (#6347) · 8120c594
      Ponku authored
      * added SceneFLow variant datasets
      
      * Changed split name to variant name
      
      * removed trailing commented code line
      
      * Added InStereo2k dataset
      
      * Added Sintel Stereo dataset
      
      * small refactor in tests
      
      * Fixed doc formatting.
      
      * candidate fix for FileNotFound on windows test
      
      * Adressing comments
      
      * Added Sintel Stereo dataset
      
      * small refactor in tests
      
      * Fixed doc formatting.
      
      * candidate fix for FileNotFound on windows test
      
      * Adressing comments
      
      * rebased on main
      
      * lint fix
      
      * Added InStereo2k dataset
      8120c594
    • Ponku's avatar
      Add Sintel Stereo dataset (#6348) · db718021
      Ponku authored
      * added SceneFLow variant datasets
      
      * Changed split name to variant name
      
      * removed trailing commented code line
      
      * Added Sintel Stereo dataset
      
      * small refactor in tests
      
      * Fixed doc formatting.
      
      * candidate fix for FileNotFound on windows test
      
      * Adressing comments
      
      * Added Sintel Stereo dataset
      
      * small refactor in tests
      
      * Fixed doc formatting.
      
      * candidate fix for FileNotFound on windows test
      
      * Adressing comments
      
      * rebased on main
      
      * lint fix
      db718021
  19. 10 Aug, 2022 2 commits
    • Local State's avatar
      Add SwinV2 (#6246) · 5521e9d0
      Local State authored
      
      
      * init submit
      
      * fix typo
      
      * support ufmt and mypy
      
      * fix 2 unittest errors
      
      * fix ufmt issue
      
      * Apply suggestions from code review
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * unify codes
      
      * fix meshgrid indexing
      
      * fix a bug
      
      * fix type check
      
      * add type_annotation
      
      * add slow model
      
      * fix device issue
      
      * fix ufmt issue
      
      * add expect pickle file
      
      * fix jit script issue
      
      * fix type check
      
      * keep consistent argument order
      
      * add support for pretrained_window_size
      
      * avoid code duplication
      
      * a better code reuse
      
      * update window_size argument
      
      * make permute and flatten operations modular
      
      * add PatchMergingV2
      
      * modify expect.pkl
      
      * use None as default argument value
      
      * fix type check
      
      * fix indent
      
      * fix window_size (temporarily)
      
      * remove "v2_" related prefix and add v2 builder
      
      * remove v2 builder
      
      * keep default value consistent with official repo
      
      * deprecate dropout
      
      * deprecate pretrained_window_size
      
      * fix dynamic padding edge case
      
      * remove unused imports
      
      * remove doc modification
      
      * Revert "deprecate dropout"
      
      This reverts commit 8a13f932815ae25655c07430d52929f86b1ca479.
      
      * Revert "fix dynamic padding edge case"
      
      This reverts commit 1c7579cb1bd7bf2f0f94907f39bee6ed707a97a8.
      
      * remove unused kwargs
      
      * add downsample docs
      
      * revert block default value
      
      * revert argument order change
      
      * explicitly specify start_dim
      
      * add small and base variants
      
      * add expect files and slow_models
      
      * Add model weights and documentation for swin v2
      
      * fix lint
      
      * fix end of files line
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      5521e9d0
    • Vasilis Vryniotis's avatar
      Add support of MViTv2 video variants (#6373) · 7e8186e0
      Vasilis Vryniotis authored
      * Extending to support MViTv2
      
      * Fix docs, mypy and linter
      
      * Refactor the relative positional code.
      
      * Code refactoring.
      
      * Rename vars.
      
      * Update docs.
      
      * Replace assert with exception.
      
      * Updat docs.
      
      * Minor refactoring.
      
      * Remove the square input limitation.
      
      * Moving methods around.
      
      * Modify the shortcut in the attention layer.
      
      * Add ported weights.
      
      * Introduce a `residual_cls` config on the attention layer.
      
      * Make the patch_embed kernel/padding/stride configurable.
      
      * Apply changes from code-review.
      
      * Remove stale todo.
      7e8186e0
  20. 08 Aug, 2022 1 commit
  21. 03 Aug, 2022 1 commit
  22. 02 Aug, 2022 1 commit
    • Ponku's avatar
      Splitting Stereo Dataset PR(#6269) (#6311) · 6bea4ef5
      Ponku authored
      * Broken down PR(#6269). Added an additional dataset
      
      * Removed some types. Store None instead of "". Merged test util functions.
      
      * minor mypy fixes. minor doc fixes
      
      * reformated docstring
      
      * Added additional line-skips
      6bea4ef5
  23. 01 Aug, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Add registration mechanism for models (#6333) · 0a919dbb
      Vasilis Vryniotis authored
      * Model registration mechanism.
      
      * Add overwrite options to the dataset prototype registration mechanism.
      
      * Adding example models.
      
      * Fix module filtering
      
      * Fix linter
      
      * Fix docs
      
      * Make name optional if same as model builder
      
      * Apply updates from code-review.
      
      * fix minor bug
      
      * Adding getter for model weight enum
      
      * Support both strings and callables on get_model_weight.
      
      * linter fixes
      
      * Fixing mypy.
      
      * Renaming `get_model_weight` to `get_model_weights`
      
      * Registering all classification models.
      
      * Registering all video models.
      
      * Registering all detection models.
      
      * Registering all optical flow models.
      
      * Fixing mypy.
      
      * Registering all segmentation models.
      
      * Registering all quantization models.
      
      * Fixing linter
      
      * Registering all prototype depth perception models.
      
      * Adding tests and updating existing tests.
      
      * Fix linters
      
      * Fix tests.
      
      * Add beta annotation on docs.
      
      * Fix tests.
      
      * Apply changes from code-review.
      
      * Adding documentation.
      
      * Fix docs.
      0a919dbb
  24. 26 Jul, 2022 1 commit
  25. 21 Jul, 2022 1 commit
  26. 24 Jun, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Add MViT architecture in TorchVision (#6198) · fb7f9a16
      Vasilis Vryniotis authored
      * Adding MViT v2 architecture (#6105)
      
      * Adding mvitv2 architecture
      
      * Fixing memory issues on tests and minor refactorings.
      
      * Adding input validation
      
      * Adding docs and minor refactoring
      
      * Add `min_temporal_size` in the supported meta-data.
      
      * Switch Tuple[int, int, int] with List[int] to support easier the 2D case
      
      * Adding more docs and references
      
      * Change naming conventions of classes to follow the same pattern as MobileNetV3
      
      * Fix test breakage.
      
      * Update todos
      
      * Performance optimizations.
      
      * Add support to MViT v1 (#6179)
      
      * Switch implementation to v1 variant.
      
      * Fix docs
      
      * Adding back a v2 pseudovariant
      
      * Changing the way the network are configured.
      
      * Temporarily removing v2
      
      * Adding weights.
      
      * Expand _squeeze/_unsqueeze to support arbitrary dims.
      
      * Update references script.
      
      * Fix tests.
      
      * Fixing frames and preprocessing.
      
      * Fix std/mean values in transforms.
      
      * Add permanent Dropout and update the weights.
      
      * Update accuracies.
      
      * Fix documentation
      
      * Remove unnecessary expected file.
      
      * Skip big model test
      
      * Rewrite the configuration logic to reduce LOC.
      
      * Fix mypy
      fb7f9a16
  27. 14 Jun, 2022 1 commit
  28. 10 Jun, 2022 1 commit