1. 27 Jul, 2023 2 commits
  2. 13 Jul, 2023 1 commit
  3. 07 Jul, 2023 1 commit
  4. 12 Jun, 2023 1 commit
  5. 31 May, 2023 1 commit
  6. 13 Feb, 2023 1 commit
  7. 01 Feb, 2023 1 commit
  8. 11 Jan, 2023 1 commit
  9. 13 Dec, 2022 1 commit
  10. 29 Sep, 2022 1 commit
  11. 26 Sep, 2022 1 commit
  12. 23 Sep, 2022 2 commits
    • Ponku's avatar
      Add stereo train loop (#6605) · 10dafd9b
      Ponku authored
      
      
      * crestereo draft implementation
      
      * minor model fixes. positional embedding changes.
      
      * aligned base configuration with paper
      
      * Adressing comments
      
      * Broke down Adaptive Correlation Layer. Adressed some other commets.
      
      * adressed some nits
      
      * changed search size, added output channels to model attrs
      
      * changed weights naming
      
      * changed from iterations to num_iters
      
      * removed _make_coords, adressed comments
      
      * fixed jit test
      
      * added script files
      
      * added cascaded inference evaluation
      
      * added optimizer option
      
      * minor changes
      
      * Update references/depth/stereo/train.py
      Co-authored-by: default avatarvfdev <vfdev.5@gmail.com>
      
      * adressed some comments
      
      * change if-else to dict
      
      * added manual resizing for masks and disparities during evaluation
      
      * minor fixes after previous changes
      
      * changed dataloader to be initialised once
      
      * added distributed changes
      
      * changed loader logic
      
      * updated eval script to generate weight API like logs
      
      * improved support for fine-tuning / training resume
      
      * minor changes for finetuning
      
      * updated with transforms from main
      
      * logging distributed deadlock fix
      
      * lint fix
      
      * updated metrics
      
      * weights API log support
      
      * lint fix
      
      * added readme
      
      * updated readme
      
      * updated readme
      
      * read-me update
      
      * remove hardcoded paths. improved valid dataset selection and sync
      
      * removed extras from gitignore
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      Co-authored-by: default avatarvfdev <vfdev.5@gmail.com>
      Co-authored-by: default avatarYosuaMichael <yosuamichaelm@gmail.com>
      10dafd9b
    • Ponku's avatar
      MaxVit model (#6342) · 6b1646ca
      Ponku authored
      
      
      * Added maxvit architecture and tests
      
      * rebased + addresed comments
      
      * Revert "rebased + addresed comments"
      
      This reverts commit c5b28398cd48d2f3403c7c8eeefbaba9df05fcfe.
      
      * Re-added model changes after revert
      
      * aligned with partial original implementation
      
      * removed submitit script fixed lint
      
      * mypy fix for too many arguments
      
      * updated old tests
      
      * removed per batch lr scheduler and seed setting
      
      * removed ontap
      
      * added docs, validated weights
      
      * fixed test expect, moved shape assertions in the begging for torch.fx compatibility
      
      * mypy fix
      
      * lint fix
      
      * added legacy interface
      
      * added weight link
      
      * updated docs
      
      * Update references/classification/train.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * Update torchvision/models/maxvit.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * adressed comments
      
      * update ra_maginuted and augmix_severity default values
      
      * adressed some comments
      
      * remove input_channels parameter
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      6b1646ca
  13. 22 Sep, 2022 1 commit
    • Ponku's avatar
      Add stereo preset transforms (#6549) · 0fcfaa13
      Ponku authored
      
      
      * Added transforms for Stereo Matching
      
      * changed implicit Y scaling to 0.
      
      * Adressed some comments
      
      * addressed type hint
      
      * Added interpolation random interpolation strategy
      
      * Aligned crop get params
      
      * fixed bug in RandomErase
      
      * Adressed scaling and typos
      
      * Adressed occlusion typo
      
      * Changed parameter order in F.erase
      
      * fixed random erase
      
      * Added inference preset transform for stereo matching
      
      * added contiguous reshape to output tensors
      
      * Adressed comments
      
      * Modified the transform preset to use Tuple[int, int]
      
      * adressed NITs
      
      * added grayscale transform, align resize -> mask
      
      * changed max disparity default behaviour
      
      * added fixed resize, changed masking in sparse flow masking
      
      * update to align with argparse
      
      * changed default mask in asymetric pairs
      
      * moved grayscale order
      
      * changed grayscale api to accept to tensor variant
      
      * mypy fix
      
      * changed resize specs
      
      * adressed nits
      
      * added type hints
      
      * mypy fix
      
      * mypy fix
      
      * mypy fix
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      0fcfaa13
  14. 21 Sep, 2022 1 commit
    • Ponku's avatar
      Add stereo matching losses (#6554) · 2c1022e3
      Ponku authored
      
      
      * Moved more losses into classes
      
      * Added photometric loss
      
      * quick fix for ssim loss return value
      
      * added references
      
      * replaced with unsqueeze
      
      * renaming variables
      
      * add ref to consistency loss
      
      * made mask optional everywhere. generalised photometric displacement
      
      * smoothness typo
      
      * fixed flow channel selection bug
      
      * aligned with training script
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      2c1022e3
  15. 05 Sep, 2022 2 commits
  16. 18 Aug, 2022 1 commit
  17. 17 Aug, 2022 1 commit
  18. 12 Aug, 2022 1 commit
  19. 10 Aug, 2022 1 commit
    • Local State's avatar
      Add SwinV2 (#6246) · 5521e9d0
      Local State authored
      
      
      * init submit
      
      * fix typo
      
      * support ufmt and mypy
      
      * fix 2 unittest errors
      
      * fix ufmt issue
      
      * Apply suggestions from code review
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * unify codes
      
      * fix meshgrid indexing
      
      * fix a bug
      
      * fix type check
      
      * add type_annotation
      
      * add slow model
      
      * fix device issue
      
      * fix ufmt issue
      
      * add expect pickle file
      
      * fix jit script issue
      
      * fix type check
      
      * keep consistent argument order
      
      * add support for pretrained_window_size
      
      * avoid code duplication
      
      * a better code reuse
      
      * update window_size argument
      
      * make permute and flatten operations modular
      
      * add PatchMergingV2
      
      * modify expect.pkl
      
      * use None as default argument value
      
      * fix type check
      
      * fix indent
      
      * fix window_size (temporarily)
      
      * remove "v2_" related prefix and add v2 builder
      
      * remove v2 builder
      
      * keep default value consistent with official repo
      
      * deprecate dropout
      
      * deprecate pretrained_window_size
      
      * fix dynamic padding edge case
      
      * remove unused imports
      
      * remove doc modification
      
      * Revert "deprecate dropout"
      
      This reverts commit 8a13f932815ae25655c07430d52929f86b1ca479.
      
      * Revert "fix dynamic padding edge case"
      
      This reverts commit 1c7579cb1bd7bf2f0f94907f39bee6ed707a97a8.
      
      * remove unused kwargs
      
      * add downsample docs
      
      * revert block default value
      
      * revert argument order change
      
      * explicitly specify start_dim
      
      * add small and base variants
      
      * add expect files and slow_models
      
      * Add model weights and documentation for swin v2
      
      * fix lint
      
      * fix end of files line
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      5521e9d0
  20. 08 Aug, 2022 2 commits
  21. 22 Jul, 2022 1 commit
  22. 07 Jul, 2022 1 commit
    • YosuaMichael's avatar
      Adding video accuracy for video_classification reference script (#6241) · 8a45147f
      YosuaMichael authored
      * Add ensembled video accuracy on video reference script
      
      * Change the parser func to be similar with classification reference
      
      * Fix typo type->dtype
      
      * Use custom kinetics
      
      * Fix dataset to not getting start_pts
      
      * Change dataset name, and put video_idx at the back
      
      * Ufmt format
      
      * Use functional softmax, updating meta and use it to overwrite eval param
      
      * Fix typo
      
      * Put the eval parameters on the docs for now
      
      * Change meta for video resnet to use frame-rate 15, also change wording on docs
      8a45147f
  23. 05 Jul, 2022 1 commit
  24. 24 Jun, 2022 1 commit
    • Vasilis Vryniotis's avatar
      Add MViT architecture in TorchVision (#6198) · fb7f9a16
      Vasilis Vryniotis authored
      * Adding MViT v2 architecture (#6105)
      
      * Adding mvitv2 architecture
      
      * Fixing memory issues on tests and minor refactorings.
      
      * Adding input validation
      
      * Adding docs and minor refactoring
      
      * Add `min_temporal_size` in the supported meta-data.
      
      * Switch Tuple[int, int, int] with List[int] to support easier the 2D case
      
      * Adding more docs and references
      
      * Change naming conventions of classes to follow the same pattern as MobileNetV3
      
      * Fix test breakage.
      
      * Update todos
      
      * Performance optimizations.
      
      * Add support to MViT v1 (#6179)
      
      * Switch implementation to v1 variant.
      
      * Fix docs
      
      * Adding back a v2 pseudovariant
      
      * Changing the way the network are configured.
      
      * Temporarily removing v2
      
      * Adding weights.
      
      * Expand _squeeze/_unsqueeze to support arbitrary dims.
      
      * Update references script.
      
      * Fix tests.
      
      * Fixing frames and preprocessing.
      
      * Fix std/mean values in transforms.
      
      * Add permanent Dropout and update the weights.
      
      * Update accuracies.
      
      * Fix documentation
      
      * Remove unnecessary expected file.
      
      * Skip big model test
      
      * Rewrite the configuration logic to reduce LOC.
      
      * Fix mypy
      fb7f9a16
  25. 21 Jun, 2022 1 commit
  26. 15 Jun, 2022 1 commit
    • Lezwon Castelino's avatar
      Add SimpleCopyPaste augmentation (#5825) · bbc1aac8
      Lezwon Castelino authored
      
      
      * added simple POC
      
      * added jitter and crop options
      
      * added references
      
      * moved simplecopypaste to detection module
      
      * working POC for simple copy paste in detection
      
      * added comments
      
      * remove transforms from class
      updated the labels
      added gaussian blur
      
      * removed loop for mask calculation
      
      * replaced Gaussian blur with functional api
      
      * added inplace operations
      
      * added changes to accept tuples instead of tensors
      
      * - make copy paste functional
      - make only one copy of batch and target
      
      * add inplace support within copy paste functional
      
      * Updated code for copy-paste transform
      
      * Fixed code formatting
      
      * [skip ci] removed manual thresholding
      
      * Replaced cropping by resizing data to paste
      
      * Removed inplace arg (as useless) and put a check on iscrowd target
      
      * code-formatting
      
      * Updated copypaste op to make it torch scriptable
      Added fallbacks to support LSJ
      
      * Fixed flake8
      
      * Updates according to the review
      Co-authored-by: default avatarvfdev-5 <vfdev.5@gmail.com>
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      bbc1aac8
  27. 23 May, 2022 1 commit
  28. 20 May, 2022 1 commit
  29. 19 May, 2022 1 commit
  30. 10 May, 2022 1 commit
  31. 03 May, 2022 1 commit
  32. 28 Apr, 2022 1 commit
    • YosuaMichael's avatar
      Add shufflenetv2 1.5 and 2.0 weights (#5906) · 5fc36b4f
      YosuaMichael authored
      * Add shufflenetv2 1.5 and 2.0 weights
      
      * Update recipe
      
      * Add to docs
      
      * Use resize_size=232 for eval and update the result
      
      * Add quantized shufflenetv2 large
      
      * Update docs and readme
      
      * Format with ufmt
      
      * Add to hubconf.py
      
      * Update readme for classification reference
      
      * Fix reference classification readme
      
      * Fix typo on readme
      
      * Update reference/classification/readme
      5fc36b4f
  33. 27 Apr, 2022 1 commit
    • Hu Ye's avatar
      Adding Swin Transformer architecture (#5491) · e288f6ca
      Hu Ye authored
      
      
      * add swin transformer
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * fix lint
      
      * fix lint
      
      * refactor code
      
      * add swin_transformer
      
      * Update swin_transformer.py
      
      * fix bug
      
      * refactor code
      
      * fix lint
      
      * update init_weights
      
      * move shift_window into attention
      
      * refactor code
      
      * fix bug
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * fix lint
      
      * add patch_merge
      
      * fix bug
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * refactor code
      
      * Update swin_transformer.py
      
      * refactor code
      
      * fix lint
      
      * refactor code
      
      * add swin_tiny
      
      * add swin_tiny.pkl
      
      * fix lint
      
      * Delete ModelTester.test_swin_tiny_expect.pkl
      
      * add swin_tiny
      
      * add
      
      * add Optional to bias
      
      * update init weights
      
      * update init_weights and add no weight decay
      
      * add no weight decay
      
      * add set_weight_decay
      
      * add set_weight_decay
      
      * fix lint
      
      * fix lint
      
      * add lr_cos_min
      
      * add other swin models
      
      * Update torchvision/models/swin_transformer.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * refactor doc
      
      * Update utils.py
      
      * Update train.py
      
      * Update train.py
      
      * Update swin_transformer.py
      
      * update model builder
      
      * fix lint
      
      * add
      
      * Update torchvision/models/swin_transformer.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * Update torchvision/models/swin_transformer.py
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      
      * update other model
      
      * simplify the model name just like ViT
      
      * add lr_cos_min
      
      * fix lint
      
      * fix lint
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * Update swin_transformer.py
      
      * Delete ModelTester.test_swin_tiny_expect.pkl
      
      * add swin_t
      
      * refactor code
      
      * Update train.py
      
      * add swin_s
      
      * ignore a error of mypy
      
      * Update swin_transformer.py
      
      * fix lint
      
      * add swin_b
      
      * add swin_l
      
      * refactor code
      
      * Update train.py
      
      * move relative_position_bias to __init__
      
      * fix formatting
      
      * Revert "fix formatting"
      
      This reverts commit 41faba232668f7ac4273a0cf632c0d0130c7ce9c.
      
      * Revert "move relative_position_bias to __init__"
      
      This reverts commit f0615440bf18617dc0e5dc4839bd5ed27e5ed010.
      
      * refactor code
      
      * Remove deprecated meta-data from `_COMMON_META`
      
      * fix linter
      
      * add pretrained weights for swin_t
      
      * fix format
      
      * apply ufmt
      
      * add documentation
      
      * update references README
      
      * adding new style docs
      
      * update pre-trained weights values
      
      * remove other variants
      
      * fix typo
      
      * Remove expect for the variants not yet supported
      Co-authored-by: default avatarVasilis Vryniotis <datumbox@users.noreply.github.com>
      Co-authored-by: default avatarJoao Gomes <jdsgomes@fb.com>
      e288f6ca
  34. 20 Apr, 2022 1 commit
  35. 01 Apr, 2022 2 commits