1. 25 Jul, 2022 1 commit
  2. 22 Jul, 2022 1 commit
  3. 14 Jul, 2022 1 commit
  4. 08 Jul, 2022 1 commit
  5. 07 Jul, 2022 1 commit
    • Adding video accuracy for video_classification reference script (#6241) · 8a45147f
      YosuaMichael authored
      * Add ensembled video accuracy on video reference script
      
      * Change the parser func to be similar to the classification reference
      
      * Fix typo type->dtype
      
      * Use custom kinetics
      
      * Fix dataset to not get start_pts
      
      * Change dataset name, and put video_idx at the back
      
      * Ufmt format
      
      * Use functional softmax, updating meta and use it to overwrite eval param
      
      * Fix typo
      
      * Put the eval parameters on the docs for now
      
      * Change meta for video resnet to use frame-rate 15, also change wording on docs
  6. 05 Jul, 2022 1 commit
  7. 24 Jun, 2022 1 commit
    • Add MViT architecture in TorchVision (#6198) · fb7f9a16
      Vasilis Vryniotis authored
      * Adding MViT v2 architecture (#6105)
      
      * Adding mvitv2 architecture
      
      * Fixing memory issues on tests and minor refactorings.
      
      * Adding input validation
      
      * Adding docs and minor refactoring
      
      * Add `min_temporal_size` in the supported meta-data.
      
      * Switch Tuple[int, int, int] to List[int] to more easily support the 2D case
      
      * Adding more docs and references
      
      * Change naming conventions of classes to follow the same pattern as MobileNetV3
      
      * Fix test breakage.
      
      * Update todos
      
      * Performance optimizations.
      
      * Add support to MViT v1 (#6179)
      
      * Switch implementation to v1 variant.
      
      * Fix docs
      
      * Adding back a v2 pseudovariant
      
      * Changing the way the networks are configured.
      
      * Temporarily removing v2
      
      * Adding weights.
      
      * Expand _squeeze/_unsqueeze to support arbitrary dims.
      
      * Update references script.
      
      * Fix tests.
      
      * Fixing frames and preprocessing.
      
      * Fix std/mean values in transforms.
      
      * Add permanent Dropout and update the weights.
      
      * Update accuracies.
      
      * Fix documentation
      
      * Remove unnecessary expected file.
      
      * Skip big model test
      
      * Rewrite the configuration logic to reduce LOC.
      
      * Fix mypy
  8. 23 Jun, 2022 1 commit
    • Add raft-stereo model to prototype/models (#6107) · 11caf37a
      YosuaMichael authored
      * Add rough raft-stereo implementation on prototype/models
      
      * Add standard raft_stereo builder, and modify context_encoder to be more similar to the original implementation
      
      * Follow original implementation on pre-convolve context
      
      * Fix to make sure we can load the original implementation weights and get the same output
      
      * reusing component from raft
      
      * Make raft_stereo_fast able to load the original implementation weights
      
      * Format with ufmt and update some comment
      
      * Use raft FlowHead
      
      * clean up comments
      
      * Remove unnecessary import and use ufmt format
      
      * Add __all__ and more docs for RaftStereo class
      
      * Only accept param and not module for raft stereo builder
      
      * Cleanup comment
      
      * Adding typing to raft_stereo
      
      * Update some of raft code and reuse on raft stereo
      
      * Use bool instead of int
      
      * Make standard raft_stereo model jit scriptable
      
      * Make the function _make_out_layer using boolean with_block and init the block_layer with identity
      
      * Separate corr_block into two modules for pyramid and building corr features
      
      * Use tuple if input is not variable size, also remove default value if using List
      
      * Format using ufmt and update ConvGRU to not inherit from raft in order to satisfy both jit script and mypy
      
      * Change RaftStereo docs input type
      
      * Ufmt format raft
      
      * revert back convgru to see mypy errors, add test for jit and fx, make the model fx compatible
      
      * ufmt format
      
      * Specify device for new tensor; don't init module then overwrite, use if-else instead
      
      * Ignore mypy problem on override, put back num_iters on forward
      
      * Revert some effort to make it fx compatible but unnecessary now
      
      * refactor code and remove num_iters from RaftStereo constructor
      
      * Change to raft_stereo_realtime, and specify device directly for tensor creation
      
      * Add description for raft_stereo_realtime
      
      * Update the test for raft_stereo
      
      * Fix raft stereo prototype test to properly test jit script
      
      * Ufmt format
      
      * Test against expected file, change name from raft_stereo to raft_stereo_builder to prevent import error
      
      * Revert __init__.py changes
      
      * Add default value for non-list param on model builder
      
      * Add checking on out_with_block length, add more docs on the encoder
      
      * Use base instead of basic since it is more commonly used
      
      * rename expect file to base as well
      
      * rename on test
      
      * Revert the revert of __init__.py, also revert the adding default value to _raft_stereo to follow the standard pattern
      
      * ufmt format __init__.py
  9. 16 Jun, 2022 2 commits
  10. 14 Jun, 2022 1 commit
  11. 10 Jun, 2022 1 commit
  12. 31 May, 2022 1 commit
  13. 26 May, 2022 2 commits
    • Vasilis Vryniotis's avatar
      6aaa2b00
    • Refactor swin transformer so later we can reuse component for 3d version (#6088) · 952f4806
      YosuaMichael authored
      * Use List[int] instead of int for window_size and shift_size
      
      * Make PatchMerging and SwinTransformerBlock able to handle 2d and 3d cases
      
      * Separate patch embedding from SwinTransformer and enable to get model without head by specifying num_heads=None
      
      * Don't use if before padding so it is fx friendly
      
      * Move the handling of window_size edge cases into a separate function and wrap it with torch.fx.wrap so it is excluded from tracing
      
      * Update the weight url to the converted weight with new structure
      
      * Update the accuracy of swin_transformer
      
      * Change assert to Exception and nit
      
      * Make num_classes optional
      
      * Add typing output for _fix_window_and_shift_size function
      
      * init head to None to make it jit scriptable
      
      * Revert the change to make num_classes optional
      
      * Revert unnecessary changes that might be risky
      
      * Remove self.head declaration
  14. 25 May, 2022 1 commit
  15. 20 May, 2022 2 commits
  16. 19 May, 2022 2 commits
  17. 18 May, 2022 4 commits
  18. 17 May, 2022 4 commits
  19. 16 May, 2022 7 commits
  20. 13 May, 2022 2 commits
  21. 12 May, 2022 3 commits