    Add MViT architecture in TorchVision (#6198) · fb7f9a16
    Vasilis Vryniotis authored
    * Adding MViT v2 architecture (#6105)
    
    * Adding mvitv2 architecture
    
    * Fixing memory issues on tests and minor refactorings.
    
    * Adding input validation
    
    * Adding docs and minor refactoring
    
    * Add `min_temporal_size` in the supported meta-data.
    
    * Replace Tuple[int, int, int] with List[int] to more easily support the 2D case
    
    * Adding more docs and references
    
    * Change naming conventions of classes to follow the same pattern as MobileNetV3
    
    * Fix test breakage.
    
    * Update todos
    
    * Performance optimizations.
    
    * Add support for MViT v1 (#6179)
    
    * Switch implementation to v1 variant.
    
    * Fix docs
    
    * Adding back a v2 pseudovariant
    
    * Changing the way the networks are configured.
    
    * Temporarily removing v2
    
    * Adding weights.
    
    * Expand _squeeze/_unsqueeze to support arbitrary dims.
    
    * Update references script.
    
    * Fix tests.
    
    * Fixing frames and preprocessing.
    
    * Fix std/mean values in transforms.
    
    * Add permanent Dropout and update the weights.
    
    * Update accuracies.
    
    * Fix documentation
    
    * Remove unnecessary expected file.
    
    * Skip big model test
    
    * Rewrite the configuration logic to reduce LOC.
    
    * Fix mypy
models.rst 15.5 KB