• NielsRogge's avatar
    Add VideoMAE (#17821) · f9a0008d
    NielsRogge authored
    
    
    * First draft
    
    * Add VideoMAEForVideoClassification
    
    * Improve conversion script
    
    * Add VideoMAEForPreTraining
    
    * Add VideoMAEFeatureExtractor
    
    * Improve VideoMAEFeatureExtractor
    
    * Improve docs
    
    * Add first draft of model tests
    
    * Improve VideoMAEForPreTraining
    
    * Fix base_model_prefix
    
    * Make model take pixel_values of shape (B, T, C, H, W)
    
    * Add loss computation of VideoMAEForPreTraining
    
    * Improve tests
    
    * Improve model tests茅
    
    * Make all tests pass
    
    * Add VideoMAE to main README
    
    * Add tests for VideoMAEFeatureExtractor
    
    * Add integration test
    
    * Improve conversion script
    
    * Rename patch embedding class
    
    * Remove VideoMAELayer from init
    
    * Update design of patch embeddings
    
    * Improve comments
    
    * Improve conversion script
    
    * Improve conversion script
    
    * Add conversion of pretrained model
    
    * Add loss verification of pretrained model
    
    * Add loss verification of unnormalized targets
    
    * Add integration test for pretraining model
    
    * Apply suggestions from code review
    
    * Fix bug to make feature extractor resize only shorter edge
    
    * Address more comments
    
    * Improve normalization of videos
    
    * Add doc examples
    
    * Move constants to dedicated script
    
    * Remove scripts
    
    * Transfer checkpoints, fix docs
    
    * Update script
    
    * Update image mean and std
    
    * Fix doc tests
    
    * Set return_tensors to NumPy by default
    
    * Revert the previous change
    Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
    f9a0008d
README.md 62.4 KB