1. 04 Aug, 2022 7 commits
    • NielsRogge's avatar
      Add VideoMAE (#17821) · f9a0008d
      NielsRogge authored
      
      
      * First draft
      
      * Add VideoMAEForVideoClassification
      
      * Improve conversion script
      
      * Add VideoMAEForPreTraining
      
      * Add VideoMAEFeatureExtractor
      
      * Improve VideoMAEFeatureExtractor
      
      * Improve docs
      
      * Add first draft of model tests
      
      * Improve VideoMAEForPreTraining
      
      * Fix base_model_prefix
      
      * Make model take pixel_values of shape (B, T, C, H, W)
      
      * Add loss computation of VideoMAEForPreTraining
      
      * Improve tests
      
      * Improve model tests茅
      
      * Make all tests pass
      
      * Add VideoMAE to main README
      
      * Add tests for VideoMAEFeatureExtractor
      
      * Add integration test
      
      * Improve conversion script
      
      * Rename patch embedding class
      
      * Remove VideoMAELayer from init
      
      * Update design of patch embeddings
      
      * Improve comments
      
      * Improve conversion script
      
      * Improve conversion script
      
      * Add conversion of pretrained model
      
      * Add loss verification of pretrained model
      
      * Add loss verification of unnormalized targets
      
      * Add integration test for pretraining model
      
      * Apply suggestions from code review
      
      * Fix bug to make feature extractor resize only shorter edge
      
      * Address more comments
      
      * Improve normalization of videos
      
      * Add doc examples
      
      * Move constants to dedicated script
      
      * Remove scripts
      
      * Transfer checkpoints, fix docs
      
      * Update script
      
      * Update image mean and std
      
      * Fix doc tests
      
      * Set return_tensors to NumPy by default
      
      * Revert the previous change
      Co-authored-by: default avatarNiels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
      f9a0008d
    • Thomas Wang's avatar
    • Sylvain Gugger's avatar
      df28de05
    • Kian Sierra McGettigan's avatar
      Update no trainer scripts for multiple-choice (#18468) · 330247ed
      Kian Sierra McGettigan authored
      * swag_no_trainer updated for with gather_metrics
      
      * Removed unused variable samples_seen
      330247ed
    • Michael Benayoun's avatar
      HFTracer.trace can now take callables and torch.nn.Module (#18457) · c74befc9
      Michael Benayoun authored
      * Enable HFTracer to trace with custom dummy inputs instead of pre-computed ones
      
      * Add HFTracer.trace docstring, and make it possible to handle callable and torch.nn.Module in general
      
      * Remove pdb comment
      
      * Apply suggestions
      c74befc9
    • nlpcat's avatar
      change shape to support dynamic batch input in tf.function XLA generate for tf serving (#18372) · fc1d841b
      nlpcat authored
      
      
      * change shape to support dynamic batch input in tf.generate
      
      * add tests
      Co-authored-by: default avatarnlpcatcode <nlpcodecat@gmail.com>
      fc1d841b
    • Thomas Wang's avatar
      [BLOOM] Clean modeling code (#18344) · b69a62d5
      Thomas Wang authored
      
      
      * Cleanup some code
      
      * Improve signatures
      
      * Try to reduce the number of reshape/copies
      
      * I don't think we actually need the layer_num scaling trick
      
      * No need for duplication
      
      * Try to fix beam_search
      
      * Fix beam search
      
      * Removing layer num normalization seems to be breaking
      
      * Not sure self.layer_number normalization actually matters
      
      * Try and be backward compatible
      
      * Try to fix beam_search
      
      * Revert attempt to be backward compatible
      
      * Improve documentation on past_key_values format
      
      * Optimize the device allocation in case of hidden_states in multiple devices
      
      * No need to manually cast the values to a specific device
      
      * Rename with long version of variables
      
      * Improve type hinting
      
      * Add comment that explains that some methods return views
      
      * Actually i think the attention casting only makes sense when we use torch.float16
      
      * We don't actually need layer_number to be passed anymore
      
      * Fix FX test
      
      * Bypass torch.baddbmm
      
      * Apply suggestions from code review
      
      * Add comment about support for torchScript v1.11
      
      * fix ONNX support for bloom (#18456)
      Co-authored-by: default avatarNiklas Muennighoff <n.muennighoff@gmail.com>
      Co-authored-by: default avatarNouamane Tazi <nouamane98@gmail.com>
      b69a62d5
  2. 03 Aug, 2022 10 commits
  3. 02 Aug, 2022 10 commits
  4. 01 Aug, 2022 13 commits