1. 25 Jul, 2023 1 commit
    • Sebastian Husch Lee's avatar
      [`T5`, `MT5`, `UMT5`] Add [T5, MT5, UMT5]ForSequenceClassification (#24726) · 8f36ab3e
      Sebastian Husch Lee authored
      * Initial addition of t5forsequenceclassification
      
      * Adding imports and adding tests
      
      * Formatting
      
      * Running make fix-copies
      
      * Adding mt5forseq
      
      * Formatting
      
      * run make fix-copies
      
      * Adding to docs
      
      * Add model_parallel
      
      * Fix bug
      
      * Fix
      
      * Remove TODO
      
      * Fixing tests for T5ForSequenceClassification
      
      * Undo changes to dependency_versions_table.py
      
      * Change classification head to work with T5Config directly
      
      * Change seq length to let tests pass
      
      * PR comments for formatting
      
      * Formatting
      
      * Initial addition of UMT5ForSequenceClassification
      
      * Adding to inits and formatting
      
      * run make fix-copies
      
      * Add doc for UMT5ForSeqClass
      
      * Update UMT5 config
      
      * Fix docs
      
      * Skip torch fx test for SequenceClassification
      
      * Formatting
      
      * Add skip to UMT5 tests as well
      
      * Fix umt5 tests
      
      * Running make fix-copies
      
      * PR comments
      
      * Fix for change to sentence_representation
      
      * Rename seq_len to hidden_size since that's what it is
      
      * Use base_model to follow format of the rest of the library
      
      * Update docs
      
      * Extract the decoder_input_ids changes and make one liner
      
      * Make one-liner
      8f36ab3e
  2. 27 Jun, 2023 1 commit
  3. 20 Jun, 2023 1 commit
  4. 17 Mar, 2023 1 commit
  5. 21 Feb, 2023 1 commit
  6. 15 Feb, 2023 1 commit
    • Steven Liu's avatar
      Refactor model summary (#21408) · 7a5533b2
      Steven Liu authored
      * first draft of model summary
      
      * restructure docs
      
      * finish first draft
      
      * minor reviews and edits
      
      * apply feedbacks
      
      * save important info, create new page for attention
      
      * add attention doc to toctree
      
      *  few more minor fixes
      7a5533b2
  7. 07 Nov, 2022 1 commit
  8. 29 Jun, 2022 1 commit
    • Crystina's avatar
      Flax t5 Encoder (#17784) · 692e61e9
      Crystina authored
      
      
      * first draft adding Flax-t5-encoder and Flax-mt5-encoder
      
      * imports
      
      * after make fixup
      
      * flax t5 encoder test
      
      * black on test
      
      * make fix-copies
      
      * clean
      
      * all_model_classes -> tuple
      
      * clean test
      
      * is_encoder_decoder=False in t5-enc tester
      
      * remove file docstring before FlaxT5Encoder
      
      * black
      
      * isort
      
      * commit suggestions on src/transformers/models/t5/modeling_flax_t5.py
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * commit suggestions on src/transformers/models/t5/modeling_flax_t5.py
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * remove _get_encoder_module
      
      * self.decoder_seq_length -> self.encoder_seq_length as t5-enc does not have decoder
      
      * bugfix - self.module_class is class itself, not instance;
      
      * docs for mt5 and t5
      
      * call -> __call__ in t5 doc
      
      * FlaxMT5EncoderModel to TYPE_HINT
      
      * run doc-builder to allow change the files
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      692e61e9
  9. 04 Apr, 2022 1 commit
  10. 22 Dec, 2021 1 commit