1. 15 Dec, 2020 5 commits
  2. 14 Dec, 2020 4 commits
    • Julien Plu's avatar
      Fix T5 and BART for TF (#9063) · df3f4d2a
      Julien Plu authored
      * Fix T5 for graphe compilation+execution
      
      * Fix BART
      
      * Fix import
      
      * Fix naming
      
      * fix attribute name
      
      * Oops
      
      * fix import
      
      * fix tests
      
      * fix tests
      
      * Update test
      
      * Add mising import
      
      * Address Patrick's comments
      
      * Style
      
      * Address Patrick's comment
      df3f4d2a
    • Ahmed Elnaggar's avatar
      Add parallelization support for T5EncoderModel (#9082) · a9c8bff7
      Ahmed Elnaggar authored
      
      
      * add model parallelism to T5EncoderModel
      
      add model parallelism to T5EncoderModel
      
      * remove decoder from T5EncoderModel parallelize
      
      * uodate T5EncoderModel docs
      
      * Extend T5ModelTest for T5EncoderModel
      
      * fix T5Stask using range for get_device_map
      
      * fix style
      Co-authored-by: default avatarAhmed Elnaggar <elnaggar@rostlab.informatik.tu-muenchen.de>
      a9c8bff7
    • Patrick von Platen's avatar
      [RAG, Bart] Align RAG, Bart cache with T5 and other models of transformers (#9098) · fa1ddced
      Patrick von Platen authored
      * fix rag
      
      * fix slow test
      
      * fix past in bart
      fa1ddced
    • Julien Plu's avatar
      Fix embeddings resizing in TF models (#8657) · 51d9c569
      Julien Plu authored
      * Resize the biases in same time than the embeddings
      
      * Trigger CI
      
      * Biases are not reset anymore
      
      * Remove get_output_embeddings + better LM model detection in generation utils
      
      * Apply style
      
      * First test on BERT
      
      * Update docstring + new name
      
      * Apply the new resizing logic to all the models
      
      * fix tests
      
      * Apply style
      
      * Update the template
      
      * Fix naming
      
      * Fix naming
      
      * Apply style
      
      * Apply style
      
      * Remove unused import
      
      * Revert get_output_embeddings
      
      * Trigger CI
      
      * Update num parameters
      
      * Restore get_output_embeddings in TFPretrainedModel and add comments
      
      * Style
      
      * Add decoder resizing
      
      * Style
      
      * Fix tests
      
      * Separate bias and decoder resize
      
      * Fix tests
      
      * Fix tests
      
      * Apply style
      
      * Add bias resizing in MPNet
      
      * Trigger CI
      
      * Apply style
      51d9c569
  3. 11 Dec, 2020 1 commit
  4. 10 Dec, 2020 1 commit
  5. 09 Dec, 2020 4 commits
  6. 08 Dec, 2020 3 commits
    • Sylvain Gugger's avatar
      New squad example (#8992) · 447808c8
      Sylvain Gugger authored
      
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add tick
      
      * Update README
      
      * Address review comments
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      447808c8
    • guillaume-be's avatar
      Removed unused `encoder_hidden_states` and `encoder_attention_mask` (#8972) · 7809eb82
      guillaume-be authored
      * Removed unused `encoder_hidden_states` and `encoder_attention_mask` from MobileBert
      
      * Removed decoder tests for MobileBert
      
      * Removed now unnecessary import
      7809eb82
    • Julien Plu's avatar
      Optional layers (#8961) · bf7f79cd
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * remove unused imports
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Fix wrong model name
      
      * Fix BART
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Add boolean processing for the inputs
      
      * Take into account the optional layers
      
      * Add missing/unexpected weights in the other models
      
      * Apply style
      
      * rename parameters
      
      * Apply style
      
      * Remove useless
      
      * Remove useless
      
      * Remove useless
      
      * Update num parameters
      
      * Fix tests
      
      * Address Patrick's comment
      
      * Remove useless attribute
      bf7f79cd
  7. 07 Dec, 2020 3 commits
  8. 03 Dec, 2020 1 commit
  9. 02 Dec, 2020 3 commits
    • Patrick von Platen's avatar
      [PyTorch] Refactor Resize Token Embeddings (#8880) · 443f67e8
      Patrick von Platen authored
      * fix resize tokens
      
      * correct mobile_bert
      
      * move embedding fix into modeling_utils.py
      
      * refactor
      
      * fix lm head resize
      
      * refactor
      
      * break lines to make sylvain happy
      
      * add news tests
      
      * fix typo
      
      * improve test
      
      * skip bart-like for now
      
      * check if base_model = get(...) is necessary
      
      * clean files
      
      * improve test
      
      * fix tests
      
      * revert style templates
      
      * Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
      443f67e8
    • Nicolas Patry's avatar
      Warning about too long input for fast tokenizers too (#8799) · a8c3f9aa
      Nicolas Patry authored
      * Warning about too long input for fast tokenizers too
      
      If truncation is not set in tokenizers, but the tokenization is too long
      for the model (`model_max_length`), we used to trigger a warning that
      
      The input would probably fail (which it most likely will).
      
      This PR re-enables the warning for fast tokenizers too and uses common
      code for the trigger to make sure it's consistent across.
      
      * Checking for pair of inputs too.
      
      * Making the function private and adding it's doc.
      
      * Remove formatting ?? in odd place.
      
      * Missed uppercase.
      a8c3f9aa
    • sandip's avatar
      Transfoxl seq classification (#8868) · f6b44e61
      sandip authored
      * Transfoxl sequence classification
      
      * Transfoxl sequence classification
      f6b44e61
  10. 01 Dec, 2020 2 commits
  11. 30 Nov, 2020 3 commits
  12. 29 Nov, 2020 1 commit
    • Patrick von Platen's avatar
      [Pegasus] Refactor Tokenizer (#8731) · 5ced23dc
      Patrick von Platen authored
      * refactor
      
      * further refactor
      
      * fix the rest tomorrow
      
      * save intermediate
      
      * finish slow tokenizer
      
      * make more tests pass
      
      * finish refactor
      
      * fix comment
      
      * clean further
      
      * fix name
      
      * fix naming
      
      * Update src/transformers/models/reformer/tokenization_reformer.py
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * refactor
      
      * fix init tokenizers
      
      * refactor
      
      * improve convert
      
      * refactor
      
      * correct convert slow tokenizer
      
      * final fix for Pegasus Tok
      
      * remove ipdb
      
      * improve links
      5ced23dc
  13. 27 Nov, 2020 6 commits
  14. 25 Nov, 2020 3 commits