  1. 04 Dec, 2020 1 commit
    • Better booleans handling in the TF models (#8777) · dcd3046f
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Bug fix
      
      * Retry to bugfix
      
      * Retry bug fix
      
      * Fix wrong model name
      
      * Try another fix
      
      * Fix BART
      
      * Fix input processing
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Bug fix
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Address Sylvain's comments
      
      * Add boolean processing for the inputs
      
      * Apply style
      
      * Missing optional
      
      * Fix missing some input proc
      
      * Update the template
      
      * Fix missing inputs
      
      * Missing input
      
      * Fix args parameter
      
      * Trigger CI
      
      * Trigger CI
      
      * Trigger CI
      
      * Address Patrick's and Sylvain's comments
      
      * Replace warn by warning
      
      * Trigger CI
      
      * Fix XLNET
      
      * Fix detection
  2. 02 Dec, 2020 3 commits
    • [PyTorch] Refactor Resize Token Embeddings (#8880) · 443f67e8
      Patrick von Platen authored
      * fix resize tokens
      
      * correct mobile_bert
      
      * move embedding fix into modeling_utils.py
      
      * refactor
      
      * fix lm head resize
      
      * refactor
      
      * break lines to make sylvain happy
      
      * add new tests
      
      * fix typo
      
      * improve test
      
      * skip bart-like for now
      
      * check if base_model = get(...) is necessary
      
      * clean files
      
      * improve test
      
      * fix tests
      
      * revert style templates
      
      * Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
    • Fix typo in docstring (#8905) · 801b2cb3
      ryota-mo authored
    • Transfoxl seq classification (#8868) · f6b44e61
      sandip authored
      * Transfoxl sequence classification
      
      * Transfoxl sequence classification
  3. 01 Dec, 2020 5 commits
  4. 30 Nov, 2020 2 commits
    • Release: v4.0.0 · 22b0ff75
      LysandreJik authored
    • Add T5 Encoder for Feature Extraction (#8717) · 40ecaf0c
      Ahmed Elnaggar authored
      
      
      * Add T5 Encoder class for feature extraction
      
      * fix T5 encoder add_start_docstrings indent
      
      * update init with T5 encoder
      
      * update init with TFT5ModelEncoder
      
      * remove TFT5ModelEncoder
      
      * change T5ModelEncoder order in init
      
      * add T5ModelEncoder to transformers init
      
      * clean T5ModelEncoder
      
      * update init with TFT5ModelEncoder
      
      * add TFModelEncoder for Tensorflow
      
      * update init with TFT5ModelEncoder
      
      * Update src/transformers/models/t5/modeling_t5.py
      
      change output from Seq2SeqModelOutput to BaseModelOutput
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * remove encoder_outputs
      
      1. remove encoder_outputs from the function call.
      2. remove the encoder_outputs If statement.
      3. remove isinstance from return_dict.
      
      * Authorize missing decoder keys
      
      * remove unnecessary input parameters
      
      remove past_key_values and use_cache
      
      * remove use_cache
      
      remove use_cache from the forward method
      
      * add docstring for T5 encoder
      
      add docstring for T5 encoder with T5_ENCODER_INPUTS_DOCSTRING
      
      * change return_dict to dot access
      
      * add T5_ENCODER_INPUTS_DOCSTRING for TF T5
      
      * change TFT5Encoder output type to BaseModelOutput
      
      * remove unnecessary parameters for TFT5Encoder
      
      * remove unnecessary if statement
      
      * add import BaseModelOutput
      
      * fix BaseModelOutput typo to TFBaseModelOutput
      
      * update T5 doc with T5ModelEncoder
      
      * add T5ModelEncoder to tests
      
      * finish pytorch
      
      * finish docs and mt5
      
      * add mtf to init
      
      * fix init
      
      * remove n_positions
      
      * finish PR
      
      * Update src/transformers/models/mt5/modeling_mt5.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * Update src/transformers/models/t5/modeling_t5.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * Update src/transformers/models/t5/modeling_tf_t5.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * Update src/transformers/models/mt5/modeling_tf_mt5.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * make style
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  5. 29 Nov, 2020 1 commit
    • [Pegasus] Refactor Tokenizer (#8731) · 5ced23dc
      Patrick von Platen authored
      * refactor
      
      * further refactor
      
      * fix the rest tomorrow
      
      * save intermediate
      
      * finish slow tokenizer
      
      * make more tests pass
      
      * finish refactor
      
      * fix comment
      
      * clean further
      
      * fix name
      
      * fix naming
      
      * Update src/transformers/models/reformer/tokenization_reformer.py
      
      * Apply suggestions from code review
      
      * Apply suggestions from code review
      
      * refactor
      
      * fix init tokenizers
      
      * refactor
      
      * improve convert
      
      * refactor
      
      * correct convert slow tokenizer
      
      * final fix for Pegasus Tok
      
      * remove ipdb
      
      * improve links
  6. 28 Nov, 2020 1 commit
  7. 27 Nov, 2020 5 commits
  8. 25 Nov, 2020 3 commits
  9. 24 Nov, 2020 4 commits
    • New TF model inputs (#8602) · 29d49924
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add input processing for TF Flaubert
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * remove unused imports
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Bug fix
      
      * Retry to bugfix
      
      * Retry bug fix
      
      * Fix wrong model name
      
      * Try another fix
      
      * Fix BART
      
      * Fix input processing
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Bug fix
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Address Sylvain's comments
      
      * Add the new inputs in new Longformer models
      
      * Update the template with the new input processing
      
      * Remove useless assert
      
      * Apply style
      
      * Trigger CI
    • MT5 should have an autotokenizer (#8743) · e09e54fd
      Lysandre Debut authored
      * MT5 should have an autotokenizer
      
      * Different configurations should be able to point to same tokenizers
    • Support various BERT relative position embeddings (2nd) (#8276) · 2c83b3c3
      zhiheng-huang authored
      
      
      * Support BERT relative position embeddings
      
      * Fix typo in README.md
      
      * Address review comment
      
      * Fix failing tests
      
      * [tiny] Fix style_doc.py check by adding an empty line to configuration_bert.py
      
      * make fix copies
      
      * fix configs of electra and albert and fix longformer
      
      * remove copy statement from longformer
      
      * fix albert
      
      * fix electra
      
      * Add bert variants forward tests for various position embeddings
      
      * [tiny] Fix style for test_modeling_bert.py
      
      * improve docstring
      
      * [tiny] improve docstring and remove unnecessary dependency
      
      * [tiny] Remove unused import
      
      * re-add to ALBERT
      
      * make embeddings work for ALBERT
      
      * add test for albert
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    • Model parallel documentation (#8741) · 02f48b9b
      Lysandre Debut authored
      * Add parallelize methods to the .rst files
      
      * Correct format
  10. 23 Nov, 2020 4 commits
  11. 22 Nov, 2020 1 commit
  12. 20 Nov, 2020 1 commit
    • Fix rag finetuning + add finetuning test (#8585) · 8062fa63
      Quentin Lhoest authored
      * replace init_ddp_connection for index init
      
      * style
      
      * add finetune test
      
      * add test data
      
      * move generate tensors to device
      
      * add test on EM metric
      
      * style
      
      * allow multi process test
      
      * keep gloo process group for retrieval
      
      * add multi-gpu test
      
      * use custom accelerator
      
      * clean test finetune
      
      * minor
      
      * style
      
      * style
      
      * typo
      
      * use python call instead of imported main function
      
      * return_dict fix in modeling_rag
      
      * use float32 in retrieval
      
      * store as float32 as well in the custom knowledge dataset example
      
      * style
      
      * rename to finetune_rag
      
      * style
      
      * update readme
      
      * rename utils and callbacks to utils_rag and callbacks_rag
      
      * fix test
      
      * patrick's comments
      
      * generate dummy data in the finetune test script
      
      * remove dummy data files
      
      * style
  13. 19 Nov, 2020 5 commits
    • Add sentencepiece to the CI and fix tests (#8672) · 6494910f
      Sylvain Gugger authored
      * Fix the CI and tests
      
      * Fix quality
      
      * Remove that m from nowhere
    • `disable_ngram_loss` fix for prophetnet (#8554) · ca0109bd
      Zhylko Dima authored
      
      
      * `disable_ngram_loss` fix for prophetnet
      
      * add changes documentation
      
      * fix _compute_loss to use mean reduction and -100 to masked tokens & remove unnecessary arguments
      
      * mean label smoothing loss
      
      * small refactor
      
      * fix test
      Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
    • Better filtering of the model outputs in Trainer (#8633) · 4208f496
      Sylvain Gugger authored
      * Better filtering of the model outputs in Trainer
      
      * Fix examples tests
      
      * Add test for Lysandre
    • Fix a bunch of slow tests (#8634) · f2e07e72
      Lysandre Debut authored
      
      
      * CI should install `sentencepiece`
      
      * Requiring TF
      
      * Fixing some TFDPR bugs
      
      * remove return_dict=False/True hack
      Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
    • Tf longformer for sequence classification (#8231) · 5362bb8a
      elk-cloner authored
      
      
      * working on LongformerForSequenceClassification
      
      * add TFLongformerForMultipleChoice
      
      * add TFLongformerForTokenClassification
      
      * use add_start_docstrings_to_model_forward
      
      * test TFLongformerForSequenceClassification
      
      * test TFLongformerForMultipleChoice
      
      * test TFLongformerForTokenClassification
      
      * remove test from repo
      
      * add test and doc for TFLongformerForSequenceClassification, TFLongformerForTokenClassification, TFLongformerForMultipleChoice
      
      * add requested classes to modeling_tf_auto.py
      update dummy_tf_objects
      fix tests
      fix bugs in requested classes
      
      * pass all tests except test_inputs_embeds
      
      * sync with master
      
      * pass all tests except test_inputs_embeds
      
      * pass all tests
      
      * pass all tests
      
      * work on test_inputs_embeds
      
      * fix style and quality
      
      * make multi choice work
      
      * fix TFLongformerForTokenClassification signature
      
      * fix TFLongformerForMultipleChoice, TFLongformerForSequenceClassification signature
      
      * fix mult choice
      
      * fix mc hint
      
      * fix input embeds
      
      * fix input embeds
      
      * refactor input embeds
      
      * fix copy issue
      
      * apply sylvains changes and clean more
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  14. 18 Nov, 2020 3 commits
  15. 17 Nov, 2020 1 commit