1. 08 Dec, 2020 9 commits
    • Lysandre Debut's avatar
      Templates overhaul 1 (#8993) · 67ff1c31
      Lysandre Debut authored
      67ff1c31
    • Sylvain Gugger's avatar
      New squad example (#8992) · 447808c8
      Sylvain Gugger authored
      
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add tick
      
      * Update README
      
      * Address review comments
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      447808c8
    • guillaume-be's avatar
      Removed unused `encoder_hidden_states` and `encoder_attention_mask` (#8972) · 7809eb82
      guillaume-be authored
      * Removed unused `encoder_hidden_states` and `encoder_attention_mask` from MobileBert
      
      * Removed decoder tests for MobileBert
      
      * Removed now unnecessary import
      7809eb82
    • Lysandre Debut's avatar
    • Sylvain Gugger's avatar
      Make `ModelOutput` pickle-able (#8989) · 04c446f7
      Sylvain Gugger authored
      04c446f7
    • Julien Chaumond's avatar
      0d9e6ca9
    • Julien Plu's avatar
      Optional layers (#8961) · bf7f79cd
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * remove unused imports
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Fix wrong model name
      
      * Fix BART
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Add boolean processing for the inputs
      
      * Take into account the optional layers
      
      * Add missing/unexpected weights in the other models
      
      * Apply style
      
      * rename parameters
      
      * Apply style
      
      * Remove useless
      
      * Remove useless
      
      * Remove useless
      
      * Update num parameters
      
      * Fix tests
      
      * Address Patrick's comment
      
      * Remove useless attribute
      bf7f79cd
    • Stas Bekman's avatar
      [training] SAVE_STATE_WARNING was removed in pytorch (#8979) · 9d7d0005
      Stas Bekman authored
      * [training] SAVE_STATE_WARNING was removed in pytorch
      
      FYI `SAVE_STATE_WARNING` has been removed 3 days ago: pytorch/pytorch#46813
      
      Fixes: #8232
      
      @sgugger
      
      * style, but add () to prevent autoformatters from botching it
      
      * switch to try/except
      
      * cleanup
      9d7d0005
    • Lysandre Debut's avatar
      Check table as independent script (#8976) · 2ae7388e
      Lysandre Debut authored
      2ae7388e
  2. 07 Dec, 2020 11 commits
  3. 06 Dec, 2020 1 commit
  4. 05 Dec, 2020 2 commits
  5. 04 Dec, 2020 5 commits
  6. 03 Dec, 2020 7 commits
  7. 02 Dec, 2020 5 commits
    • Patrick von Platen's avatar
      [PyTorch] Refactor Resize Token Embeddings (#8880) · 443f67e8
      Patrick von Platen authored
      * fix resize tokens
      
      * correct mobile_bert
      
      * move embedding fix into modeling_utils.py
      
      * refactor
      
      * fix lm head resize
      
      * refactor
      
      * break lines to make sylvain happy
      
      * add news tests
      
      * fix typo
      
      * improve test
      
      * skip bart-like for now
      
      * check if base_model = get(...) is necessary
      
      * clean files
      
      * improve test
      
      * fix tests
      
      * revert style templates
      
      * Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
      443f67e8
    • Devangi Purkayastha's avatar
      Update README.md (#8906) · e52f9c0a
      Devangi Purkayastha authored
      e52f9c0a
    • ryota-mo's avatar
      Fix typo in docstring (#8905) · 801b2cb3
      ryota-mo authored
      801b2cb3
    • Stas Bekman's avatar
      [trainer] improve code readability (#8903) · 7e1cb00c
      Stas Bekman authored
      * [trainer] improve code
      
      This PR:
      - removes redundant code 
      ```
      self.model = model if model is not None else None
      ```
      and
      ```
      self.model = model
      ```
      are the same.
      
      * separate attribute assignment from code logic - which simplifies things further.
      
      * whitespace
      7e1cb00c
    • Nicolas Patry's avatar
      Warning about too long input for fast tokenizers too (#8799) · a8c3f9aa
      Nicolas Patry authored
      * Warning about too long input for fast tokenizers too
      
      If truncation is not set in tokenizers, but the tokenization is too long
      for the model (`model_max_length`), we used to trigger a warning that
      
      The input would probably fail (which it most likely will).
      
      This PR re-enables the warning for fast tokenizers too and uses common
      code for the trigger to make sure it's consistent across.
      
      * Checking for pair of inputs too.
      
      * Making the function private and adding it's doc.
      
      * Remove formatting ?? in odd place.
      
      * Missed uppercase.
      a8c3f9aa