1. 09 Dec, 2020 2 commits
    • Sylvain Gugger's avatar
      7e1d709e
    • Patrick von Platen's avatar
      Diverse beam search 2 (#9006) · 02d0e035
      Patrick von Platen authored
      
      
      * diverse beam search
      
      * bug fixes
      
      * bug fixes
      
      * bug fix
      
      * separate out diverse_beam_search function
      
      * separate out diverse_beam_search function
      
      * bug fix
      
      * improve code quality
      
      * bug fix
      
      * bug fix
      
      * separate out diverse beam search scorer
      
      * code format
      
      * code format
      
      * code format
      
      * code format
      
      * add test
      
      * code format
      
      * documentation changes
      
      * code quality
      
      * add slow integration tests
      
      * more general name
      
      * refactor into logits processor
      
      * add test
      
      * avoid too much copy paste
      
      * refactor
      
      * add to docs
      
      * fix-copies
      
      * bug fix
      
      * Revert "bug fix"
      
      This reverts commit c99eb5a8dc57a7b0d33a8ac06d8c6a32a7812ad4.
      
      * improve comment
      
      * implement sylvains feedback
      Co-authored-by: default avatarAyush Jain <a.jain@sprinklr.com>
      Co-authored-by: default avatarayushtiku5 <40797286+ayushtiku5@users.noreply.github.com>
      02d0e035
  2. 08 Dec, 2020 9 commits
    • Lysandre Debut's avatar
      Templates overhaul 1 (#8993) · 67ff1c31
      Lysandre Debut authored
      67ff1c31
    • Sylvain Gugger's avatar
      New squad example (#8992) · 447808c8
      Sylvain Gugger authored
      
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add tick
      
      * Update README
      
      * Address review comments
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      447808c8
    • guillaume-be's avatar
      Removed unused `encoder_hidden_states` and `encoder_attention_mask` (#8972) · 7809eb82
      guillaume-be authored
      * Removed unused `encoder_hidden_states` and `encoder_attention_mask` from MobileBert
      
      * Removed decoder tests for MobileBert
      
      * Removed now unnecessary import
      7809eb82
    • Lysandre Debut's avatar
    • Sylvain Gugger's avatar
      Make `ModelOutput` pickle-able (#8989) · 04c446f7
      Sylvain Gugger authored
      04c446f7
    • Julien Chaumond's avatar
      0d9e6ca9
    • Julien Plu's avatar
      Optional layers (#8961) · bf7f79cd
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * remove unused imports
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Fix wrong model name
      
      * Fix BART
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Add boolean processing for the inputs
      
      * Take into account the optional layers
      
      * Add missing/unexpected weights in the other models
      
      * Apply style
      
      * rename parameters
      
      * Apply style
      
      * Remove useless
      
      * Remove useless
      
      * Remove useless
      
      * Update num parameters
      
      * Fix tests
      
      * Address Patrick's comment
      
      * Remove useless attribute
      bf7f79cd
    • Stas Bekman's avatar
      [training] SAVE_STATE_WARNING was removed in pytorch (#8979) · 9d7d0005
      Stas Bekman authored
      * [training] SAVE_STATE_WARNING was removed in pytorch
      
      FYI `SAVE_STATE_WARNING` has been removed 3 days ago: pytorch/pytorch#46813
      
      Fixes: #8232
      
      @sgugger
      
      * style, but add () to prevent autoformatters from botching it
      
      * switch to try/except
      
      * cleanup
      9d7d0005
    • Lysandre Debut's avatar
      Check table as independent script (#8976) · 2ae7388e
      Lysandre Debut authored
      2ae7388e
  3. 07 Dec, 2020 11 commits
  4. 06 Dec, 2020 1 commit
  5. 05 Dec, 2020 2 commits
  6. 04 Dec, 2020 5 commits
  7. 03 Dec, 2020 7 commits
  8. 02 Dec, 2020 3 commits
    • Patrick von Platen's avatar
      [PyTorch] Refactor Resize Token Embeddings (#8880) · 443f67e8
      Patrick von Platen authored
      * fix resize tokens
      
      * correct mobile_bert
      
      * move embedding fix into modeling_utils.py
      
      * refactor
      
      * fix lm head resize
      
      * refactor
      
      * break lines to make sylvain happy
      
      * add news tests
      
      * fix typo
      
      * improve test
      
      * skip bart-like for now
      
      * check if base_model = get(...) is necessary
      
      * clean files
      
      * improve test
      
      * fix tests
      
      * revert style templates
      
      * Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py
      443f67e8
    • Devangi Purkayastha's avatar
      Update README.md (#8906) · e52f9c0a
      Devangi Purkayastha authored
      e52f9c0a
    • ryota-mo's avatar
      Fix typo in docstring (#8905) · 801b2cb3
      ryota-mo authored
      801b2cb3