1. 11 Dec, 2020 16 commits
  2. 10 Dec, 2020 7 commits
  3. 09 Dec, 2020 10 commits
  4. 08 Dec, 2020 7 commits
    • Lysandre Debut's avatar
      Templates overhaul 1 (#8993) · 67ff1c31
      Lysandre Debut authored
      67ff1c31
    • Sylvain Gugger's avatar
      New squad example (#8992) · 447808c8
      Sylvain Gugger authored
      
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add new SQUAD example
      
      * Same with a task-specific Trainer
      
      * Address review comment.
      
      * Small fixes
      
      * Initial work for XLNet
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Final clean up and working XLNet script
      
      * Test and debug
      
      * Final working version
      
      * Add tick
      
      * Update README
      
      * Address review comments
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      447808c8
    • guillaume-be's avatar
      Removed unused `encoder_hidden_states` and `encoder_attention_mask` (#8972) · 7809eb82
      guillaume-be authored
      * Removed unused `encoder_hidden_states` and `encoder_attention_mask` from MobileBert
      
      * Removed decoder tests for MobileBert
      
      * Removed now unnecessary import
      7809eb82
    • Lysandre Debut's avatar
    • Sylvain Gugger's avatar
      Make `ModelOutput` pickle-able (#8989) · 04c446f7
      Sylvain Gugger authored
      04c446f7
    • Julien Chaumond's avatar
      0d9e6ca9
    • Julien Plu's avatar
      Optional layers (#8961) · bf7f79cd
      Julien Plu authored
      * Apply on BERT and ALBERT
      
      * Update TF Bart
      
      * Add input processing to TF BART
      
      * Add input processing for TF CTRL
      
      * Add input processing to TF Distilbert
      
      * Add input processing to TF DPR
      
      * Add input processing to TF Electra
      
      * Add deprecated arguments
      
      * Add input processing to TF XLM
      
      * remove unused imports
      
      * Add input processing to TF Funnel
      
      * Add input processing to TF GPT2
      
      * Add input processing to TF Longformer
      
      * Add input processing to TF Lxmert
      
      * Apply style
      
      * Add input processing to TF Mobilebert
      
      * Add input processing to TF GPT
      
      * Add input processing to TF Roberta
      
      * Add input processing to TF T5
      
      * Add input processing to TF TransfoXL
      
      * Apply style
      
      * Rebase on master
      
      * Fix wrong model name
      
      * Fix BART
      
      * Apply style
      
      * Put the deprecated warnings in the input processing function
      
      * Remove the unused imports
      
      * Raise an error when len(kwargs)>0
      
      * test ModelOutput instead of TFBaseModelOutput
      
      * Address Patrick's comments
      
      * Address Patrick's comments
      
      * Add boolean processing for the inputs
      
      * Take into account the optional layers
      
      * Add missing/unexpected weights in the other models
      
      * Apply style
      
      * rename parameters
      
      * Apply style
      
      * Remove useless
      
      * Remove useless
      
      * Remove useless
      
      * Update num parameters
      
      * Fix tests
      
      * Address Patrick's comment
      
      * Remove useless attribute
      bf7f79cd