1. 22 Jan, 2021 5 commits
  2. 21 Jan, 2021 11 commits
  3. 20 Jan, 2021 17 commits
  4. 19 Jan, 2021 7 commits
    • Sylvain Gugger's avatar
      Fix model templates and use less than 119 chars (#9684) · 7e662e6a
      Sylvain Gugger authored
      * Fix model templates and use less than 119 chars
      
      * Missing new line
      7e662e6a
    • Daniel Stancl's avatar
      Add separated decoder_head_mask for T5 Models (#9634) · 2ebbbf55
      Daniel Stancl authored
      * Add decoder_head_mask for PyTorch T5 model
      
      * Add decoder_head_mask args into T5Model and T5ForConditionalGeneration
      
      * Slightly change the order of input args to be in accordance
      with the convention from BART-based models introduced within the PR #9569.
      
      * Make style for modeling_t5.py
      
      * Add decoder_head_mask for TF T5 models
      
      * Separate head_mask and decoder_head_mask args in TF T5 models
      
      * Slightly change the order of input args to follow convention
      of BART-based models updated in PR #9569
      
      * Update test_forward_signature tests/test_modeling_tf_common.py
      w.r.t. the changed order of input args
      
      * Add FutureWarnings for T5 and TFT5 models
      
      * Add FutureWarnings for T5 and TFT5 models warning a user that
      input argument `head_mask` was split into two arguments -
      `head_mask` and `decoder_head_mask`
      
      * Add default behaviour - `decoder_head_mask` is set to copy
      `head_mask`
      
      * Fix T5 modeling and FutureWarning
      
      * Make proper usage of head_mask and decoder_head_mask
      in cross_attention
      
      * Fix conditions for raising FutureWarning
      
      * Reformat FutureWarning in T5 modeling
      
      * Refactor the warning message
      2ebbbf55
    • Sylvain Gugger's avatar
      New run_seq2seq script (#9605) · e4c06ed6
      Sylvain Gugger authored
      
      
      * New run_seq2seq script
      
      * Add tests
      
      * Mark as slow
      
      * Update examples/seq2seq/run_seq2seq.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/data/data_collator.py
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * Update src/transformers/data/data_collator.py
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      
      * Address review comments
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarSuraj Patil <surajp815@gmail.com>
      e4c06ed6
    • Julien Plu's avatar
      Fix TF Flaubert and XLM (#9661) · fa876aee
      Julien Plu authored
      * Fix Flaubert and XLM
      
      * Fix Flaubert and XLM
      
      * Apply style
      fa876aee
    • max yue's avatar
      Update integrations.py (#9652) · 11ec7490
      max yue authored
      File "/share/apps/anaconda3/envs/my_env/lib/python3.7/site-packages/transformers/integrations.py", line 419, in __init__
          self._SummaryWriter = SummaryWriter
      UnboundLocalError: local variable 'SummaryWriter' referenced before assignment
      11ec7490
    • Yusuke Mori's avatar
      Update `past_key_values` in GPT-2 (#9596) · b020a736
      Yusuke Mori authored
      
      
      * Update past_key_values in gpt2 (#9391)
      
      * Update generation_utils, and rename some items
      
      * Update modeling_gpt2 to avoid an error in gradient_checkpointing
      
      * Remove 'reorder_cache' from util and add variations to XLNet, TransfoXL, GPT-2
      
      * Change the location of '_reorder_cache' in modeling files
      
      * Add '_reorder_cache' in modeling_ctrl
      
      * Fix a bug of my last commit in CTRL
      
      * Add '_reorder_cache' to GPT2DoubleHeadsModel
      
      * Manage 'use_cache' in config of test_modeling_gpt2
      
      * Clean up the doc string
      
      * Update src/transformers/models/gpt2/modeling_gpt2.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Fix the doc string (GPT-2, CTRL)
      
      * improve gradient_checkpointing_behavior
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      b020a736
    • Sylvain Gugger's avatar
      Fix old Seq2SeqTrainer (#9675) · 97b787fb
      Sylvain Gugger authored
      97b787fb