1. 28 Jan, 2021 3 commits
  2. 27 Jan, 2021 7 commits
  3. 26 Jan, 2021 3 commits
    • Adding `skip_special_tokens=True` to FillMaskPipeline (#9783) · 781e4b13
      Nicolas Patry authored
      * We most likely don't want special tokens in this output.
      
      * Adding `skip_special_tokens=True` to FillMaskPipeline
      
      - It's backward incompatible.
      - It makes more sense for pipelines to remove references to
        special_tokens (all of the other pipelines do that).
      - Keeping special tokens makes it hard for users to actually remove them
        because all models have different tokens (<s>, <cls>, [CLS], ...)
      
      * Fixing `token_str` in the same vein, and actually fixing the tests too!
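The motivation given in the commit above (every model family uses different special tokens, so users cannot easily strip them by hand) can be illustrated with a minimal sketch. This is not the pipeline's actual code: the `SPECIAL_TOKENS` set and the `decode` helper are hypothetical stand-ins for a tokenizer's own special-token list and decoding method.

```python
# Hypothetical toy set; a real tokenizer exposes its own special tokens.
SPECIAL_TOKENS = {"<s>", "</s>", "<cls>", "[CLS]", "[SEP]", "<mask>", "[MASK]"}


def decode(tokens, skip_special_tokens=False):
    """Join tokens into a string, optionally dropping special tokens."""
    if skip_special_tokens:
        tokens = [t for t in tokens if t not in SPECIAL_TOKENS]
    return " ".join(tokens)


# Two model families mark sequence boundaries differently.
roberta_style = ["<s>", "Paris", "is", "the", "capital", "</s>"]
bert_style = ["[CLS]", "Paris", "is", "the", "capital", "[SEP]"]

# With the flag set, both decode to the same clean string.
clean_roberta = decode(roberta_style, skip_special_tokens=True)
clean_bert = decode(bert_style, skip_special_tokens=True)
```

With the flag left at `False`, the caller would have to know each model's marker tokens to clean the output, which is the problem the commit describes.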
    • Add head_mask/decoder_head_mask for TF BART models (#9639) · 1867d9a8
      Daniel Stancl authored
      * Add head_mask/decoder_head_mask for TF BART models
      
      * Add head_mask and decoder_head_mask input arguments for TF BART-based
      models as a TF counterpart to the PR #9569
      
      * Add test_headmasking functionality to tests/test_modeling_tf_common.py
      
      * TODO: Add a test to verify that we can get a gradient back for
      importance score computation
      
      * Remove redundant #TODO note
      
      Remove redundant #TODO note from tests/test_modeling_tf_common.py
      
      * Fix assertions
      
      * Make style
      
      * Fix ...Model input args and adjust one new test
      
      * Add back head_mask and decoder_head_mask to BART-based ...Model
      after the last commit
      
      * Remove head_mask and decoder_head_mask from input_dict
      in TF test_train_pipeline_custom_model as these two have different
      shape than other input args (Necessary for passing this test)
      
      * Revert adding global_rng in test_modeling_tf_common.py
    • [Flaky Generation Tests] Make sure that no early stopping is happening for beam search (#9794) · d94cc2f9
      Patrick von Platen authored
      * fix ci
      
      * fix ci
      
      * renaming
      
      * fix dup line
  4. 25 Jan, 2021 1 commit
  5. 22 Jan, 2021 2 commits
  6. 21 Jan, 2021 5 commits
  7. 20 Jan, 2021 2 commits
  8. 19 Jan, 2021 5 commits
    • Add separated decoder_head_mask for T5 Models (#9634) · 2ebbbf55
      Daniel Stancl authored
      * Add decoder_head_mask for PyTorch T5 model
      
      * Add decoder_head_mask args into T5Model and T5ForConditionalGeneration
      
      * Slightly change the order of input args to be in accordance
      with the convention from BART-based models introduced within the PR #9569.
      
      * Make style for modeling_t5.py
      
      * Add decoder_head_mask for TF T5 models
      
      * Separate head_mask and decoder_head_mask args in TF T5 models
      
      * Slightly change the order of input args to follow convention
      of BART-based models updated in PR #9569
      
      * Update test_forward_signature tests/test_modeling_tf_common.py
      w.r.t. the changed order of input args
      
      * Add FutureWarnings for T5 and TFT5 models
      
      * Add FutureWarnings for T5 and TFT5 models warning a user that
      input argument `head_mask` was split into two arguments -
      `head_mask` and `decoder_head_mask`
      
      * Add default behaviour - `decoder_head_mask` is set to copy
      `head_mask`
      
      * Fix T5 modeling and FutureWarning
      
      * Make proper usage of head_mask and decoder_head_mask
      in cross_attention
      
      * Fix conditions for raising FutureWarning
      
      * Reformat FutureWarning in T5 modeling
      
      * Refactor the warning message
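The backward-compatibility behaviour described in the commit above (warn that `head_mask` was split, and default `decoder_head_mask` to a copy of `head_mask`) might look roughly like the sketch below. The helper name `resolve_head_masks`, the exact condition, and the warning text are assumptions for illustration; the real T5 modeling code may apply further conditions.

```python
import warnings


def resolve_head_masks(head_mask=None, decoder_head_mask=None):
    """Hypothetical helper returning (encoder_mask, decoder_mask).

    Assumption: when only `head_mask` is given, it is reused for the
    decoder and a FutureWarning tells the caller about the split.
    """
    if head_mask is not None and decoder_head_mask is None:
        warnings.warn(
            "`head_mask` was split into `head_mask` and `decoder_head_mask`; "
            "`decoder_head_mask` currently defaults to a copy of `head_mask`.",
            FutureWarning,
        )
        decoder_head_mask = head_mask
    return head_mask, decoder_head_mask
```

Passing both arguments explicitly skips the warning, so callers who have already migrated are unaffected.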
    • New run_seq2seq script (#9605) · e4c06ed6
      Sylvain Gugger authored
      * New run_seq2seq script
      
      * Add tests
      
      * Mark as slow
      
      * Update examples/seq2seq/run_seq2seq.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/transformers/data/data_collator.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Update src/transformers/data/data_collator.py
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
      
      * Address review comments
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Suraj Patil <surajp815@gmail.com>
    • Update `past_key_values` in GPT-2 (#9596) · b020a736
      Yusuke Mori authored
      * Update past_key_values in gpt2 (#9391)
      
      * Update generation_utils, and rename some items
      
      * Update modeling_gpt2 to avoid an error in gradient_checkpointing
      
      * Remove 'reorder_cache' from util and add variations to XLNet, TransfoXL, GPT-2
      
      * Change the location of '_reorder_cache' in modeling files
      
      * Add '_reorder_cache' in modeling_ctrl
      
      * Fix a bug of my last commit in CTRL
      
      * Add '_reorder_cache' to GPT2DoubleHeadsModel
      
      * Manage 'use_cache' in config of test_modeling_gpt2
      
      * Clean up the doc string
      
      * Update src/transformers/models/gpt2/modeling_gpt2.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Fix the doc string (GPT-2, CTRL)
      
      * improve gradient_checkpointing_behavior
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
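The `_reorder_cache` methods mentioned in the commit above exist so that, during beam search, cached key/value states follow the beams that were selected at each step. The sketch below shows only the idea, using plain Python lists in place of tensors; the function name `reorder_cache`, the cache layout (one list per layer, indexed by beam), and the sample data are assumptions, not the library's implementation.

```python
def reorder_cache(past, beam_idx):
    """Reorder each layer's cached entries to match the selected beams.

    past:     tuple with one entry per layer; each entry is a list whose
              i-th element is the cached state for beam i (assumed layout).
    beam_idx: for each new beam, the index of the old beam it continues.
    """
    return tuple([layer_past[i] for i in beam_idx] for layer_past in past)


# Two layers, three beams; strings stand in for cached tensors.
past = (["a0", "a1", "a2"], ["b0", "b1", "b2"])

# Beam 2 survives once, beam 0 is duplicated, and beam 1 is dropped.
reordered = reorder_cache(past, [2, 0, 0])
```

A tensor-based version would typically select along the batch/beam dimension instead of building new lists, but the index bookkeeping is the same.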
    • Fix DPRReaderTokenizer's attention_mask (#9663) · 917dbb15
      Sergey Mkrtchyan authored
      * Fix the attention_mask in DPRReaderTokenizer
      
      * Add an integration test for DPRReader inference
      
      * Run make style
    • fix test (#9669) · 12c1b5b8
      Patrick von Platen authored
  9. 18 Jan, 2021 1 commit
  10. 14 Jan, 2021 3 commits
  11. 13 Jan, 2021 3 commits
  12. 12 Jan, 2021 5 commits
    • Refactor `prepare_seq2seq_batch` (#9524) · 063d8d27
      Sylvain Gugger authored
      * Add target contextmanager and rework prepare_seq2seq_batch
      
      * Fix tests, treat BART and Barthez
      
      * Add last tokenizers
      
      * Fix test
      
      * Set src token before calling the superclass
      
      * Remove special behavior for T5
      
      * Remove needless imports
      
      * Remove needless asserts
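The "target contextmanager" named in the commit above can be pictured as a context manager that temporarily switches a tokenizer into target-language mode while labels are encoded, then restores source mode. The sketch below is a toy illustration under that assumption: `ToyTokenizer`, its `as_target` method, and the language codes are hypothetical and do not reproduce the library's tokenizer classes.

```python
from contextlib import contextmanager


class ToyTokenizer:
    """Hypothetical tokenizer that prefixes each sequence with a language token."""

    def __init__(self, src_lang, tgt_lang):
        self.src_lang = src_lang
        self.tgt_lang = tgt_lang
        self.prefix = src_lang  # start in source mode

    def __call__(self, text):
        return [self.prefix] + text.split()

    @contextmanager
    def as_target(self):
        """Temporarily encode with the target-language prefix."""
        self.prefix = self.tgt_lang
        try:
            yield self
        finally:
            # Always restore source mode, even if encoding raises.
            self.prefix = self.src_lang


tok = ToyTokenizer("en_XX", "ro_RO")
inputs = tok("hello world")
with tok.as_target():
    labels = tok("salut lume")
```

Restoring the prefix in a `finally` clause keeps the tokenizer usable for source text even when target encoding fails partway through.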
    • topk -> top_k (#9541) · dfbf0f55
      Lysandre Debut authored
    • LayoutLM Config (#9539) · a1100fac
      Lysandre Debut authored
    • Improve LayoutLM (#9476) · e45eba3b
      NielsRogge authored
      * Add LayoutLMForSequenceClassification and integration tests
      
      Improve docs
      
      Add LayoutLM notebook to list of community notebooks
      
      * Make style & quality
      
      * Address comments by @sgugger, @patrickvonplaten and @LysandreJik
      
      * Fix rebase with master
      
      * Reformat in one line
      
      * Improve code examples as requested by @patrickvonplaten
      Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    • [TFBart] Split TF-Bart (#9497) · 7f286132
      Patrick von Platen authored
      * make templates ready
      
      * make add_new_model_command_ready
      
      * finish tf bart
      
      * prepare tf mbart
      
      * finish tf bart
      
      * add tf mbart
      
      * add marian
      
      * prep pegasus
      
      * add tf pegasus
      
      * push blenderbot tf
      
      * add blenderbot
      
      * add blenderbot small
      
      * clean-up
      
      * make fix copy
      
      * define blend bot tok
      
      * fix
      
      * up
      
      * make style
      
      * add to docs
      
      * add copy statements
      
      * overwrite changes
      
      * improve
      
      * fix docs
      
      * finish
      
      * fix last slow test
      
      * fix missing git conflict line
      
      * fix blenderbot
      
      * up
      
      * fix blenderbot small
      
      * load changes
      
      * finish copied from
      
      * upload fix