1. 28 Aug, 2020 3 commits
    • Sam Shleifer's avatar
      prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654) · 9336086a
      Sam Shleifer authored
      * broken test
      
      * batch parity
      
      * tests pass
      
      * boom boom
      
      * boom boom
      
      * split out bart tokenizer tests
      
      * fix tests
      
      * boom boom
      
      * Fixed dataset bug
      
      * Fix marian
      
      * Undo extra
      
      * Get marian working
      
      * Fix t5 tok tests
      
      * Test passing
      
      * Cleanup
      
      * better assert msg
      
      * require torch
      
      * Fix mbart tests
      
      * undo extra decoder_attn_mask change
      
      * Fix import
      
      * pegasus tokenizer can ignore src_lang kwargs
      
      * unused kwarg test cov
      
      * boom boom
      
      * add todo for pegasus issue
      
      * cover one word translation edge case
      
      * Cleanup
      
      * doc
      9336086a
    • RafaelWO's avatar
      Transformer-XL: Improved tokenization with sacremoses (#6322) · cb276b41
      RafaelWO authored
      
      
      * Improved tokenization with sacremoses
      
       * The TransfoXLTokenizer is now using sacremoses for tokenization
       * Added tokenization of comma-separated and floating point numbers.
       * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses
       * Added corresponding tests
       * Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast
       * Added deprecation warning to TransfoXLTokenizerFast
      
      * isort change
      Co-authored-by: default avatarTeven <teven.lescao@gmail.com>
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      cb276b41
    • Stas Bekman's avatar
      [transformers-cli] fix logger getter (#6777) · 92ac2fa7
      Stas Bekman authored
      92ac2fa7
  2. 27 Aug, 2020 4 commits
  3. 26 Aug, 2020 4 commits
  4. 25 Aug, 2020 3 commits
  5. 24 Aug, 2020 1 commit
  6. 20 Aug, 2020 6 commits
  7. 19 Aug, 2020 5 commits
  8. 18 Aug, 2020 2 commits
  9. 17 Aug, 2020 5 commits
  10. 14 Aug, 2020 2 commits
  11. 13 Aug, 2020 2 commits
    • Lysandre Debut's avatar
      Test model outputs equivalence (#6445) · f7cbc13d
      Lysandre Debut authored
      * Test model outputs equivalence
      
      * Fix failing tests
      
      * From dict to kwargs
      
      * DistilBERT
      
      * Addressing @sgugger and @patrickvonplaten's comments
      f7cbc13d
    • Stas Bekman's avatar
      cleanup tf unittests: part 2 (#6260) · e983da0e
      Stas Bekman authored
      * cleanup torch unittests: part 2
      
      * remove trailing comma added by isort, and which breaks flake
      
      * one more comma
      
      * revert odd balls
      
      * part 3: odd cases
      
      * more ["key"] -> .key refactoring
      
      * .numpy() is not needed
      
      * more unncessary .numpy() removed
      
      * more simplification
      e983da0e
  12. 12 Aug, 2020 3 commits