1. 29 Oct, 2020 1 commit
    • Sylvain Gugger's avatar
      Add a template for examples and apply it for mlm and plm examples (#8153) · 69117628
      Sylvain Gugger authored
      * Add a template for example scripts and apply it to mlm
      
      * Formatting
      
      * Fix test
      
      * Add plm script
      
      * Add a template for example scripts and apply it to mlm
      
      * Formatting
      
      * Fix test
      
      * Add plm script
      
      * Add a template for example scripts and apply it to mlm
      
      * Formatting
      
      * Fix test
      
      * Add plm script
      
      * Styling
      69117628
  2. 28 Oct, 2020 1 commit
  3. 26 Oct, 2020 1 commit
  4. 22 Oct, 2020 1 commit
  5. 12 Oct, 2020 2 commits
  6. 01 Sep, 2020 1 commit
  7. 26 Aug, 2020 1 commit
  8. 29 Jul, 2020 1 commit
  9. 07 Jul, 2020 1 commit
    • Shashank Gupta's avatar
      Added data collator for permutation (XLNet) language modeling and related calls (#5522) · 3dcb748e
      Shashank Gupta authored
      * Added data collator for XLNet language modeling and related calls
      
      Added DataCollatorForXLNetLanguageModeling in data/data_collator.py
      to generate necessary inputs for language modeling training with
      XLNetLMHeadModel. Also added related arguments, logic and calls in
      examples/language-modeling/run_language_modeling.py.
      
      Resolves: #4739, #2008 (partially)
      
      * Changed name to `DataCollatorForPermutationLanguageModeling`
      
      Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModelling`.
      Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use.
      CTRL uses a CLM loss just like GPT and GPT-2, so should work out of the box with this script (provided `past` is taken care of
      similar to `mems` for XLNet).
      Changed calls and imports appropriately.
      
      * Added detailed comments, changed variable names
      
      Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain working. Also cleaned up variable names and made them more informative.
      
      * Added tests for new data collator
      
      Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences.
      
      * Fixed styling issues
      3dcb748e
  10. 25 May, 2020 1 commit
  11. 19 May, 2020 1 commit
  12. 18 May, 2020 1 commit
  13. 15 May, 2020 1 commit
  14. 14 May, 2020 1 commit
  15. 13 May, 2020 1 commit
  16. 08 May, 2020 1 commit
  17. 07 May, 2020 2 commits