1. 29 Nov, 2020 1 commit
  2. 27 Nov, 2020 2 commits
  3. 25 Nov, 2020 2 commits
    • Patrick von Platen's avatar
      [XLNet] Fix mems behavior (#8567) · 2a6fbe6a
      Patrick von Platen authored
      * fix mems in xlnet
      
      * fix use_mems
      
      * fix use_mem_len
      
      * fix use mems
      
      * clean docs
      
      * fix tf typo
      
      * make xlnet tf for generation work
      
      * fix tf test
      
      * refactor use cache
      
      * add use cache for missing models
      
      * correct use_cache in generate
      
      * correct use cache in tf generate
      
      * fix tf
      
      * correct getattr typo
      
      * make sylvain happy
      
      * change in docs as well
      
      * do not apply to cookie cutter statements
      
      * fix tf test
      
      * make pytorch model fully backward compatible
      2a6fbe6a
    • Sylvain Gugger's avatar
      Big model table (#8774) · 4821ea5a
      Sylvain Gugger authored
      
      
      * First draft
      
      * Styling
      
      * With all changes staged
      
      * Update docs/source/index.rst
      Co-authored-by: default avatarJulien Chaumond <chaumond@gmail.com>
      
      * Styling
      Co-authored-by: default avatarJulien Chaumond <chaumond@gmail.com>
      4821ea5a
  4. 24 Nov, 2020 1 commit
  5. 23 Nov, 2020 2 commits
    • Colin Brochtrup's avatar
      Add early stopping callback to pytorch trainer (#8581) · 8ffc01a7
      Colin Brochtrup authored
      * Add early stopping patience and minimum threshold metric must improve to prevent early stopping to pytorch trainer
      
      * Add early stopping test
      
      * Set patience counter to 0 if best metric not defined yet
      
      * Make early stopping a callback. Add callback event for updating the best metric for early stopping callback to trigger on.
      
      * Run make style
      
      * make funciton name sensible
      
      * Improve new argument docstring wording and hope that flakey CI test passes.
      
      * Use on_evaluation callback instead of custom. Remove some debug printing
      
      * Move early stopping arguments and state into early stopping callback
      
      * Run make style
      
      * Remove old code
      
      * Fix docs formatting. make style went rogue on me.
      
      * Remove copied attributes and fix variable
      
      * Add assertions on training arguments instead of mutating them. Move comment out of public docs.
      
      * Make separate test for early stopping callback. Add test of invalid arguments.
      
      * Run make style... I remembered before CI this time!
      
      * appease flake8
      
      * Add EarlyStoppingCallback to callback docs
      
      * Make docstring EarlyStoppingCallabck match other callbacks.
      
      * Fix typo in docs
      8ffc01a7
    • Sylvain Gugger's avatar
      Change default cache path (#8734) · 90002427
      Sylvain Gugger authored
      
      
      * Change default cache path
      
      * Document changes
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      Co-authored-by: default avatarLysandre Debut <lysandre@huggingface.co>
      90002427
  6. 22 Nov, 2020 1 commit
  7. 19 Nov, 2020 2 commits
    • Sylvain Gugger's avatar
      cb3e5c33
    • elk-cloner's avatar
      Tf longformer for sequence classification (#8231) · 5362bb8a
      elk-cloner authored
      
      
      * working on LongformerForSequenceClassification
      
      * add TFLongformerForMultipleChoice
      
      * add TFLongformerForTokenClassification
      
      * use add_start_docstrings_to_model_forward
      
      * test TFLongformerForSequenceClassification
      
      * test TFLongformerForMultipleChoice
      
      * test TFLongformerForTokenClassification
      
      * remove test from repo
      
      * add test and doc for TFLongformerForSequenceClassification, TFLongformerForTokenClassification, TFLongformerForMultipleChoice
      
      * add requested classes to modeling_tf_auto.py
      update dummy_tf_objects
      fix tests
      fix bugs in requested classes
      
      * pass all tests except test_inputs_embeds
      
      * sync with master
      
      * pass all tests except test_inputs_embeds
      
      * pass all tests
      
      * pass all tests
      
      * work on test_inputs_embeds
      
      * fix style and quality
      
      * make multi choice work
      
      * fix TFLongformerForTokenClassification signature
      
      * fix TFLongformerForMultipleChoice, TFLongformerForSequenceClassification signature
      
      * fix mult choice
      
      * fix mc hint
      
      * fix input embeds
      
      * fix input embeds
      
      * refactor input embeds
      
      * fix copy issue
      
      * apply sylvains changes and clean more
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      5362bb8a
  8. 18 Nov, 2020 2 commits
  9. 17 Nov, 2020 5 commits
  10. 16 Nov, 2020 1 commit
    • Sylvain Gugger's avatar
      Switch `return_dict` to `True` by default. (#8530) · 1073a2bd
      Sylvain Gugger authored
      * Use the CI to identify failing tests
      
      * Remove from all examples and tests
      
      * More default switch
      
      * Fixes
      
      * More test fixes
      
      * More fixes
      
      * Last fixes hopefully
      
      * Use the CI to identify failing tests
      
      * Remove from all examples and tests
      
      * More default switch
      
      * Fixes
      
      * More test fixes
      
      * More fixes
      
      * Last fixes hopefully
      
      * Run on the real suite
      
      * Fix slow tests
      1073a2bd
  11. 13 Nov, 2020 2 commits
  12. 12 Nov, 2020 2 commits
  13. 11 Nov, 2020 2 commits
  14. 10 Nov, 2020 7 commits
  15. 09 Nov, 2020 3 commits
  16. 06 Nov, 2020 1 commit
  17. 05 Nov, 2020 3 commits
  18. 03 Nov, 2020 1 commit
    • Patrick von Platen's avatar
      Refactoring the generate() function (#6949) · a1bbcf3f
      Patrick von Platen authored
      * first draft
      
      * show design proposition for new generate method
      
      * up
      
      * make better readable
      
      * make first version
      
      * gpt2 tests pass
      
      * make beam search for gpt2 work
      
      * add first encoder-decoder code
      
      * delete typo
      
      * make t5 work
      
      * save indermediate
      
      * make bart work with beam search
      
      * finish beam search bart / t5
      
      * add default kwargs
      
      * make more tests pass
      
      * fix no bad words sampler
      
      * some fixes and tests for all distribution processors
      
      * fix test
      
      * fix rag slow tests
      
      * merge to master
      
      * add nograd to generate
      
      * make all slow tests pass
      
      * speed up generate
      
      * fix edge case bug
      
      * small fix
      
      * correct typo
      
      * add type hints and docstrings
      
      * fix typos in tests
      
      * add beam search tests
      
      * add tests for beam scorer
      
      * fix test rag
      
      * finish beam search tests
      
      * move generation tests in seperate file
      
      * fix generation tests
      
      * more tests
      
      * add aggressive generation tests
      
      * fix tests
      
      * add gpt2 sample test
      
      * add more docstring
      
      * add more docs
      
      * finish doc strings
      
      * apply some more of sylvains and sams comments
      
      * fix some typos
      
      * make fix copies
      
      * apply lysandres and sylvains comments
      
      * final corrections on examples
      
      * small fix for reformer
      a1bbcf3f