"vscode:/vscode.git/clone" did not exist on "4ab8ab4f50baf391612cbc78cfa3f09b7ad0c3ac"
  1. 25 Nov, 2020 1 commit
    • Patrick von Platen's avatar
      [XLNet] Fix mems behavior (#8567) · 2a6fbe6a
      Patrick von Platen authored
      * fix mems in xlnet
      
      * fix use_mems
      
      * fix use_mem_len
      
      * fix use mems
      
      * clean docs
      
      * fix tf typo
      
      * make xlnet tf for generation work
      
      * fix tf test
      
      * refactor use cache
      
      * add use cache for missing models
      
      * correct use_cache in generate
      
      * correct use cache in tf generate
      
      * fix tf
      
      * correct getattr typo
      
      * make sylvain happy
      
      * change in docs as well
      
      * do not apply to cookie cutter statements
      
      * fix tf test
      
      * make pytorch model fully backward compatible
      2a6fbe6a
  2. 20 Nov, 2020 1 commit
  3. 18 Nov, 2020 1 commit
  4. 16 Nov, 2020 1 commit
  5. 03 Nov, 2020 1 commit
    • Patrick von Platen's avatar
      Refactoring the generate() function (#6949) · a1bbcf3f
      Patrick von Platen authored
      * first draft
      
      * show design proposition for new generate method
      
      * up
      
      * make better readable
      
      * make first version
      
      * gpt2 tests pass
      
      * make beam search for gpt2 work
      
      * add first encoder-decoder code
      
      * delete typo
      
      * make t5 work
      
      * save indermediate
      
      * make bart work with beam search
      
      * finish beam search bart / t5
      
      * add default kwargs
      
      * make more tests pass
      
      * fix no bad words sampler
      
      * some fixes and tests for all distribution processors
      
      * fix test
      
      * fix rag slow tests
      
      * merge to master
      
      * add nograd to generate
      
      * make all slow tests pass
      
      * speed up generate
      
      * fix edge case bug
      
      * small fix
      
      * correct typo
      
      * add type hints and docstrings
      
      * fix typos in tests
      
      * add beam search tests
      
      * add tests for beam scorer
      
      * fix test rag
      
      * finish beam search tests
      
      * move generation tests in seperate file
      
      * fix generation tests
      
      * more tests
      
      * add aggressive generation tests
      
      * fix tests
      
      * add gpt2 sample test
      
      * add more docstring
      
      * add more docs
      
      * finish doc strings
      
      * apply some more of sylvains and sams comments
      
      * fix some typos
      
      * make fix copies
      
      * apply lysandres and sylvains comments
      
      * final corrections on examples
      
      * small fix for reformer
      a1bbcf3f
  6. 29 Oct, 2020 1 commit
  7. 26 Oct, 2020 1 commit
    • Sylvain Gugger's avatar
      Doc styling (#8067) · 08f534d2
      Sylvain Gugger authored
      * Important files
      
      * Styling them all
      
      * Revert "Styling them all"
      
      This reverts commit 7d029395fdae8513b8281cbc2a6c239f8093503e.
      
      * Syling them for realsies
      
      * Fix syntax error
      
      * Fix benchmark_utils
      
      * More fixes
      
      * Fix modeling auto and script
      
      * Remove new line
      
      * Fixes
      
      * More fixes
      
      * Fix more files
      
      * Style
      
      * Add FSMT
      
      * More fixes
      
      * More fixes
      
      * More fixes
      
      * More fixes
      
      * Fixes
      
      * More fixes
      
      * More fixes
      
      * Last fixes
      
      * Make sphinx happy
      08f534d2
  8. 19 Oct, 2020 1 commit
  9. 10 Sep, 2020 1 commit
    • Patrick von Platen's avatar
      Add "Leveraging Pretrained Checkpoints for Generation" Seq2Seq models. (#6594) · 7fd1febf
      Patrick von Platen authored
      * add conversion script
      
      * improve conversion script
      
      * make style
      
      * add tryout files
      
      * fix
      
      * update
      
      * add causal bert
      
      * better names
      
      * add tokenizer file as well
      
      * finish causal_bert
      
      * fix small bugs
      
      * improve generate
      
      * change naming
      
      * renaming
      
      * renaming
      
      * renaming
      
      * remove leftover files
      
      * clean files
      
      * add fix tokenizer
      
      * finalize
      
      * correct slow test
      
      * update docs
      
      * small fixes
      
      * fix link
      
      * adapt check repo
      
      * apply sams and sylvains recommendations
      
      * fix import
      
      * implement Lysandres recommendations
      
      * fix logger warn
      7fd1febf
  10. 09 Sep, 2020 1 commit
    • Stas Bekman's avatar
      [generation] consistently add eos tokens (#6982) · 03e363f9
      Stas Bekman authored
      Currently beam search returns inconsistent outputs - if hypos have different lengths we get eos, if they are the same - we don't.
      
      This PR makes the output consistent.
      
      Also why not also replace:
      
      ```
                  if sent_lengths[i] < max_length:
                      decoded[i, sent_lengths[i]] = eos_token_id
      ```
      with:
      ```
                  decoded[i, sent_lengths[i]] = eos_token_id
      ```
      Shouldn't eos always be there? If the data gets truncated, the caller needs to user a larger `max_length`.
      
      Please correct me if my logic is flawed.
      03e363f9
  11. 07 Sep, 2020 2 commits
    • Stas Bekman's avatar
      [gen utils] missing else case (#6980) · 848fbe1e
      Stas Bekman authored
      * [gen utils] missing else case
      
      1. `else` is missing - I hit that case while porting a model. Probably needs to assert there?
      2. also the comment on top seems to be outdated (just vocab_size is being set there)
      
      * typo
      848fbe1e
    • Stas Bekman's avatar
      typo (#6959) · c3317e1f
      Stas Bekman authored
      there is no var `decoder_input_ids`, but there is `input_ids` for decoder :)
      c3317e1f
  12. 01 Sep, 2020 1 commit
    • Patrick von Platen's avatar
      [Generate] Facilitate PyTorch generate using `ModelOutputs` (#6735) · afc4ece4
      Patrick von Platen authored
      * fix generate for GPT2 Double Head
      
      * fix gpt2 double head model
      
      * fix  bart / t5
      
      * also add for no beam search
      
      * fix no beam search
      
      * fix encoder decoder
      
      * simplify t5
      
      * simplify t5
      
      * fix t5 tests
      
      * fix BART
      
      * fix transfo-xl
      
      * fix conflict
      
      * integrating sylvains and sams comments
      
      * fix tf past_decoder_key_values
      
      * fix enc dec test
      afc4ece4
  13. 26 Aug, 2020 2 commits
  14. 20 Aug, 2020 1 commit
  15. 14 Aug, 2020 2 commits
  16. 13 Aug, 2020 1 commit
  17. 11 Aug, 2020 1 commit
    • guillaume-be's avatar
      [Performance improvement] "Bad tokens ids" optimization (#6064) · 40478291
      guillaume-be authored
      * Optimized banned token masking
      
      * Avoid duplicate EOS masking if in bad_words_id
      
      * Updated mask generation to handle empty banned token list
      
      * Addition of unit tests for the updated bad_words_ids masking
      
      * Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test
      
      * Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test (timeout does not work on Windows)
      
      * Moving Marian import to the test context to allow TF only environments to run
      
      * Moving imports to torch_available test
      
      * Updated operations device and test
      
      * Updated operations device and test
      
      * Added docstring and comment for in-place scores modification
      
      * Moving test to own test_generation_utils, use of lighter models for testing
      
      * removed unneded imports in test_modeling_common
      
      * revert formatting change for ModelTesterMixin
      
      * Updated caching, simplified eos token id test, removed unnecessary @require_torch
      
      * formatting compliance
      40478291
  18. 03 Jul, 2020 1 commit
  19. 30 Jun, 2020 1 commit