1. 06 Oct, 2020 1 commit
  2. 01 Oct, 2020 1 commit
  3. 29 Sep, 2020 1 commit
  4. 01 Sep, 2020 1 commit
    • Patrick von Platen's avatar
      [Generate] Facilitate PyTorch generate using `ModelOutputs` (#6735) · afc4ece4
      Patrick von Platen authored
      * fix generate for GPT2 Double Head
      
      * fix gpt2 double head model
      
      * fix  bart / t5
      
      * also add for no beam search
      
      * fix no beam search
      
      * fix encoder decoder
      
      * simplify t5
      
      * simplify t5
      
      * fix t5 tests
      
      * fix BART
      
      * fix transfo-xl
      
      * fix conflict
      
      * integrating sylvains and sams comments
      
      * fix tf past_decoder_key_values
      
      * fix enc dec test
      afc4ece4
  5. 26 Aug, 2020 1 commit
  6. 24 Aug, 2020 1 commit
  7. 20 Aug, 2020 1 commit
  8. 14 Aug, 2020 1 commit
  9. 13 Aug, 2020 1 commit
    • Stas Bekman's avatar
      cleanup tf unittests: part 2 (#6260) · e983da0e
      Stas Bekman authored
      * cleanup torch unittests: part 2
      
      * remove trailing comma added by isort, and which breaks flake
      
      * one more comma
      
      * revert odd balls
      
      * part 3: odd cases
      
      * more ["key"] -> .key refactoring
      
      * .numpy() is not needed
      
      * more unncessary .numpy() removed
      
      * more simplification
      e983da0e
  10. 04 Aug, 2020 1 commit
  11. 31 Jul, 2020 1 commit
  12. 23 Jul, 2020 1 commit
  13. 01 Jul, 2020 1 commit
  14. 24 Jun, 2020 1 commit
  15. 16 Jun, 2020 1 commit
  16. 05 Jun, 2020 1 commit
  17. 02 Jun, 2020 1 commit
    • Julien Chaumond's avatar
      Kill model archive maps (#4636) · d4c2cb40
      Julien Chaumond authored
      * Kill model archive maps
      
      * Fixup
      
      * Also kill model_archive_map for MaskedBertPreTrainedModel
      
      * Unhook config_archive_map
      
      * Tokenizers: align with model id changes
      
      * make style && make quality
      
      * Fix CI
      d4c2cb40
  18. 27 May, 2020 1 commit
  19. 19 May, 2020 2 commits
  20. 01 May, 2020 1 commit
    • Julien Chaumond's avatar
      [ci] Load pretrained models into the default (long-lived) cache · f54dc3f4
      Julien Chaumond authored
      There's an inconsistency right now where:
      - we load some models into CACHE_DIR
      - and some models in the default cache
      - and often, in both for the same models
      
      When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.
      
      I'd rather always use the default cache
      f54dc3f4
  21. 20 Mar, 2020 1 commit
  22. 08 Mar, 2020 3 commits
  23. 03 Mar, 2020 1 commit
    • Patrick von Platen's avatar
      Add generate() functionality to TF 2.0 (#3063) · 41341003
      Patrick von Platen authored
      * add first copy past test to tf 2 generate
      
      * add tf top_k_top_p_filter fn
      
      * add generate function for TF
      
      * add generate function for TF
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * make style
      
      * change permission of test file to correct ones
      
      * delete ipdb
      
      * delete ipdb
      
      * fix bug and finish simple gpt2 integration test
      
      * clean test file
      
      * clean test file
      
      * make style
      
      * make style
      
      * make style
      
      * make style
      
      * change import style
      
      * change import style
      
      * make style
      
      * make style
      
      * add decorators
      
      * add decorators
      
      * fix tf ctrl bug dim => axis in TF
      
      * make style
      
      * make style
      
      * refactored test file
      
      * refactored test file
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * remove useless files
      
      * remove useless files
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * solve conflicts
      
      * solve conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * merge conflicts
      
      * delete ipdb
      
      * exposed top_k_top_p_filtering fns
      
      * delete weirdly created w! file
      
      * add comment to test tf common modeling
      
      * fix conflicts
      
      * fix conflicts
      
      * make style
      
      * merge conflicts
      
      * make style
      
      * change tf.tensor.shape to shape_list(tensor)
      41341003
  24. 26 Feb, 2020 2 commits
  25. 25 Feb, 2020 2 commits
  26. 24 Feb, 2020 1 commit
    • Patrick von Platen's avatar
      Add slow generate tests for pretrained lm models (#2909) · 17c45c39
      Patrick von Platen authored
      * add slow generate lm_model tests
      
      * fix conflicts
      
      * merge conflicts
      
      * fix conflicts
      
      * add slow generate lm_model tests
      
      * make style
      
      * delete unused variable
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * delete unused variable
      
      * fix conflicts
      
      * finished hard coded tests
      17c45c39
  27. 21 Feb, 2020 1 commit
    • Patrick von Platen's avatar
      Improve special_token_id logic in run_generation.py and add tests (#2885) · fc38d4c8
      Patrick von Platen authored
      
      
      * improving generation
      
      * finalized special token behaviour for no_beam_search generation
      
      * solved modeling_utils merge conflict
      
      * solve merge conflicts in modeling_utils.py
      
      * add run_generation improvements from PR #2749
      
      * adapted language generation to not use hardcoded -1 if no padding token is available
      
      * remove the -1 removal as hard coded -1`s are not necessary anymore
      
      * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown
      
      * add slow language generation tests for pretrained models using hardcoded output with pytorch seed
      
      * delete ipdb
      
      * check that all generated tokens are valid
      
      * renaming
      
      * renaming Generation -> Generate
      
      * make style
      
      * updated so that generate_beam_search has same token behavior than generate_no_beam_search
      
      * consistent return format for run_generation.py
      
      * deleted pretrain lm generate tests -> will be added in another PR
      
      * cleaning of unused if statements and renaming
      
      * run_generate will always return an iterable
      
      * make style
      
      * consistent renaming
      
      * improve naming, make sure generate function always returns the same tensor, add docstring
      
      * add slow tests for all lmhead models
      
      * make style and improve example comments modeling_utils
      
      * better naming and refactoring in modeling_utils
      
      * improving generation
      
      * finalized special token behaviour for no_beam_search generation
      
      * solved modeling_utils merge conflict
      
      * solve merge conflicts in modeling_utils.py
      
      * add run_generation improvements from PR #2749
      
      * adapted language generation to not use hardcoded -1 if no padding token is available
      
      * remove the -1 removal as hard coded -1`s are not necessary anymore
      
      * add lightweight language generation testing for randomely initialized models - just checking whether no errors are thrown
      
      * add slow language generation tests for pretrained models using hardcoded output with pytorch seed
      
      * delete ipdb
      
      * check that all generated tokens are valid
      
      * renaming
      
      * renaming Generation -> Generate
      
      * make style
      
      * updated so that generate_beam_search has same token behavior than generate_no_beam_search
      
      * consistent return format for run_generation.py
      
      * deleted pretrain lm generate tests -> will be added in another PR
      
      * cleaning of unused if statements and renaming
      
      * run_generate will always return an iterable
      
      * make style
      
      * consistent renaming
      
      * improve naming, make sure generate function always returns the same tensor, add docstring
      
      * add slow tests for all lmhead models
      
      * make style and improve example comments modeling_utils
      
      * better naming and refactoring in modeling_utils
      
      * changed fast random lm generation testing design to more general one
      
      * delete in old testing design in gpt2
      
      * correct old variable name
      
      * temporary fix for encoder_decoder lm generation tests - has to be updated when t5 is fixed
      
      * adapted all fast random generate tests to new design
      
      * better warning description in modeling_utils
      
      * better comment
      
      * better comment and error message
      Co-authored-by: default avatarThomas Wolf <thomwolf@users.noreply.github.com>
      fc38d4c8
  28. 06 Jan, 2020 2 commits
  29. 22 Dec, 2019 6 commits