1. 02 Jun, 2020 1 commit
    • Julien Chaumond's avatar
      Kill model archive maps (#4636) · d4c2cb40
      Julien Chaumond authored
      * Kill model archive maps
      
      * Fixup
      
      * Also kill model_archive_map for MaskedBertPreTrainedModel
      
      * Unhook config_archive_map
      
      * Tokenizers: align with model id changes
      
      * make style && make quality
      
      * Fix CI
      d4c2cb40
  2. 27 May, 2020 1 commit
  3. 01 May, 2020 1 commit
    • Julien Chaumond's avatar
      [ci] Load pretrained models into the default (long-lived) cache · f54dc3f4
      Julien Chaumond authored
      There's an inconsistency right now where:
      - we load some models into CACHE_DIR
      - and some models in the default cache
      - and often, in both for the same models
      
      When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.
      
      I'd rather always use the default cache
      f54dc3f4
  4. 09 Mar, 2020 1 commit
  5. 08 Mar, 2020 1 commit
  6. 03 Mar, 2020 1 commit
    • Patrick von Platen's avatar
      Add generate() functionality to TF 2.0 (#3063) · 41341003
      Patrick von Platen authored
      * add first copy past test to tf 2 generate
      
      * add tf top_k_top_p_filter fn
      
      * add generate function for TF
      
      * add generate function for TF
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * make style
      
      * change permission of test file to correct ones
      
      * delete ipdb
      
      * delete ipdb
      
      * fix bug and finish simple gpt2 integration test
      
      * clean test file
      
      * clean test file
      
      * make style
      
      * make style
      
      * make style
      
      * make style
      
      * change import style
      
      * change import style
      
      * make style
      
      * make style
      
      * add decorators
      
      * add decorators
      
      * fix tf ctrl bug dim => axis in TF
      
      * make style
      
      * make style
      
      * refactored test file
      
      * refactored test file
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * remove useless files
      
      * remove useless files
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * solve conflicts
      
      * solve conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * merge conflicts
      
      * delete ipdb
      
      * exposed top_k_top_p_filtering fns
      
      * delete weirdly created w! file
      
      * add comment to test tf common modeling
      
      * fix conflicts
      
      * fix conflicts
      
      * make style
      
      * merge conflicts
      
      * make style
      
      * change tf.tensor.shape to shape_list(tensor)
      41341003
  7. 06 Jan, 2020 2 commits
  8. 22 Dec, 2019 8 commits
  9. 21 Dec, 2019 2 commits
    • Aymeric Augustin's avatar
      Reformat source code with black. · fa84ae26
      Aymeric Augustin authored
      This is the result of:
      
          $ black --line-length 119 examples templates transformers utils hubconf.py setup.py
      
      There's a lot of fairly long lines in the project. As a consequence, I'm
      picking the longest widely accepted line length, 119 characters.
      
      This is also Thomas' preference, because it allows for explicit variable
      names, to make the code easier to understand.
      fa84ae26
    • Aymeric Augustin's avatar
      Take advantage of the cache when running tests. · b670c266
      Aymeric Augustin authored
      Caching models across test cases and across runs of the test suite makes
      slow tests somewhat more bearable.
      
      Use gettempdir() instead of /tmp in tests. This makes it easier to
      change the location of the cache with semi-standard TMPDIR/TEMP/TMP
      environment variables.
      
      Fix #2222.
      b670c266
  10. 13 Dec, 2019 1 commit
  11. 06 Dec, 2019 1 commit
    • Aymeric Augustin's avatar
      Remove dependency on pytest for running tests (#2055) · 35401fe5
      Aymeric Augustin authored
      * Switch to plain unittest for skipping slow tests.
      
      Add a RUN_SLOW environment variable for running them.
      
      * Switch to plain unittest for PyTorch dependency.
      
      * Switch to plain unittest for TensorFlow dependency.
      
      * Avoid leaking open files in the test suite.
      
      This prevents spurious warnings when running tests.
      
      * Fix unicode warning on Python 2 when running tests.
      
      The warning was:
      
          UnicodeWarning: Unicode equal comparison failed to convert both arguments to Unicode - interpreting them as being unequal
      
      * Support running PyTorch tests on a GPU.
      
      Reverts 27e015bd.
      
      * Tests no longer require pytest.
      
      * Make tests pass on cuda
      35401fe5
  12. 09 Oct, 2019 1 commit
  13. 08 Oct, 2019 1 commit
  14. 04 Oct, 2019 1 commit
    • keskarnitish's avatar
      Adding CTRL (squashed commit) · dbed1c5d
      keskarnitish authored
      adding conversion script
      
      adding first draft of modeling & tokenization
      
      adding placeholder for test files
      
      bunch of changes
      
      registering the tokenizer/model/etc
      
      tests
      
      change link; something is very VERY wrong here
      
      weird end-of-word thingy going on
      
      i think the tokenization works now ; wrote the unit tests
      
      overall structure works;load w next
      
      the monster is alive!
      
      works after some cleanup as well
      
      adding emacs autosave to gitignore
      
      currently only supporting the 48 layer one; seems to infer fine on my macbook
      
      cleanup
      
      fixing some documentation
      
      fixing some documentation
      
      tests passing?
      
      now works on CUDA also
      
      adding greedy?
      
      adding greedy sampling
      
      works well
      dbed1c5d
  15. 26 Sep, 2019 1 commit
  16. 09 Sep, 2019 3 commits
  17. 08 Sep, 2019 2 commits
  18. 05 Sep, 2019 1 commit
  19. 04 Sep, 2019 1 commit
  20. 21 Aug, 2019 2 commits
  21. 12 Jul, 2019 1 commit
  22. 11 Jul, 2019 1 commit
  23. 05 Jul, 2019 2 commits
  24. 02 Jul, 2019 2 commits