1. 20 May, 2021 1 commit
  2. 15 Mar, 2021 1 commit
  3. 12 Mar, 2021 1 commit
    • Nicolas Patry's avatar
      Adding new parameter to `generate`: `max_time`. (#9846) · 543d0549
      Nicolas Patry authored
      * [WIP] Adding new parameter to `generate`:  `max_time`.
      
      Generation by tokens number is sometimes a bit clunky because we don't
      know how many tokens are good enough or even how many tokens are in
      the payload (for pipelines users for instance). This leads to hard
      to understand behavior.
      
      This PR proposes a new argument `max_time` which is a float of seconds
      for the allowed time for `generate` to run on.
      Ideally combinations of `max_tokens=None`, `max_time=2` could be used to
      generate as many tokens as possible within time budget.
      
      NB: Another possible approach consists of passing a callback to `generate`
        putting the caller in charge of the actual decision of when to stop
        generating tokens. It opens the door to 'which args should we pass'
        to this callback. It's hard to imagine other use-cases for this
        early stopping behavior than time (that are not already covered by
        parameters of generate)
      
      * Revamp with StoppingCriteria
      
      * Removing deprecated mentions.
      
      * Forgot arguments to stopping criteria.
      
      * Readding max_length it's not just used as a stopping criteria.
      
      * Default value for `stopping_criteria`.
      
      * Address @patrickvonplaten comments.
      
      - More docstrings
      - Actual doc
      - Include in global namespace
      - Remove TF work.
      
      * Put back `max_length` (deprecation different PR).
      
      * Doc quality.
      
      * Fixing old behavior without `stopping_criteria` but with `max_length`.
      
      Making sure we don't break that in the future.
      
      * Adding more tests for possible inconsistencies between
      
      `max_length` and `stopping_criteria`.
      
      * Fixing the torch imports.
      543d0549
  4. 19 Jan, 2021 1 commit
  5. 22 Dec, 2020 1 commit
  6. 07 Dec, 2020 1 commit
  7. 23 Nov, 2020 1 commit
  8. 17 Nov, 2020 1 commit
  9. 16 Nov, 2020 2 commits
    • Sylvain Gugger's avatar
      Switch `return_dict` to `True` by default. (#8530) · 1073a2bd
      Sylvain Gugger authored
      * Use the CI to identify failing tests
      
      * Remove from all examples and tests
      
      * More default switch
      
      * Fixes
      
      * More test fixes
      
      * More fixes
      
      * Last fixes hopefully
      
      * Use the CI to identify failing tests
      
      * Remove from all examples and tests
      
      * More default switch
      
      * Fixes
      
      * More test fixes
      
      * More fixes
      
      * Last fixes hopefully
      
      * Run on the real suite
      
      * Fix slow tests
      1073a2bd
    • LSinev's avatar
      Fix GPT2DoubleHeadsModel to work with model.generate() (#6601) · afb50c66
      LSinev authored
      * Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used
      
      and for GPT2LMHeadModel too
      
      * Update tests to check token_type_ids usage in GPT2 models
      afb50c66
  10. 09 Nov, 2020 1 commit
  11. 03 Nov, 2020 1 commit
    • Patrick von Platen's avatar
      Refactoring the generate() function (#6949) · a1bbcf3f
      Patrick von Platen authored
      * first draft
      
      * show design proposition for new generate method
      
      * up
      
      * make better readable
      
      * make first version
      
      * gpt2 tests pass
      
      * make beam search for gpt2 work
      
      * add first encoder-decoder code
      
      * delete typo
      
      * make t5 work
      
      * save indermediate
      
      * make bart work with beam search
      
      * finish beam search bart / t5
      
      * add default kwargs
      
      * make more tests pass
      
      * fix no bad words sampler
      
      * some fixes and tests for all distribution processors
      
      * fix test
      
      * fix rag slow tests
      
      * merge to master
      
      * add nograd to generate
      
      * make all slow tests pass
      
      * speed up generate
      
      * fix edge case bug
      
      * small fix
      
      * correct typo
      
      * add type hints and docstrings
      
      * fix typos in tests
      
      * add beam search tests
      
      * add tests for beam scorer
      
      * fix test rag
      
      * finish beam search tests
      
      * move generation tests in seperate file
      
      * fix generation tests
      
      * more tests
      
      * add aggressive generation tests
      
      * fix tests
      
      * add gpt2 sample test
      
      * add more docstring
      
      * add more docs
      
      * finish doc strings
      
      * apply some more of sylvains and sams comments
      
      * fix some typos
      
      * make fix copies
      
      * apply lysandres and sylvains comments
      
      * final corrections on examples
      
      * small fix for reformer
      a1bbcf3f
  12. 21 Oct, 2020 1 commit
  13. 14 Oct, 2020 1 commit
  14. 06 Oct, 2020 1 commit
  15. 01 Oct, 2020 1 commit
  16. 29 Sep, 2020 1 commit
  17. 01 Sep, 2020 1 commit
    • Patrick von Platen's avatar
      [Generate] Facilitate PyTorch generate using `ModelOutputs` (#6735) · afc4ece4
      Patrick von Platen authored
      * fix generate for GPT2 Double Head
      
      * fix gpt2 double head model
      
      * fix  bart / t5
      
      * also add for no beam search
      
      * fix no beam search
      
      * fix encoder decoder
      
      * simplify t5
      
      * simplify t5
      
      * fix t5 tests
      
      * fix BART
      
      * fix transfo-xl
      
      * fix conflict
      
      * integrating sylvains and sams comments
      
      * fix tf past_decoder_key_values
      
      * fix enc dec test
      afc4ece4
  18. 26 Aug, 2020 1 commit
  19. 24 Aug, 2020 1 commit
  20. 20 Aug, 2020 1 commit
  21. 14 Aug, 2020 1 commit
  22. 13 Aug, 2020 1 commit
    • Stas Bekman's avatar
      cleanup tf unittests: part 2 (#6260) · e983da0e
      Stas Bekman authored
      * cleanup torch unittests: part 2
      
      * remove trailing comma added by isort, and which breaks flake
      
      * one more comma
      
      * revert odd balls
      
      * part 3: odd cases
      
      * more ["key"] -> .key refactoring
      
      * .numpy() is not needed
      
      * more unncessary .numpy() removed
      
      * more simplification
      e983da0e
  23. 04 Aug, 2020 1 commit
  24. 31 Jul, 2020 1 commit
  25. 23 Jul, 2020 1 commit
  26. 01 Jul, 2020 1 commit
  27. 24 Jun, 2020 1 commit
  28. 16 Jun, 2020 1 commit
  29. 05 Jun, 2020 1 commit
  30. 02 Jun, 2020 1 commit
    • Julien Chaumond's avatar
      Kill model archive maps (#4636) · d4c2cb40
      Julien Chaumond authored
      * Kill model archive maps
      
      * Fixup
      
      * Also kill model_archive_map for MaskedBertPreTrainedModel
      
      * Unhook config_archive_map
      
      * Tokenizers: align with model id changes
      
      * make style && make quality
      
      * Fix CI
      d4c2cb40
  31. 27 May, 2020 1 commit
  32. 19 May, 2020 2 commits
  33. 01 May, 2020 1 commit
    • Julien Chaumond's avatar
      [ci] Load pretrained models into the default (long-lived) cache · f54dc3f4
      Julien Chaumond authored
      There's an inconsistency right now where:
      - we load some models into CACHE_DIR
      - and some models in the default cache
      - and often, in both for the same models
      
      When running the RUN_SLOW tests, this takes a lot of disk space, time, and bandwidth.
      
      I'd rather always use the default cache
      f54dc3f4
  34. 20 Mar, 2020 1 commit
  35. 08 Mar, 2020 3 commits
  36. 03 Mar, 2020 1 commit
    • Patrick von Platen's avatar
      Add generate() functionality to TF 2.0 (#3063) · 41341003
      Patrick von Platen authored
      * add first copy past test to tf 2 generate
      
      * add tf top_k_top_p_filter fn
      
      * add generate function for TF
      
      * add generate function for TF
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * implemented generate for all models expect transfoXL
      
      * make style
      
      * change permission of test file to correct ones
      
      * delete ipdb
      
      * delete ipdb
      
      * fix bug and finish simple gpt2 integration test
      
      * clean test file
      
      * clean test file
      
      * make style
      
      * make style
      
      * make style
      
      * make style
      
      * change import style
      
      * change import style
      
      * make style
      
      * make style
      
      * add decorators
      
      * add decorators
      
      * fix tf ctrl bug dim => axis in TF
      
      * make style
      
      * make style
      
      * refactored test file
      
      * refactored test file
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * take out test_torch_tf_conversion if nothing is defined
      
      * remove useless files
      
      * remove useless files
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * solve conflicts
      
      * solve conflicts
      
      * fix conflicts
      
      * fix conflicts
      
      * merge conflicts
      
      * delete ipdb
      
      * exposed top_k_top_p_filtering fns
      
      * delete weirdly created w! file
      
      * add comment to test tf common modeling
      
      * fix conflicts
      
      * fix conflicts
      
      * make style
      
      * merge conflicts
      
      * make style
      
      * change tf.tensor.shape to shape_list(tensor)
      41341003