1. 23 Dec, 2020 2 commits
    • Add caching mechanism to BERT, RoBERTa (#9183) · 88ef8893
      Suraj Patil authored
      * add past_key_values
      
      * add use_cache option
      
      * make mask before cutting ids
      
      * adjust position_ids according to past_key_values
      
      * flatten past_key_values
      
      * fix positional embeds
      
      * fix _reorder_cache
      
      * set use_cache to false when not decoder, fix attention mask init
      
      * add test for caching
      
      * add past_key_values for Roberta
      
      * fix position embeds
      
      * add caching test for roberta
      
      * add doc
      
      * make style
      
      * doc, fix attention mask, test
      
      * small fixes
      
      * address patrick's comments
      
      * input_ids shouldn't start with pad token
      
      * use_cache only when decoder
      
      * make consistent with bert
      
      * make copies consistent
      
      * add use_cache to encoder
      
      * add past_key_values to tapas attention
      
      * apply suggestions from code review
      
      * make copies consistent
      
      * add attn mask in tests
      
      * remove copied from longformer
      
      * apply suggestions from code review
      
      * fix bart test
      
      * nit
      
      * simplify model outputs
      
      * fix doc
      
      * fix output ordering
    • Minor documentation revisions from copyediting (#9266) · bcc87c63
      Connor Brinton authored
      * typo: Revise "checkout" to "check out"
      
      * typo: Change "seemlessly" to "seamlessly"
      
      * typo: Close parentheses in "Using the tokenizer"
      
      * typo: Add closing parenthesis to supported models aside
      
      * docs: Treat ``position_ids`` as plural
      
      Alternatively, the word "argument" could be added to make the subject singular.
      
      * docs: Remove comma, making subordinate clause
      
      * docs: Remove comma separating verb and direct object
      
      * docs: Fix typo ("next" -> "text")
      
      * docs: Reverse phrase order to simplify sentence
      
      * docs: "quicktour" -> "quick tour"
      
      * docs: "to throw" -> "from throwing"
      
      * docs: Remove disruptive newline in padding/truncation section
      
      * docs: "show exemplary" -> "show examples of"
      
      * docs: "much harder as" -> "much harder than"
      
      * docs: Fix typo "seach" -> "search"
      
      * docs: Fix subject-verb disagreement in WordPiece description
      
      * docs: Fix style in preprocessing.rst
  2. 22 Dec, 2020 2 commits
  3. 21 Dec, 2020 1 commit
  4. 19 Dec, 2020 1 commit
  5. 18 Dec, 2020 1 commit
  6. 17 Dec, 2020 9 commits
  7. 16 Dec, 2020 4 commits
  8. 15 Dec, 2020 2 commits
    • [WIP] Tapas v4 (tres) (#9117) · 1551e2dc
      NielsRogge authored
      
      
      * First commit: adding all files from tapas_v3
      
      * Fix multiple bugs including soft dependency and new structure of the library
      
      * Improve testing by adding torch_device to inputs and adding dependency on scatter
      
      * Use Python 3 inheritance rather than Python 2
      
      * First draft model cards of base sized models
      
      * Remove model cards as they are already on the hub
      
      * Fix multiple bugs with integration tests
      
      * All model integration tests pass
      
      * Remove print statement
      
      * Add test for convert_logits_to_predictions method of TapasTokenizer
      
      * Incorporate suggestions by Google authors
      
      * Fix remaining tests
      
      * Change position embeddings sizes to 512 instead of 1024
      
      * Comment out positional embedding sizes
      
      * Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES
      
      * Added more model names
      
      * Fix truncation when no max length is specified
      
      * Disable torchscript test
      
      * Make style & make quality
      
      * Quality
      
      * Address CI needs
      
      * Test the Masked LM model
      
      * Fix the masked LM model
      
      * Truncate when overflowing
      
      * More much needed docs improvements
      
      * Fix some URLs
      
      * Some more docs improvements
      
      * Test PyTorch scatter
      
      * Set to slow + minify
      
      * Calm flake8 down
      
      * Add add_pooling_layer argument to TapasModel
      
      Fix comments by @sgugger and @patrickvonplaten
      
      * Fix issue in docs + fix style and quality
      
      * Clean up conversion script and add task parameter to TapasConfig
      
      * Revert the task parameter of TapasConfig
      
      Some minor fixes
      
      * Improve conversion script and add test for absolute position embeddings
      
      * Improve conversion script and add test for absolute position embeddings
      
      * Fix bug with reset_position_index_per_cell arg of the conversion cli
      
      * Add notebooks to the examples directory and fix style and quality
      
      * Apply suggestions from code review
      
      * Move from `nielsr/` to `google/` namespace
      
      * Apply Sylvain's comments
      Co-authored-by: sgugger <sylvain.gugger@gmail.com>
      Co-authored-by: Rogge Niels <niels.rogge@howest.be>
      Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    • Added TF OpenAi GPT1 Sequence Classification (#9105) · 389aba34
      sandip authored
      
      
      * TF OpenAI GPT Sequence Classification
      
      * Update src/transformers/models/openai/modeling_tf_openai.py
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  9. 14 Dec, 2020 4 commits
  10. 11 Dec, 2020 1 commit
  11. 10 Dec, 2020 2 commits
  12. 09 Dec, 2020 5 commits
    • [Bart] Refactor - fix issues, consistency with the library, naming (#8900) · 06971ac4
      Patrick von Platen authored
      * remove make on the fly linear embedding
      
      * start refactor
      
      * big first refactor
      
      * save intermediate
      
      * save intermediate
      
      * correct mask issue
      
      * save tests
      
      * refactor padding masks
      
      * make all tests pass
      
      * further refactor
      
      * make pegasus test pass
      
      * fix bool if
      
      * fix leftover tests
      
      * continue
      
      * bart renaming
      
      * delete torchscript test hack
      
      * fix imports in tests
      
      * correct shift
      
      * fix docs and repo cons
      
      * re-add fix for FSTM
      
      * typo in test
      
      * fix typo
      
      * fix another typo
      
      * continue
      
      * hot fix 2 for tf
      
      * small fixes
      
      * refactor types linting
      
      * continue
      
      * finish refactor
      
      * fix import in tests
      
      * better bart names
      
      * further refactor and add test
      
      * delete hack
      
      * apply sylvains and lysandres comments
      
      * small perf improv
      
      * further perf improv
      
      * improv perf
      
      * fix typo
      
      * make style
      
      * small perf improv
    • Add MP Net 2 (#9004) · df2af6d8
      StillKeepTry authored
    • push (#9008) · da37a21c
      Patrick von Platen authored
    • Sylvain Gugger's avatar
      7e1d709e
    • Diverse beam search 2 (#9006) · 02d0e035
      Patrick von Platen authored
      
      
      * diverse beam search
      
      * bug fixes
      
      * bug fixes
      
      * bug fix
      
      * separate out diverse_beam_search function
      
      * separate out diverse_beam_search function
      
      * bug fix
      
      * improve code quality
      
      * bug fix
      
      * bug fix
      
      * separate out diverse beam search scorer
      
      * code format
      
      * code format
      
      * code format
      
      * code format
      
      * add test
      
      * code format
      
      * documentation changes
      
      * code quality
      
      * add slow integration tests
      
      * more general name
      
      * refactor into logits processor
      
      * add test
      
      * avoid too much copy paste
      
      * refactor
      
      * add to docs
      
      * fix-copies
      
      * bug fix
      
      * Revert "bug fix"
      
      This reverts commit c99eb5a8dc57a7b0d33a8ac06d8c6a32a7812ad4.
      
      * improve comment
      
      * implement sylvains feedback
      Co-authored-by: Ayush Jain <a.jain@sprinklr.com>
      Co-authored-by: ayushtiku5 <40797286+ayushtiku5@users.noreply.github.com>
  13. 07 Dec, 2020 3 commits
    • Copyright (#8970) · 00aa9dbc
      Sylvain Gugger authored
      * Add copyright everywhere missing
      
      * Style
    • Navjot's avatar
      c108d0b5
    • Add TFGPT2ForSequenceClassification based on DialogRPT (#8714) · 483e1327
      sandip authored
      * Add TFGPT2ForSequenceClassification based on DialogRPT
      
      * Add TFGPT2ForSequenceClassification based on DialogRPT
      
      * TFGPT2ForSequenceClassification based on DialogRPT-refactored code, implemented review comments and added input processing
      
      * Add TFGPT2ForSequenceClassification based on DialogRPT
      
      * TFGPT2ForSequenceClassification based on DialogRPT-refactored code, implemented review comments and added input processing
      
      * code refactor for latest other TF PR
      
      * code refactor
      
      * code refactor
      
      * Update modeling_tf_gpt2.py
  14. 03 Dec, 2020 2 commits
  15. 02 Dec, 2020 1 commit