1. 27 Jan, 2021 1 commit
  2. 26 Jan, 2021 1 commit
  3. 20 Jan, 2021 6 commits
  4. 14 Jan, 2021 2 commits
  5. 13 Jan, 2021 4 commits
    • v4.2.0 documentation · 33a8497d
      Lysandre authored
    • Release: v4.2.0 · 7d9a9d0c
      Lysandre authored
    • Doc: Update pretrained_models wording (#9545) · 247a7b20
      Julien Chaumond authored
      * Update pretrained_models.rst
      
      To clarify things cf. this tweet for instance https://twitter.com/RTomMcCoy/status/1349094111505211395
      
      * format
    • [trainer] deepspeed integration (#9211) · 2df34f4a
      Stas Bekman authored
      
      
      * deepspeed integration
      
      * style
      
      * add test
      
      * ds wants to do its own backward
      
      * fp16 assert
      
      * Update src/transformers/training_args.py
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * style
      
      * for clarity extract what args are being passed to deepspeed
      
      * introduce the concept of self.wrapped_model
      
      * s/self.wrapped_model/self.model_wrapped/
      
      * complete transition to self.wrapped_model / self.model
      
      * fix
      
      * doc
      
      * give ds its own init
      
      * add custom overrides, handle bs correctly
      
      * fix test
      
      * clean up model_init logic, fix small bug
      
      * complete fix
      
      * collapse --deepspeed_config into --deepspeed
      
      * style
      
      * start adding doc notes
      
      * style
      
      * implement hf2ds optimizer and scheduler configuration remapping
      
      * oops
      
      * call get_num_training_steps absolutely when needed
      
      * workaround broken auto-formatter
      
      * deepspeed_config arg is no longer needed - fixed in deepspeed master
      
      * use hf's fp16 args in config
      
      * clean
      
      * start on the docs
      
      * rebase cleanup
      
      * finish up --fp16
      
      * clarify the supported stages
      
      * big refactor thanks to discovering deepspeed.init_distributed
      
      * cleanup
      
      * revert fp16 part
      
      * add checkpoint-support
      
      * more init ds into integrations
      
      * extend docs
      
      * cleanup
      
      * unfix docs
      
      * clean up old code
      
      * imports
      
      * move docs
      
      * fix logic
      
      * make it clear which file it's referring to
      
      * document nodes/gpus
      
      * style
      
      * wrong format
      
      * style
      
      * deepspeed handles gradient clipping
      
      * easier to read
      
      * major doc rewrite
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * docs
      
      * switch to AdamW optimizer
      
      * style
      
      * Apply suggestions from code review
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
      
      * clarify doc
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  6. 12 Jan, 2021 2 commits
    • Improve LayoutLM (#9476) · e45eba3b
      NielsRogge authored
      
      
      * Add LayoutLMForSequenceClassification and integration tests
      
      Improve docs
      
      Add LayoutLM notebook to list of community notebooks
      
      * Make style & quality
      
      * Address comments by @sgugger, @patrickvonplaten and @LysandreJik
      
      * Fix rebase with master
      
      * Reformat in one line
      
      * Improve code examples as requested by @patrickvonplaten
      Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
      Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    • [TFBart] Split TF-Bart (#9497) · 7f286132
      Patrick von Platen authored
      * make templates ready
      
      * make add_new_model_command_ready
      
      * finish tf bart
      
      * prepare tf mbart
      
      * finish tf bart
      
      * add tf mbart
      
      * add marian
      
      * prep pegasus
      
      * add tf pegasus
      
      * push blenderbot tf
      
      * add blenderbot
      
      * add blenderbot small
      
      * clean-up
      
      * make fix copy
      
      * define blend bot tok
      
      * fix
      
      * up
      
      * make style
      
      * add to docs
      
      * add copy statements
      
      * overwrite changes
      
      * improve
      
      * fix docs
      
      * finish
      
      * fix last slow test
      
      * fix missing git conflict line
      
      * fix blenderbot
      
      * up
      
      * fix blenderbot small
      
      * load changes
      
      * finish copied from
      
      * upload fix
  7. 11 Jan, 2021 1 commit
  8. 08 Jan, 2021 1 commit
  9. 07 Jan, 2021 1 commit
  10. 06 Jan, 2021 7 commits
    • Add flags to return scores, hidden states and / or attention weights in GenerationMixin (#9150) · c89f1bc9
      Simon Brandeis authored
      
      
      * Define new output dataclasses for greedy generation
      
      * Add output_[...] flags in greedy generation methods
      
      Added output_attentions, output_hidden_states, output_scores flags in
      generate and greedy_search methods in GenerationMixin.
      
      * [WIP] Implement logic and tests for output flags in generation
      
      * Update GreedySearchOutput classes & docstring
      
      * Implement greedy search output accumulation logic
      
      Update greedy_search unittests
      
      Fix generate method return value docstring
      
      Properly init flags with the default config
      
      * Update configuration to add output_scores flag
      
      * Fix test_generation_utils
      
      Sort imports and fix isinstance tests for GreedySearchOutputs
      
      * Fix typo in generation_utils
      
      * Add return_dict_in_generate for backwards compatibility
      
      * Add return_dict_in_generate flag in config
      
      * Fix typo in configuration
      
      * Fix handling of attentions and hidden_states flags
      
      * Make style & quality
      
      * first attempt attentions
      
      * some corrections
      
      * improve tests
      
      * special models requires special test
      
      * disable xlm test for now
      
      * clean tests
      
      * fix for tf
      
      * isort
      
      * Add output dataclasses for other generation methods
      
      * Add logic to return dict in sample generation
      
      * Complete test for sample generation
      
      - Pass output_attentions and output_hidden_states flags to encoder in
      encoder-decoder models
      - Fix import statements order in test_generation_utils file
      
      * Add logic to return dict in sample generation
      
      - Refactor tests to avoid using self.assertTrue, which provides
      scarce information when the test fails
      - Add tests for the three beam_search methods: vanilla, sample and
      grouped
      
      * Style doc
      
      * Fix copy-paste error in generation tests
      
      * Rename logits to scores and refactor
      
      * Refactor group_beam_search for consistency
      
      * make style
      
      * add sequences_scores
      
      * fix all tests
      
      * add docs
      
      * fix beam search finalize test
      
      * correct docstring
      
      * clean some files
      
      * Made suggested changes to the documentation
      
      * Style doc ?
      
      * Style doc using the Python util
      
      * Update src/transformers/generation_utils.py
      
      * fix empty lines
      
      * fix all test
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    • Improve documentation coverage for Phobert (#9427) · ecfcac22
      Qbiwan authored
      
      
      * first commit
      
      * change phobert to phoBERT as per author in overview
      
      * v3 and v4 both run on the same code, hence there is no need to differentiate them
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    • Improve documentation coverage for Herbert (#9428) · be898998
      Qbiwan authored
      * first commit
      
      * changed XLMTokenizer to HerbertTokenizer in code example
    • finalize (#9431) · b972c1bf
      Patrick von Platen authored
    • Upgrade styler to better handle lists (#9423) · bcb55d33
      Sylvain Gugger authored
      * Add missing lines before a new list.
      
      * Update doc styler and restyle some files.
      
      * Fix docstrings of LED and Longformer
    • Fix URLs to TAPAS notebooks (#9435) · b7e54897
      NielsRogge authored
    • [docs] outline sharded ddp doc (#9208) · d64372fd
      Stas Bekman authored
      
      
      * outline sharded ddp doc
      
      * fix link
      
      * add example
      
      * Apply suggestions from code review
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * narrow the command and remove non-essentials
      Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
  11. 05 Jan, 2021 3 commits
    • [PyTorch Bart] Split Bart into different models (#9343) · eef66035
      Patrick von Platen authored
      * first try
      
      * remove old template
      
      * finish bart
      
      * finish mbart
      
      * delete unnecessary line
      
      * init pegasus
      
      * save intermediate
      
      * correct pegasus
      
      * finish pegasus
      
      * remove cookie cutter leftover
      
      * add marian
      
      * finish blenderbot
      
      * replace in file
      
      * correctly split blenderbot
      
      * delete "old" folder
      
      * correct "add statement"
      
      * adapt config for tf comp
      
      * correct configs for tf
      
      * remove ipdb
      
      * fix more stuff
      
      * fix mbart
      
      * push pegasus fix
      
      * fix mbart
      
      * more fixes
      
      * fix research projects code
      
      * finish docs for bart, mbart, and marian
      
      * delete unnecessary file
      
      * correct attn typo
      
      * correct configs
      
      * remove pegasus for seq class
      
      * correct peg docs
      
      * correct peg docs
      
      * finish configs
      
      * further improve docs
      
      * add copied from statements to mbart
      
      * fix copied from in mbart
      
      * add copy statements to marian
      
      * add copied from to marian
      
      * add pegasus copied from
      
      * finish pegasus
      
      * finish copied from
      
      * Apply suggestions from code review
      
      * make style
      
      * backward comp blenderbot
      
      * apply lysandres and sylvains suggestions
      
      * apply suggestions
      
      * push last fixes
      
      * fix docs
      
      * fix tok tests
      
      * fix imports code style
      
      * fix doc
    • LED (#9278) · 189387e9
      Patrick von Platen authored
      * create model
      
      * add integration
      
      * save current state
      
      * make integration tests pass
      
      * add one more test
      
      * add explanation to tests
      
      * remove from bart
      
      * add padding
      
      * remove unnecessary test
      
      * make all tests pass
      
      * re-add cookie cutter tests
      
      * finish PyTorch
      
      * fix attention test
      
      * Update tests/test_modeling_common.py
      
      * revert change
      
      * remove unused file
      
      * add string to doc
      
      * save intermediate
      
      * make tf integration tests pass
      
      * finish tf
      
      * fix doc
      
      * fix docs again
      
      * add led to doctree
      
      * add to auto tokenizer
      
      * added tips for led
      
      * make style
      
      * apply jplus statements
      
      * correct tf longformer
      
      * apply lysandres suggestions
      
      * apply sylvains suggestions
      
      * Apply suggestions from code review
    • Fix documentation links always pointing to master. (#9217) · 314cca28
      Sugeeth authored
      
      
      * Use extlinks to point hyperlink with the version of code
      
      * Point to version on release and master until then
      
      * Apply style
      
      * Correct links
      
      * Add missing backtick
      
      * Simple missing backtick after all.
      Co-authored-by: Raghavendra Sugeeth P S <raghav-5305@raghav-5305.csez.zohocorpin.com>
      Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
  12. 04 Jan, 2021 2 commits
  13. 24 Dec, 2020 1 commit
  14. 23 Dec, 2020 2 commits
    • Add caching mechanism to BERT, RoBERTa (#9183) · 88ef8893
      Suraj Patil authored
      * add past_key_values
      
      * add use_cache option
      
      * make mask before cutting ids
      
      * adjust position_ids according to past_key_values
      
      * flatten past_key_values
      
      * fix positional embeds
      
      * fix _reorder_cache
      
      * set use_cache to false when not decoder, fix attention mask init
      
      * add test for caching
      
      * add past_key_values for Roberta
      
      * fix position embeds
      
      * add caching test for roberta
      
      * add doc
      
      * make style
      
      * doc, fix attention mask, test
      
      * small fixes
      
      * address patrick's comments
      
      * input_ids shouldn't start with pad token
      
      * use_cache only when decoder
      
      * make consistent with bert
      
      * make copies consistent
      
      * add use_cache to encoder
      
      * add past_key_values to tapas attention
      
      * apply suggestions from code review
      
      * make copies consistent
      
      * add attn mask in tests
      
      * remove copied from longformer
      
      * apply suggestions from code review
      
      * fix bart test
      
      * nit
      
      * simplify model outputs
      
      * fix doc
      
      * fix output ordering
    • Minor documentation revisions from copyediting (#9266) · bcc87c63
      Connor Brinton authored
      * typo: Revise "checkout" to "check out"
      
      * typo: Change "seemlessly" to "seamlessly"
      
      * typo: Close parentheses in "Using the tokenizer"
      
      * typo: Add closing parenthesis to supported models aside
      
      * docs: Treat ``position_ids`` as plural
      
      Alternatively, the word "argument" could be added to make the subject singular.
      
      * docs: Remove comma, making subordinate clause
      
      * docs: Remove comma separating verb and direct object
      
      * docs: Fix typo ("next" -> "text")
      
      * docs: Reverse phrase order to simplify sentence
      
      * docs: "quicktour" -> "quick tour"
      
      * docs: "to throw" -> "from throwing"
      
      * docs: Remove disruptive newline in padding/truncation section
      
      * docs: "show exemplary" -> "show examples of"
      
      * docs: "much harder as" -> "much harder than"
      
      * docs: Fix typo "seach" -> "search"
      
      * docs: Fix subject-verb disagreement in WordPiece description
      
      * docs: Fix style in preprocessing.rst
  15. 22 Dec, 2020 2 commits
  16. 21 Dec, 2020 1 commit
  17. 19 Dec, 2020 1 commit
  18. 18 Dec, 2020 1 commit
  19. 17 Dec, 2020 1 commit