- 20 Apr, 2021 5 commits
-
-
Sylvain Gugger authored
* Update to use datasets remove_cloumns method * Quality
-
Suraj Patil authored
* create local attention mask ones * remove old method, address patricks comment
-
Patrick von Platen authored
* remove update function * update * refactor more * refactor
-
rajvi-k authored
* initial changes * modified evaluation * updated evaluation * updated evaluation on text translation example script * added translation example script * Formatted translation example script * Reformatted translation example * Fixed evaluation bug and added support for other tokenisers * Fixed evaluation bug and added support for other tokenisers * Added translation example script * Formatted summarization example script * Removed typos from summarization example script
-
Sylvain Gugger authored
-
- 19 Apr, 2021 4 commits
-
-
Sylvain Gugger authored
-
Stas Bekman authored
* fix the placement on device with fp16_full_eval * deepspeed never goes on device
-
TAE YOUNGDON authored
* Update language_modeling.py in "class TextDatasetForNextSentencePrediction(Dataset)", double considering "self.tokenizer.num_special_tokens_to_add(pair=True)" so, i remove self.block_size, and add parameter for "def create_examples_from_document". like "class LineByLineWithSOPTextDataset" do * Update language_modeling.py
-
e authored
-
- 16 Apr, 2021 5 commits
-
-
Sylvain Gugger authored
* Bulk of the work * Polish and tests * Update QA Trainer * Avoid breaking the predict method * Deprecation warnings * Store real eval dataloder * Get eval dataset reference before wrap
-
Lysandre authored
-
Nicolas Patry authored
* [WIP] Enabling multilingual models for translation pipelines. * decoder_input_ids -> forced_bos_token_id * Improve docstring. * Rebase * Fixing 2 bugs - Type token_ids coming from `_parse_and_tokenize` - Wrong index from tgt_lang. * Fixing black version. * Adding tests for _build_translation_inputs and add them for all tokenizers. * Mbart actually puts the lang code at the end. * Fixing m2m100. * Adding TF support to `deep_round`. * Update src/transformers/pipelines/text2text_generation.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Adding one line comment. * Fixing M2M100 `_build_translation_input_ids`, and fix the call site. * Fixing tests + deep_round -> nested_simplify Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre Debut authored
-
Stas Bekman authored
missed this updating when bumped the version.
-
- 15 Apr, 2021 3 commits
-
-
Sylvain Gugger authored
* Save fast tokenizers in both formats * Fix for HerBERT * Proper fix * Properly test new behavior
-
Sylvain Gugger authored
-
Nicolas Patry authored
* Adding task aliases and adding `token-classification` and `text-classification` tasks. * Cleaning docstring.
-
- 14 Apr, 2021 10 commits
-
-
Sylvain Gugger authored
* IterableDatasetShard * Test and integration in Trainer * Update src/transformers/trainer_pt_utils.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Style Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Stas Bekman authored
* test on one node 2 gpus max * fix the other place * refactor * fix * cleanup * more exact version
-
Sylvain Gugger authored
-
Stas Bekman authored
* add 2 points of reference to the offline mode * link the new doc * add error message * Update src/transformers/modeling_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style * rename * Trigger CI Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yusuke Mori authored
* Add prefix to examples in model_doc rst * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Thomas Wood authored
* Update modeling_gpt_neo.py dimention -> dimension * Update configuration_speech_to_text.py dimention -> dimension
-
Sudharsan S T authored
Co-authored-by:Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
-
Lysandre Debut authored
* Updated stale bot * Specify issue number * Remove particular handling of assignees * Unleash the stalebot * Remove debug branch
-
Joel Stremmel authored
-
Nithin Holla authored
Co-authored-by:nithin19 <nithin@amberscript.com>
-
- 13 Apr, 2021 13 commits
-
-
Stas Bekman authored
* temp band-aid * style
-
Lysandre Debut authored
* Run CI on deepspeed and fairscale * Test it on this branch :) * Rename * Update the CI image
-
Sylvain Gugger authored
* Indent code block * Indent code blocks version 2 * Quality
-
Sylvain Gugger authored
-
Philipp Schmid authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* refactor GPT2 * fix mlp and head pruning * address Sylvains comments * apply suggestion from code review Co-authored-by:Lysandre Debut <lysandre@huggingface.co>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Replace error by warning when loading an architecture in another * Style * Style again * Add a test * Adapt old test
-
Yusuke Mori authored
* Start writing BERT-Japanese doc * Fix typo, Update toctree * Modify model file to use comment for document, Add examples * Clean bert_japanese by make style * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Split a big code block into two * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add prefix >>> to all lines in code blocks * Clean bert_japanese by make fixup Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Suraj Patil authored
-
Lysandre Debut authored
* Fix GPT-2 warnings * Update src/transformers/models/gpt2/modeling_gpt2.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com>
-
Lysandre Debut authored
-