- 24 Aug, 2020 5 commits
-
-
Sylvain Gugger authored
* Run new isort * More changes * Update CI, CONTRIBUTING and benchmarks
-
Teven authored
* Fixed DataCollatorForLanguageModeling + PermutationLanguageModeling not accepting lists of lists * Update data_collator.py * black was grumpy
-
sgugger authored
-
Sylvain Gugger authored
* Don't reset the type of the dataset * Formatting * Update trainer.py Co-authored-by: Teven <teven.lescao@gmail.com>
-
Jared T Nielsen authored
-
- 21 Aug, 2020 2 commits
-
-
Suraj Patil authored
* added CamembertForCausalLM * add in __init__ and auto model * style * doc
-
josephrocca authored
-
- 20 Aug, 2020 8 commits
-
-
Sylvain Gugger authored
* Add a classmethod to easily build a Trainer from nlp dataset and metric * Fix docstrings * Split train/eval * Formatting * Log dropped columns + docs * Authorize callable activations * Poc for auto activation * Be framework-agnostic * Formatting * Remove class method * Remove unnecessary code
-
Sam Shleifer authored
-
Ivan Dolgov authored
* xlnet fp16 bug fix * comment cast added * Update modeling_xlnet.py Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
-
Joe Davison authored
* TFTrainer dataset doc & fix evaluation bug discussed in #6551 * add docstring to test/eval datasets
-
Sylvain Gugger authored
* Add tests to Trainer * Test if removing long breaks everything * Remove ugly hack * Fix distributed test * Use float for number of epochs
-
sgugger authored
-
Prajjwal Bhargava authored
* removed redundant arg in prepare_inputs * made same change in prediction_loop
-
Oren Amsalem authored
-
- 19 Aug, 2020 5 commits
-
-
Sylvain Gugger authored
-
Suraj Patil authored
-
Sam Shleifer authored
-
Pradhy729 authored
* Feed forward chunking for Distilbert & Albert * Added ff chunking for many other models * Change model signature * Added chunking for XLM * Cleaned up by removing some variables. * remove test_chunking flag Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* start adding tie encoder to decoder functionality * finish model tying * make style * Apply suggestions from code review * fix t5 list including cross attention * apply sams suggestions * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * add max depth break point Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 18 Aug, 2020 3 commits
-
-
Sam Shleifer authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Ali Modarressi authored
* fixed label datatype for sts-b * naming update * make style * make style
-
Sam Shleifer authored
-
- 17 Aug, 2020 7 commits
-
-
Suraj Patil authored
* tests
-
Suraj Patil authored
* add mbart example * add Pegasus and MBart in readme * typo * add MBart in Pretrained models * add pre-proc doc * add DPR in readme * fix indent * doc fix
-
sgugger authored
-
Suraj Patil authored
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
-
Stas Bekman authored
-
Stas Bekman authored
* [testing] switch to a new TestCasePlus + get_auto_remove_tmp_dir() for auto-removal of tmp dirs * respect after=True for tempfile, simplify code * comments * comment fix * put `before` last in args, so can make debug even faster
-
Masatoshi Suzuki authored
* Update BERT Japanese tokenizers * Update CircleCI config to download unidic * Specify to use the latest dictionary packages
-
- 14 Aug, 2020 5 commits
-
-
Jin Young (Daniel) Sohn authored
Because of the bug that was introduced, we are currently taking two optimizer steps per batch: a global one, in which `xm.optimizer_step` injects a CRS (cross-replica sum) across all cores in training, and a second one without it. This has been hurting training accuracy (for example, XLNet GLUE on MNLI does not converge).
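The double update described above can be sketched with a toy loop. This is a minimal illustration only: `CountingOptimizer`, `sync_and_step`, and `train` are hypothetical names invented here, and `sync_and_step` merely stands in for a cross-core step such as `xm.optimizer_step`; it is not the Trainer code or the torch_xla API.

```python
class CountingOptimizer:
    """Hypothetical stand-in for a real optimizer; it only counts updates."""

    def __init__(self):
        self.steps = 0

    def step(self):
        self.steps += 1


def sync_and_step(optimizer):
    """Hypothetical stand-in for a cross-core step such as `xm.optimizer_step`.

    A real implementation would reduce gradients across cores first;
    this toy version only applies the update.
    """
    optimizer.step()


def train(num_batches, buggy):
    """Run a toy training loop and return how many updates were applied."""
    optimizer = CountingOptimizer()
    for _ in range(num_batches):
        sync_and_step(optimizer)  # global step, with cross-core reduction
        if buggy:
            optimizer.step()  # extra local step: the unintended second update
    return optimizer.steps


buggy_steps = train(num_batches=4, buggy=True)
fixed_steps = train(num_batches=4, buggy=False)
print(buggy_steps, fixed_steps)
```

With the extra call removed, the loop applies exactly one update per batch.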
-
Sylvain Gugger authored
* Generation doc * MBartForConditionalGeneration (#6441) * add MBartForConditionalGeneration * style * rebase and fixes * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS * fix docs * don't ignore mbart * doc * fix mbart fairseq link * put mbart before bart * apply doc suggestions * Use hash to clean the test dirs (#6475) * Use hash to clean the test dirs * Use hash to clean the test dirs * Use hash to clean the test dirs * fix * [EncoderDecoder] Add Cross Attention for GPT2 (#6415) * add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Sort unique_no_split_tokens to make it deterministic (#6461) * change unique_no_split_tokens's type to set * use sorted list instead of set * style * Import accuracy_score (#6480) * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling * Generation doc * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address comments * Styling Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Kevin Canwen Xu <canwenxu@126.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com> Co-authored-by: gijswijnholds <gijswijnholds@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Quentin Lhoest authored
* change unique_no_split_tokens's type to set * use sorted list instead of set * style
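The reason for preferring a sorted list over a bare set can be sketched as follows. This is a hedged illustration, not the tokenizer code: `dedupe_sorted` and the sample tokens are hypothetical. A set removes duplicates but guarantees no iteration order, whereas sorting the deduplicated values yields the same order on every run.

```python
def dedupe_sorted(tokens):
    """Remove duplicates, then sort so the result order is reproducible."""
    return sorted(set(tokens))


# Hypothetical sample of special tokens, with one duplicate.
tokens = ["[SEP]", "[CLS]", "<mask>", "[CLS]"]
unique_tokens = dedupe_sorted(tokens)
print(unique_tokens)  # ['<mask>', '[CLS]', '[SEP]']
```

Anything derived from `unique_tokens` by iteration then no longer depends on set iteration order, which for strings can differ between interpreter runs because of hash randomization.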
-
Patrick von Platen authored
* add cross attention layers for gpt2 * make gpt2 cross attention work * finish bert2gpt2 * add explicit comments * remove attention mask since not yet supported * revert attn mask in pipeline * Update src/transformers/modeling_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_encoder_decoder.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Suraj Patil authored
* add MBartForConditionalGeneration * style * rebase and fixes * add mbart test in TEST_FILES_WITH_NO_COMMON_TESTS * fix docs * don't ignore mbart * doc * fix mbart fairseq link * put mbart before bart * apply doc suggestions
-
- 13 Aug, 2020 5 commits
-
-
Suraj Patil authored
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Suraj Patil authored
-
Lysandre Debut authored
* Test model outputs equivalence * Fix failing tests * From dict to kwargs * DistilBERT * Addressing @sgugger and @patrickvonplaten's comments
-
Prajjwal Bhargava authored
-
Zhu Baohe authored
* fix * fix2 * fix3
-