- 01 Oct, 2021 1 commit
-
-
Anton Lozhkov authored
* Restore broken merge * Additional args, DDP, remove CommonLanguage * Update examples for V100, add training results * Style * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Remove custom datasets for simplicity, apply suggestions from code review * Add the attention_mask flag, reorganize README Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 30 Sep, 2021 1 commit
-
-
Patrick von Platen authored
-
- 29 Sep, 2021 1 commit
-
-
Stas Bekman authored
* missing requirement * list both
-
- 27 Sep, 2021 2 commits
- 26 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 24 Sep, 2021 5 commits
-
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Gunjan Chhablani authored
* Update run_glue.py * Update run_glue.py * Add model creation snippet to other scripts * Fix style
-
Patrick von Platen authored
-
Patrick von Platen authored
* up * rename * add asr example * add auto feature extractor * some more fixes * correct layerdrop * correct for multi-gpu dist * clean up * refactor * refactor * more fixes * more fixes * clean-up * finish * up * Apply suggestions from code review * fix isort * update * up * add note * apply surajs suggestions * Apply suggestions from code review Co-authored-by:
Suraj Patil <surajp815@gmail.com> * isort * small change * Apply suggestions from code review Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * Apply suggestions from code review Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com> * add hubert * Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Anton Lozhkov <aglozhkov@gmail.com>
-
- 22 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
* Make gradient_checkpointing a training argument * Update src/transformers/modeling_utils.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/configuration_utils.py Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Fix tests * Style * document Gradient Checkpointing as a performance feature * Small rename * PoC for not using the config * Adapt BC to new PoC * Forgot to save * Rollout changes to all other models * Fix typo Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> Co-authored-by:
Stas Bekman <stas@stason.org>
-
- 21 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
* Add push_to_hub to no_trainer examples * Quality * Document integration * Roll out to other examples
-
- 20 Sep, 2021 1 commit
-
-
Suraj Patil authored
-
- 15 Sep, 2021 1 commit
-
-
Patrick von Platen authored
* finish * delete bogus file * correct some stuff * finish * finish
-
- 09 Sep, 2021 1 commit
-
-
Aleksander Smywiński-Pohl authored
* Fix typo in deepspeed documentation * Add missing import in deepspeed configuration * Fix path in translation examples
-
- 07 Sep, 2021 1 commit
-
-
Nathan Raw authored
*
✅ Update image-classification example's tests *🔥 remove cats_and_dogs test samples *💄 fix flake8
-
- 06 Sep, 2021 2 commits
-
-
Suraj Patil authored
-
Suraj Patil authored
-
- 02 Sep, 2021 1 commit
-
-
Nathan Raw authored
*
✨ add pytorch image classification example *🔥 remove utils.py *💄 fix flake8 style issues *🔥 remove unnecessary line *✨ limit dataset sizes *📌 update reqs *🎨 restructure - use datasets lib *🎨 import transforms directly *📝 add comments *💄 style *🔥 remove flag *📌 update requirement warning *📝 add vision README.md *📝 update README.md *📝 update README.md *🎨 add image-classification tag to model card *🚚 rename vision➡ ️ image-classification *📝 update image-classification README.md
-
- 31 Aug, 2021 3 commits
-
-
Lysandre authored
-
Lysandre authored
-
Sylvain Gugger authored
* Add generate kwargs to Seq2SeqTrainingArguments * typo * Address review comments + doc * Style
-
- 30 Aug, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 28 Aug, 2021 1 commit
-
-
Stefan Schweter authored
* examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples
-
- 27 Aug, 2021 1 commit
-
-
Stefan Schweter authored
* examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples
-
- 19 Aug, 2021 1 commit
-
-
Allan Lin authored
* Update torch.utils.data namespaces to the latest. * Format * Update Dataloader. * Style
-
- 06 Aug, 2021 1 commit
-
-
Sylvain Gugger authored
* Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests
-
- 02 Aug, 2021 1 commit
-
-
Chungman Lee authored
* fix typo in example/text-classification README * add space to align the table
-
- 28 Jul, 2021 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 26 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 22 Jul, 2021 3 commits
-
-
Lysandre authored
-
Lysandre authored
-
Maxwell Forbes authored
-
- 08 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 07 Jul, 2021 1 commit
-
-
Souvic Chakraborty authored
* Validation split percentage to be used for custom data files also Issue same as https://github.com/huggingface/transformers/issues/12406 fixed for pytorch branch run_mlm.py * Validation split added in the right place * Update run_clm.py * validation split added for custom files * Validation split added for custom files * Update run_plm.py * fixed validation split for custom files as input for pytorch examples in lm * Update run_clm_no_trainer.py * args modified
-
- 28 Jun, 2021 2 commits
-
-
Bhadresh Savani authored
* added cotext manager to datasets map * fixed style and spaces * fixed warning of deprecation * changed desc
-
Taha ValizadehAslani authored
Before the code could not be used for validation only because of this line: extension = data_args.train_file.split(".")[-1] was assuming that extension must be extracted from the training dataset. This line would run regardless of the training or validation options of the user. This would lead to an error if the user only wants to run an evaluation only and does not want to do train (because the training file does not exist). I modified it to extract extension from the training file if the user wants to do train and extract it from the validation file if the user wants to run eval. This way the code can be used for both training and validation separately.
-
- 26 Jun, 2021 1 commit
-
-
Bhadresh Savani authored
-