- 20 Oct, 2021 2 commits
-
-
Kwanghee Choi authored
Co-authored-by: jonas <jonas@hpcnt.com>
-
Robert Stone authored
-
- 11 Oct, 2021 2 commits
-
-
Sylvain Gugger authored
-
Patrick von Platen authored
[Gradient checkpointing] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961) * up * correct test
-
- 07 Oct, 2021 1 commit
-
-
Alex Hedges authored
-
- 06 Oct, 2021 3 commits
-
-
Anton Lozhkov authored
-
Sylvain Gugger authored
-
Yanming Wang authored
* Fix logging_nan_inf_filter in torch_xla mode * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix format Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 05 Oct, 2021 1 commit
-
-
Zhaofeng Wu authored
* Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler * Fix
-
- 27 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
Co-authored-by: quantitative-technologies <james.hirschorn@quantitative-technologies.com> Co-authored-by: quantitative-technologies <james.hirschorn@quantitative-technologies.com>
-
- 26 Sep, 2021 1 commit
-
-
Patrick von Platen authored
[Trainer] Make sure shown loss in distributed training is correctly averaged over all workers (#13681) * push * improve tr loss gather
-
- 23 Sep, 2021 2 commits
-
-
kding1 authored
* update trainer with cpu distributed fine-tuning support. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Style. * refinement on cpu dist training check. Signed-off-by: Ding, Ke <ke.ding@intel.com> * style. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Test over private field not public one. Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com> Co-authored-by: Funtowicz Morgan <mfuntowicz@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
kding1 authored
* add sigopt hpo to transformers. Signed-off-by: Ding, Ke <ke.ding@intel.com> * extend sigopt changes to test code and others. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Style. * fix style for sigopt integration. Signed-off-by: Ding, Ke <ke.ding@intel.com> * Add necessary information to run unittests on SigOpt. Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
-
- 22 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
* Make gradient_checkpointing a training argument * Update src/transformers/modeling_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Update src/transformers/configuration_utils.py Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Fix tests * Style * document Gradient Checkpointing as a performance feature * Small rename * PoC for not using the config * Adapt BC to new PoC * Forgot to save * Rollout changes to all other models * Fix typo Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Stas Bekman <stas@stason.org>
-
- 17 Sep, 2021 1 commit
-
-
Patrick von Platen authored
* finish * add test * push * remove unnecessary code * up * correct test * Update src/transformers/training_args.py
-
- 14 Sep, 2021 2 commits
-
-
elishowk authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Push to hub when saving checkpoints * Add model card * Revert partial model card * Small fix for checkpoint * Add tests * Add documentation * Fix tests * Bump huggingface_hub * Fix test
-
- 09 Sep, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 31 Aug, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 30 Aug, 2021 3 commits
-
-
Olatunji Ruwase authored
* Use DS callable API to allow hf_scheduler + ds_optimizer * Preserve backward-compatibility * Restore backward compatibility * Tweak arg positioning * Tweak arg positioning * bump the required version * Undo indent * Update src/transformers/trainer.py * style Co-authored-by: Stas Bekman <stas@stason.org> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
Maxwell Forbes authored
-
Li-Huai (Allan) Lin authored
* Check None before going through iteration * Format
-
- 23 Aug, 2021 1 commit
-
-
Philipp Schmid authored
* Barrier -> barrier * added logger for metrics * removed stream handler in trainer * moved handler * removed streamhandler from trainer * updated test image and instance type added datasets version to test * Update tests/sagemaker/scripts/pytorch/requirements.txt Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
- 19 Aug, 2021 1 commit
-
-
Allan Lin authored
* Update torch.utils.data namespaces to the latest. * Format * Update Dataloader. * Style
-
- 06 Aug, 2021 2 commits
-
-
Sylvain Gugger authored
* Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Getting owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests
-
Sylvain Gugger authored
* Initial work * All auto models * All tf auto models * All flax auto models * Tokenizers * Add feature extractors * Fix typos * Fix other typo * Use the right config * Remove old mapping names and update logic in AutoTokenizer * Update check_table * Fix copies and check_repo script * Fix last test * Add back name * clean up * Update template * Update template * Forgot a ) * Use alternative to fixup * Fix TF model template * Address review comments * Address review comments * Style
-
- 03 Aug, 2021 1 commit
-
-
Philip May authored
* fix #12970 * Update tests/test_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/test_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove unnecessary issue link * fix test formatting Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 30 Jul, 2021 1 commit
-
-
wulu473 authored
Co-authored-by: Lukas Wutschitz <lukas.wutschitz@microsoft.com>
-
- 26 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 21 Jul, 2021 3 commits
-
-
Sylvain Gugger authored
-
Stas Bekman authored
-
Masatoshi TSUCHIYA authored
* Refer to warmup_ratio when setting warmup_num_steps. * Add a method to get number of warmup steps to TrainingArguments class. * Fix. * Fix.
-
- 14 Jul, 2021 1 commit
-
-
Stas Bekman authored
* [trainer] release tmp memory in checkpoint load * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 09 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 08 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 07 Jul, 2021 2 commits
-
-
Sylvain Gugger authored
* Double check for attribute * Use right name
-
shabie authored
-
- 30 Jun, 2021 1 commit
-
-
Sylvain Gugger authored
* Add option to save on each training node * Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> * Address review comments Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
- 23 Jun, 2021 1 commit
-
-
Sylvain Gugger authored
* Clean push to hub API * Create working dir if it does not exist * Different tweak * New API + all models + test Flax * Adds the Trainer clean up * Update src/transformers/file_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * (nit) output types * No need to set clone_from when folder exists * Update src/transformers/trainer.py Co-authored-by: Julien Chaumond <julien@huggingface.co> * Add generated_from_trainer tag * Update to new version * Fixes Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-
- 22 Jun, 2021 1 commit
-
-
Stas Bekman authored
* bug fixes and a rename * add extended DDP test
-