- 24 Nov, 2023 1 commit
-
-
Sourab Mangrulkar authored
* add code changes 1. Refactor FSDP 2. Add `--save_only_model` option: When checkpointing, whether to only save the model, or also the optimizer, scheduler & rng state. 3. Bump up the minimum `accelerate` version to `0.21.0` * quality * fix quality? * Revert "fix quality?" This reverts commit 149330a6abc078827be274db84c8a2d26a76eba1. * fix fsdp doc strings * fix quality * Update src/transformers/training_args.py Co-authored-by:
Zach Mueller <muellerzr@gmail.com> * please fix the quality issue
馃槄 * Apply suggestions from code review Co-authored-by:Benjamin Bossan <BenjaminBossan@users.noreply.github.com> * address comment * simplify conditional check as per the comment * update documentation --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com> Co-authored-by:
Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
-
- 22 Nov, 2023 1 commit
-
-
Quentin Gallou茅dec authored
* fix max_steps doc * Update src/transformers/training_args.py [ci skip] Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * propagate suggested change --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
- 14 Nov, 2023 2 commits
-
-
Zach Mueller authored
* Add tokens seen * Address comments, add to TrainingArgs * Update log * Apply suggestions from code review Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Use self.args * Fix docstring Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Costa Huang authored
* Minor type annotation fix * Trigger Build
-
- 13 Nov, 2023 1 commit
-
-
Tomasz Cichy authored
Docstring entry for `gradient_checkpointing_kwargs` was `gradient_checkpointing_args`. This is incorrect.
-
- 09 Nov, 2023 1 commit
-
-
Dave Berenbaum authored
* dvclive trainer callback * style fixes * dvclive link fixes
-
- 07 Nov, 2023 1 commit
-
-
Plemeur authored
* Allow for scheduler kwargs * Formatting * Arguments checks, passing the tests * Black failed somehow --------- Co-authored-by:Pierre <pierre@avatarin.com>
-
- 01 Nov, 2023 1 commit
-
-
Zach Mueller authored
* Enable split_batches through TrainingArguments * Extra dispatch_batches * Keep as default false * Add to docstring * Add to docstring * Remove the capturewarnings change * Comma
-
- 31 Oct, 2023 2 commits
-
-
Lysandre Debut authored
* Safetensors serialization by default * First pass on the tests * Second pass on the tests * Third pass on the tests * Fix TF weight loading from TF-format safetensors * Specific encoder-decoder fixes for weight crossloading * Add VisionEncoderDecoder fixes for TF too * Change filename test for pt-to-tf * One missing fix for TFVisionEncoderDecoder * Fix the other crossload test * Support for flax + updated tests * Apply suggestions from code review Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Sanchit's comments * Sanchit's comments 2 * Nico's comments * Fix tests * cleanup * Apply suggestions from code review Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
Matt <rocketknight1@gmail.com> Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Younes Belkada authored
* add v1 neftune * use `unwrap_model` instead * add test + docs * Apply suggestions from code review Co-authored-by:
Zach Mueller <muellerzr@gmail.com> * more details * fixup * Update docs/source/en/main_classes/trainer.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * refactor a bit * more elaborated test * fix unwrap issue --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com> Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 30 Oct, 2023 2 commits
-
-
Hz, Ji authored
* remove the obsolete code related to fairscale FSDP * apple review suggestion
-
Younes Belkada authored
* add `gradient_checkpointing_kwargs` in trainer and training arguments * add comment * add test - currently failing * now tests pass
-
- 26 Oct, 2023 1 commit
-
-
L. Yeung authored
* docs(training_args): correct docstrings Correct docstrings of these methods in `TrainingArguments`: - `set_save` - `set_logging` * docs(training_args): adjust words in docstrings Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * docs(trainer): correct a typo in comments --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
- 12 Oct, 2023 1 commit
-
-
Tom Aarsen authored
Add missing spaces in adjacent strings
-
- 11 Oct, 2023 1 commit
-
-
Ben Gubler authored
* feat: update callback doc to explain disabling callbacks using report_to * docs: update report_to docstring
-
- 06 Oct, 2023 1 commit
-
-
statelesshz authored
* remove SharedDDP as it was drepracated * apply review suggestion * make style * Oops,forgot to remove the compute_loss context manager in Seq2SeqTrainer. * remove the unnecessary conditional statement * keep the logic of IPEX * clean code * mix precision setup & make fixup --------- Co-authored-by:statelesshz <jihuazhong1@huawei.com>
-
- 04 Oct, 2023 1 commit
-
-
statelesshz authored
-
- 27 Sep, 2023 1 commit
-
-
statelesshz authored
Co-authored-by:statelesshz <jihuazhong1@huawei.com>
-
- 26 Sep, 2023 1 commit
-
-
Nathan Lambert authored
add rmsprop
-
- 13 Sep, 2023 2 commits
-
-
Abhilash Majumder authored
flex gpu bug fix
-
Serizao authored
* Update training_args.py Missing distributed state so lign 1813-1814 failed because value is undefined * Update training_args.py Co-authored-by:
Zach Mueller <muellerzr@gmail.com> --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com>
-
- 07 Sep, 2023 1 commit
-
-
CokeDong authored
* Add tgs metrics * bugfix and black formatting * workaround for tokens counting * formating and bugfix * Fix * Add opt-in for tgs metrics * make style and fix error * Fix doc * fix docbuild * hf-doc-build * fix * test * Update src/transformers/training_args.py renaming Co-authored-by:
Zach Mueller <muellerzr@gmail.com> * Update src/transformers/training_args.py renaming Co-authored-by:
Zach Mueller <muellerzr@gmail.com> * Fix some symbol * test * Update src/transformers/trainer_utils.py match nameing patterns Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/training_args.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/trainer.py nice Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Fix reviews * Fix * Fix black --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com> Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 05 Sep, 2023 2 commits
-
-
Abhilash Majumder authored
* patch with accelerate xpu * patch with accelerate xpu * formatting * fix tests * revert ruff unrelated fixes * revert ruff unrelated fixes * revert ruff unrelated fixes * fix test * review fixes * review fixes * black fixed * review commits * review commits * style fix * use pytorch_utils * revert markuplm test
-
Sahel Sharify authored
This cl iterates through a list of keys rather than dict items while updating the dict elements. Fixes the following error: File "..../transformers/training_args.py", line 1544, in post_init for k, v in self.fsdp_config.items(): RuntimeError: dictionary keys changed during iteration
-
- 01 Sep, 2023 1 commit
-
-
Zach Mueller authored
* Revert frozen training arguments * TODO
-
- 29 Aug, 2023 1 commit
-
-
Arup De authored
* add FSDP config option to enable activation-checkpointing * update docs * add checks and remove redundant code * fix formatting error
-
- 25 Aug, 2023 1 commit
-
-
Younes Belkada authored
* move deepspeed to `lib_integrations.deepspeed` * more refactor * oops * fix slow tests * Fix docs * fix docs * addess feedback * address feedback * final modifs for PEFT * fixup * ok now * trigger CI * trigger CI again * Update docs/source/en/main_classes/deepspeed.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * import from `integrations` * address feedback * revert removal of `deepspeed` module * revert removal of `deepspeed` module * fix conflicts * ooops * oops * add deprecation warning * place it on the top * put `FutureWarning` * fix conflicts with not_doctested.txt * add back `bitsandbytes` module with a depr warning * fix * fix * fixup * oops * fix doctests --------- Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 18 Aug, 2023 1 commit
-
-
Sourab Mangrulkar authored
-
- 17 Aug, 2023 1 commit
-
-
Sourab Mangrulkar authored
* add util for ram efficient loading of model when using fsdp * make fix-copies * fixes
馃槄 * docs * making it further easier to use * rename the function * refactor to handle fsdp ram efficiency in `from_pretrained` * fixes * fixes * fixes * update * fixes * revert `load_pretrained_model_only_on_rank0` * resolve `load_from_checkpoint`
-
- 15 Aug, 2023 1 commit
-
-
Zach Mueller authored
* Make training args fully immutable * Working tests, PyTorch * In test_trainer * during testing * Use proper dataclass way * Fix test * Another one * Fix tf * Lingering slow * Exception * Clean
-
- 09 Aug, 2023 1 commit
-
-
Alan Ji authored
* enhanced tips for some training args * make style
-
- 07 Aug, 2023 1 commit
-
-
Sylvain Gugger authored
* First draft * Deal with progress bars * Update src/transformers/utils/hub.py Co-authored-by:
Lucain <lucainp@gmail.com> * Address review comments * Forgot one * Pin hf_hub * Add argument for push all and fix tests * Fix tests * Address review comments --------- Co-authored-by:
Lucain <lucainp@gmail.com>
-
- 03 Aug, 2023 1 commit
-
-
Tom Aarsen authored
* Update list of logging integrations in docstring Also update type hint * Also add 'flyte' to report_to callback list * Revert 'report_to' type hint update Due to CLI breaking
-
- 02 Aug, 2023 1 commit
-
-
Sourab Mangrulkar authored
* resolving zero3 init when using accelerate config with Trainer * refactor * fix * fix import
-
- 28 Jul, 2023 1 commit
-
-
Lucain authored
* Fix .push_to_hub and cleanup get_full_repo_name usage * Do not rely on Python bool conversion magic * request changes
-
- 27 Jul, 2023 1 commit
-
-
Zach Mueller authored
* Change defaults * Sylvain's comments
-
- 25 Jul, 2023 1 commit
-
-
Xuehai Pan authored
-
- 24 Jul, 2023 1 commit
-
-
Zach Mueller authored
* Dispatch batches * Copy items
-
- 21 Jul, 2023 2 commits
-
-
Sylvain Gugger authored
-
Sourab Mangrulkar authored
* fix fsdp prepare to remove the warnings and fix excess memory usage * Update training_args.py * parity for FSDP+XLA * Update trainer.py
-