- 12 Mar, 2021 1 commit
-
-
Stas Bekman authored
-
- 09 Mar, 2021 1 commit
-
-
Sylvain Gugger authored
* Hotfix fairscale FSDP * Evaluation works * Save on process zero
-
- 08 Mar, 2021 1 commit
-
-
Stas Bekman authored
* fix sharded ddp enum * test fixes * stronger validation + apex breaks other tests
-
- 25 Feb, 2021 1 commit
-
-
Sylvain Gugger authored
* Ass support for ZeRO-2/3 and ZeRO-offload in fairscale * Quality * Rework from review comments * Add doc * Apply suggestions from code review Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Address review comments Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com>
-
- 24 Feb, 2021 1 commit
-
-
Stas Bekman authored
* handle get_last_lr() before first step() * abstract away the lr getting logic * cleanup * add test * move to utils
-
- 22 Feb, 2021 1 commit
-
-
Stas Bekman authored
* implement gradient_accumulation_steps support in DeepSpeed integration * typo * cleanup * cleanup
-
- 18 Feb, 2021 1 commit
-
-
Stas Bekman authored
* memory tracker metrics * go back to eval for somewhat consistency * handle no-gpu case * deal with stackable eval calls * restore callback order * style * simplify the API * add test * docs * consistently use eval_ prefix * improve docs * Update src/transformers/trainer_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * rename method * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 17 Feb, 2021 1 commit
-
-
Stas Bekman authored
* fix invalid port * missing requirements
-
- 15 Feb, 2021 1 commit
-
-
Stas Bekman authored
* fix run_seq2seq.py; porting DeepSpeed tests to it * unrefactor * defensive programming * defensive programming 2 * port the rest of the trainer tests * style * a cleaner scripts dir finder * cleanup
-
- 11 Feb, 2021 1 commit
-
-
Stas Bekman authored
* init devices/setup explicitly * docs + test * simplify * cleanup * cleanup * cleanup * correct the required dist setup * derive local_rank from env LOCAL_RANK
-
- 10 Feb, 2021 1 commit
-
-
Stas Bekman authored
* free up memory at the end of train * rework tests * consistent formatting * correction
-
- 08 Feb, 2021 1 commit
-
-
Stas Bekman authored
-
- 05 Jul, 2019 2 commits