"vscode:/vscode.git/clone" did not exist on "f1f23ad1710953e75b53a85953b018b8caceb427"
- 30 Apr, 2021 1 commit
bonniehyeon authored
* Fix do_eval default value in training_args.py
* Update PULL_REQUEST_TEMPLATE.md
-
- 29 Apr, 2021 1 commit
Sylvain Gugger authored
* Split checkpoint from model_name_or_path in examples
* Address review comments
* Address review comments
-
- 26 Apr, 2021 1 commit
Stas Bekman authored
* adding Z-inf
* revamp config process
* up version requirement
* wip
* massive rewrite
* cleanup
* cleanup
* Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* consistent json commas
* act on suggestions
* leave this feature for 0.3.16
* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 23 Apr, 2021 1 commit
Sylvain Gugger authored
* Initial support for upload to hub
* push -> upload
* Fixes + examples
* Fix torchhub test
* Torchhub test I hate you
* push_model_to_hub -> push_to_hub
* Apply mixin to other pretrained models
* Remove ABC inheritance
* Add tests
* Typo
* Run tests
* Install git-lfs
* Change approach
* Add push_to_hub to all
* Staging test suite
* Typo
* Maybe like this?
* More deps
* Cache
* Adapt name
* Quality
* MOAR tests
* Put it in testing_utils
* Docs + torchhub last hope
* Styling
* Wrong method
* Typos
* Update src/transformers/file_utils.py Co-authored-by: Julien Chaumond <julien@huggingface.co>
* Address review comments
* Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
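For illustration, a minimal sketch of the mixin this commit applies to the pretrained classes (the repo name is a placeholder, and the exact keyword signature has shifted across releases):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

# Uploads the weights/config (and tokenizer files) to a repo on the Hub;
# requires a prior `huggingface-cli login` and git-lfs, matching the CI
# setup added in this commit. "my-finetuned-model" is a placeholder name.
model.push_to_hub("my-finetuned-model")
tokenizer.push_to_hub("my-finetuned-model")
```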
-
- 16 Apr, 2021 1 commit
Sylvain Gugger authored
* Bulk of the work
* Polish and tests
* Update QA Trainer
* Avoid breaking the predict method
* Deprecation warnings
* Store real eval dataloader
* Get eval dataset reference before wrap
-
- 13 Apr, 2021 1 commit
Sylvain Gugger authored
-
- 08 Apr, 2021 1 commit
Sylvain Gugger authored
-
- 31 Mar, 2021 2 commits
JohnnyC08 authored
Fix the group_by_length documentation, where "length" was misspelled as "legnth".
-
Sylvain Gugger authored
* Replace is_sagemaker_distributed_available
* Merge SageMakerTrainer into Trainer
* Test with shorter condition
* Put back deleted line
* Deprecate SageMakerTrainer and SageMakerTrainingArguments
* Apply suggestions from code review Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
-
- 29 Mar, 2021 1 commit
pcuenca authored
A new argument `length_column_name` has been added to `TrainingArguments`, with default value `"length"`. If this column exists and `group_by_length` is `True`, the train sampler will use it for grouping rather than computing the lengths before training starts. This is an optimization that lets the user precompute lengths for fast processing, avoiding the sequential pass over the dataset described in issue #10909.
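For illustration, a sketch of opting into the new argument (the `length` column is assumed to have been precomputed by the user; `output_dir` is a placeholder):

```python
from transformers import TrainingArguments

# Assumes the train dataset already carries a precomputed length column, e.g.
#   dataset = dataset.map(lambda ex: {"length": len(ex["input_ids"])})
args = TrainingArguments(
    output_dir="out",
    group_by_length=True,         # group samples of similar length together
    length_column_name="length",  # the new argument; "length" is the default
)
```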
-
- 24 Mar, 2021 1 commit
Sidd Karamcheti authored
-
- 16 Mar, 2021 2 commits
Cheng Li authored
* pass hf optimizer and scheduler to deepspeed if not specified in ds config
* update
* make init_deepspeed support config dict
* fix docstring formatting
* clean up trainer's comments
* add new tests
* fix type
* composite argparse doesn't work
* style
* add a new test, rename others
* document new functionality
* complete tests, add docs
* style
* correct level
* Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add new methods to the doc
* must tell DS we are using a non-native optimizer
* add protection against cpu_offload + HF optimizer combo
* fix the cli overrides
* sync docs + tests
* restore AdamW
* better docs
* need new version
* no longer needed
* remove outdated information
* refactor duplicated code
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
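For illustration, a sketch of the behaviour this adds: a config dict (now accepted directly, per "make init_deepspeed support config dict") with no `optimizer`/`scheduler` sections, so the Trainer hands DeepSpeed its own. Values are placeholders.

```python
from transformers import TrainingArguments

ds_config = {
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    # No "optimizer" or "scheduler" keys: the Trainer now passes its HF-side
    # AdamW and LR scheduler to DeepSpeed (flagged as a non-native optimizer).
    # The commit also guards against combining cpu_offload with this combo.
}
args = TrainingArguments(output_dir="out", deepspeed=ds_config)
```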
-
Sylvain Gugger authored
* Add DistributedSamplerWithLoop
* Fix typo
* Test and small fix
-
- 15 Mar, 2021 2 commits
Sylvain Gugger authored
-
Sylvain Gugger authored
* Handle save differently
* Missing imports
* Fix typo
* Adapt to recent changes in save_pretrained
* Forgotten brackets
* Optimizer load
* Fix world size
* Deal with None
* Remove needless self
-
- 12 Mar, 2021 2 commits
PaulLerner authored
* fix: #10628 expanduser path in TrainingArguments
* docs: explain why we expand paths in TrainingArguments
* Style
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
-
Sylvain Gugger authored
* Add auto_wrap option in fairscale integration
* Style
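For illustration, a sketch under the assumption that `sharded_ddp` accepts space-separated option names, as documented in later releases:

```python
from transformers import TrainingArguments

# "auto_wrap" asks fairscale to wrap submodules automatically so ZeRO-3
# sharding works without manual model changes (option names assumed from
# the later documented API).
args = TrainingArguments(output_dir="out", sharded_ddp="zero_dp_3 auto_wrap")
```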
-
- 04 Mar, 2021 1 commit
Philipp Schmid authored
* removed overwrites
* remove default value for output_dir
* adjusted typing
-
- 03 Mar, 2021 1 commit
Sylvain Gugger authored
* Fix gradient accumulation for SM Model Parallelism
* Style and divide loss by grad accum steps
-
- 28 Feb, 2021 1 commit
Tanmay Garg authored
* Introduce save_strategy training argument
* deprecate EvaluationStrategy
* collapse EvaluationStrategy and LoggingStrategy into a single IntervalStrategy enum
* modify tests to use modified enum
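For illustration, a sketch of the collapsed enum and the new argument (string values and enum members are interchangeable; `output_dir` is a placeholder):

```python
from transformers import TrainingArguments
from transformers.trainer_utils import IntervalStrategy

args = TrainingArguments(
    output_dir="out",
    save_strategy="epoch",                       # the new argument
    evaluation_strategy=IntervalStrategy.STEPS,  # replaces deprecated EvaluationStrategy
    eval_steps=500,
)
```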
-
- 25 Feb, 2021 1 commit
Sylvain Gugger authored
* Add support for ZeRO-2/3 and ZeRO-offload in fairscale
* Quality
* Rework from review comments
* Add doc
* Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
- 19 Feb, 2021 2 commits
Tanmay Garg authored
Introduce logging_strategy training argument in TrainingArguments and TFTrainingArguments. (#9838)
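For illustration, a sketch of the analogous logging control (values are placeholders):

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    logging_strategy="steps",  # "no", "steps", or "epoch"
    logging_steps=50,          # consulted only for the "steps" strategy
)
```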
-
Stas Bekman authored
* implement --fp16_full_eval
* Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* style
* add test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
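For illustration, a sketch of the new flag, trading a little metric precision for a smaller evaluation-time memory footprint:

```python
from transformers import TrainingArguments

# Runs evaluation and prediction fully in fp16 instead of keeping an
# fp32 copy of the model around.
args = TrainingArguments(output_dir="out", fp16_full_eval=True)
```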
-
- 18 Feb, 2021 2 commits
Stas Bekman authored
* memory tracker metrics
* go back to eval for somewhat consistency
* handle no-gpu case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* rename method
* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
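For illustration, a sketch of reading the new metrics (assumes an already-configured `trainer`; the key names follow the `eval_` prefix convention settled on above and are an assumption about the merged version):

```python
# `trainer` is assumed to be a configured transformers.Trainer instance.
metrics = trainer.evaluate()
for name in ("eval_mem_cpu_alloc_delta", "eval_mem_gpu_alloc_delta"):
    if name in metrics:  # gpu keys are absent in the no-gpu case handled above
        print(name, metrics[name])
```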
-
Tanmay Garg authored
Introduce warmup_ratio training argument in both TrainingArguments and TFTrainingArguments classes (#6673)
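For illustration, a sketch of the ratio-based alternative to `warmup_steps`:

```python
from transformers import TrainingArguments

# Warm the learning rate up over 10% of total training steps, rather than
# a fixed absolute step count.
args = TrainingArguments(output_dir="out", warmup_ratio=0.1)
```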
-
- 11 Feb, 2021 2 commits
Sylvain Gugger authored
* Refactor things out of main train
* Store signature
* Add SageMakerTrainer
* Init + Copyright
* Address review comments
-
Stas Bekman authored
* init devices/setup explicitly
* docs + test
* simplify
* cleanup
* cleanup
* cleanup
* correct the required dist setup
* derive local_rank from env LOCAL_RANK
-
- 09 Feb, 2021 1 commit
Sylvain Gugger authored
-
- 31 Jan, 2021 1 commit
lewtun authored
* Clarify definition of seed argument in Trainer
* Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/training_args_tf.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Fix style
* Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
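For illustration, a sketch matching the clarified docstring: `seed` is set at the beginning of training, so fully reproducible weight initialization needs the model created via `model_init`:

```python
from transformers import TrainingArguments, set_seed

set_seed(42)  # seeds the python, numpy and torch RNGs
args = TrainingArguments(output_dir="out", seed=42)
# For reproducible randomly-initialized parameters, pass a model_init
# callable to Trainer so instantiation happens after seeding, e.g.:
#   trainer = Trainer(args=args, model_init=build_model, ...)
```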
-
- 29 Jan, 2021 2 commits
Stas Bekman authored
-
Sylvain Gugger authored
* When on sagemaker use their env variables for saves
* Address review comments
* Quality
-
- 28 Jan, 2021 2 commits
abhishek thakur authored
-
abhishek thakur authored
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
-
- 27 Jan, 2021 1 commit
Sylvain Gugger authored
* Add a flag for find_unused_parameters
* Apply suggestions from code review Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
* Remove negation
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
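For illustration, a sketch of the flag ("Remove negation" suggests the final name is the positive one, as in the released `ddp_find_unused_parameters`):

```python
from transformers import TrainingArguments

# Forwarded to torch.nn.parallel.DistributedDataParallel; leaving it at
# None keeps the Trainer's default choice.
args = TrainingArguments(output_dir="out", ddp_find_unused_parameters=False)
```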
-
- 26 Jan, 2021 1 commit
Sylvain Gugger authored
* Add a debug print
* Adapt Trainer to use smdistributed if available
* Forgotten parenthesis
* Real check for sagemaker
* Don't forget to define device...
* Woopsie, local_rank is defined differently
* Update since local_rank has the proper value
* Remove debug statement
* More robust check for smdistributed
* Quality
* Deal with key not present error
-
- 22 Jan, 2021 1 commit
Sylvain Gugger authored
-
- 20 Jan, 2021 2 commits
Sylvain Gugger authored
-
Gunjan Chhablani authored
* Fix Trainer and Args to mention AdamW, not Adam.
* Update the docs for Training Arguments.
* Change arguments adamw_* to adam_*
* Fixed links to AdamW in TrainerArguments docs
* Fix line length in Training Args docs.
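For illustration, a sketch of the settled hyperparameter names (the optimizer is AdamW, while the argument names keep the `adam_` prefix; values shown are the library defaults):

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
)
```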
-
- 14 Jan, 2021 2 commits
Sylvain Gugger authored
* Upstream (and rename) sortish sampler
* Use proper sampler
* Update src/transformers/trainer_pt_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Sylvain Gugger authored
* Fix Trainer with a parallel model
* More clean up
-