Commits · 2fbb237967f5d1b2eb65c2131954f23a24bd29ef · chenpangpang / transformers

09 May, 2022 1 commit

Add the auto_find_batch_size capability from Accelerate into Trainer (#17068) · 2fbb2379

Zachary Mueller authored May 09, 2022


Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- Adds auto_batch_size finder 
- Moves training loop to an inner training loop


19 Apr, 2022 1 commit

Some tests misusing assertTrue for comparisons fix (#16771) · a2392415

code-review-doctor authored Apr 19, 2022

* Fix issue avoid-misusing-assert-true found at https://codereview.doctor



* fix tests

* fix tf
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit


23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>


05 Oct, 2021 1 commit
- Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler (#13820) · 1b74af76
  Zhaofeng Wu authored Oct 05, 2021
```
* Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler

* Fix
```
29 Sep, 2021 1 commit
- Fix length of IterableDatasetShard and add test (#13792) · 63cc5bda
  Sylvain Gugger authored Sep 29, 2021
```
* Fix length of IterableDatasetShard and add test

* Add comments
```
14 Jun, 2021 1 commit
- [style] consistent nn. and nn.functional: part 3 `tests` (#12155) · 372ab9cd
  Stas Bekman authored Jun 14, 2021
```
* consistent nn. and nn.functional: p3 templates

* restore
```
30 Apr, 2021 1 commit
- Accepts BatchEncoding in LengthSampler (#11431) · c2cd02ac
  Takuya Makino authored Apr 30, 2021
  
16 Apr, 2021 1 commit

Trainer support for IterableDataset for evaluation and predict (#11286) · d9c62047

Sylvain Gugger authored Apr 16, 2021

* Bulk of the work

* Polish and tests

* Update QA Trainer

* Avoid breaking the predict method

* Deprecation warnings

* Store real eval dataloader

* Get eval dataset reference before wrap


14 Apr, 2021 1 commit

Trainer iterable dataset (#11254) · aaaed56f

Sylvain Gugger authored Apr 14, 2021



* IterableDatasetShard

* Test and integration in Trainer

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


05 Apr, 2021 1 commit
- Fix distributed gather for tuples of tensors of varying sizes (#11071) · 04ceee7d
  Sylvain Gugger authored Apr 05, 2021
  
17 Mar, 2021 1 commit

Smmp batch not divisible by microbatches fix (#10778) · 0282e24e

Mansi Mane authored Mar 17, 2021



* Added debug prints

* Added config

* Added prints

* Added prints

* Added extra samples to SequentialDistributedSampler

* Added extra samples to SequentialDistributedSampler

Updated SequentialDistributedSampler call

* Added debug prints

* Removed extra prints

* Making predictions and labels multiple of batchsize

* updated number of microbatches

* Removed extra prints

* Made start_remainder similar to DistributedSamplerWithLoop

* Minor spacing update

* Added debug prints

Added config

Added prints

Added prints

* Added extra samples to SequentialDistributedSampler

Updated SequentialDistributedSampler call

Added extra samples to SequentialDistributedSampler

Added debug prints

Removed extra prints

Making predictions and labels multiple of batchsize

updated number of microbatches

Removed extra prints

Squashing redundant commits

* Made start_remainder similar to DistributedSamplerWithLoop

Minor spacing update

Made start_remainder similar to DistributedSamplerWithLoop

* Test and styling

* Rename test
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>


16 Mar, 2021 1 commit
- Add DistributedSamplerWithLoop (#10746) · a0a027c2
  Sylvain Gugger authored Mar 16, 2021
```
* Add DistributedSamplerWithLoop

* Fix typo

* Test and small fix
```
08 Mar, 2021 5 commits
- Check layer types for Optimizer construction (#10598) · 3ced9b3e
  Sylvain Gugger authored Mar 08, 2021
```
* Check layer types for Optimizer construction

* Duplicate class
```
- Revert "Tests" · 821d518e
  Sylvain Gugger authored Mar 08, 2021
```
This reverts commit b35e7b68.
```
- Revert "Style" · 4196bfed
  Sylvain Gugger authored Mar 08, 2021
```
This reverts commit a8ec52ef.
```
- Style · a8ec52ef
  Sylvain Gugger authored Mar 08, 2021
  
- Tests · b35e7b68
  Sylvain Gugger authored Mar 08, 2021
  
21 Jan, 2021 1 commit

Fix memory regression in Seq2Seq example (#9713) · 5f80c15e

Sylvain Gugger authored Jan 21, 2021

* Fix memory regression in Seq2Seq example

* Fix test and properly deal with -100

* Easier condition with device safety

* Patch for MBartTokenizerFast


14 Jan, 2021 1 commit

Upstream (and rename) sortish sampler (#9574) · 329fe274

Sylvain Gugger authored Jan 14, 2021



* Upstream (and rename) sortish sampler

* Use proper sampler

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>


22 Dec, 2020 1 commit

Seq2seq trainer (#9241) · 490b39e6

Sylvain Gugger authored Dec 22, 2020



* Add label smoothing in Trainer

* Add options for scheduler and Adafactor in Trainer

* Put Seq2SeqTrainer in the main lib

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments and adapt scripts

* Documentation

* Move test not using script to tests folder
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


14 Oct, 2020 1 commit

Add predict step accumulation (#7767) · a1d1b332

Sylvain Gugger authored Oct 14, 2020



* Add eval_accumulation_step and clean distributed eval

* Add TPU test

* Add TPU stuff

* Fix arg name

* Fix Seq2SeqTrainer

* Fix total_size

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Doc and add test to TPU

* Add unit test

* Adapt name
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
