Commits · 08b462189936f193c97acacd0c8dafaa5313d1ab · chenpangpang / transformers

30 Nov, 2022 1 commit
- Repurpose torchdynamo training args towards torch._dynamo (#20498) · 08b46218
  Sylvain Gugger authored Nov 30, 2022
```
* Repurpose torchdynamo training args towards torch._dynamo

* Add doc
```
25 Nov, 2022 1 commit
- [AnyPrecisionAdamW] test fix (#20454) · a547d5bd
  Stas Bekman authored Nov 25, 2022
  
18 Nov, 2022 1 commit

Add AnyPrecisionAdamW optimizer (#18961) · 84c9cc6d

atturaioe authored Nov 18, 2022

* Add AnyPrecisionAdamW optimizer

* Add optim_args argument to TrainingArgs

* Add tests for AnyPrecisionOptimizer

* Change AnyPrecisionAdam default params to float32

* Move default_anyprecision_kwargs in trainer test

* Rename AnyPrecisionAdamW


16 Nov, 2022 1 commit

Data collator for token classification pads labels column when receives pytorch tensors (#20244) · 610acc5a

Alexander Markov authored Nov 16, 2022



* token cls data_collator pads labels column

* remove walrus operator for code quality

* remove redundant space

* remove comment that was fixed

* PR comments fix
Co-authored-by: Alexander Markov <amarkov.me@gmail.com>


15 Sep, 2022 1 commit

Run `torchdynamo` tests (#19056) · 16242e1b

Yih-Dar authored Sep 15, 2022



* Enable torchdynamo tests

* make style
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


12 Aug, 2022 1 commit
- small change (#18584) · 1ccd2515
  Younes Belkada authored Aug 12, 2022
  
13 Jul, 2022 1 commit

Enable torchdynamo with torch_tensorrt(fx path) (#17765) · 7ea6ccc2

Wei authored Jul 13, 2022



* enable fx2trt

* Update perf_train_gpu_one.mdx

* Update perf_train_gpu_one.mdx

* add lib check

* update

* format

* update

* fix import check

* fix isort

* improve doc

* refactor ctx manager

* fix isort

* black format

* isort fix

* fix format

* update args

* update black

* cleanups

* Update perf_train_gpu_one.mdx

* code refactor

* code refactor to init

* remove redundancy

* isort

* replace self.args with args
Co-authored-by: Stas Bekman <stas@stason.org>


12 Jul, 2022 1 commit

Enhance IPEX integration in Trainer (#18072) · b7d8bd37

jianan-gu authored Jul 12, 2022



* enhance ipex import

* refine codes

* refine style

* add link

* style
Co-authored-by: Stas Bekman <stas@stason.org>


08 Jul, 2022 1 commit

Make predict() close progress bars after finishing (#17952) (#18078) · 8b332a6a

neverix authored Jul 08, 2022

* Make Trainer.predict call on_evaluate (#17952)

* Add on_predict

* Small fix

* Small and different fix

* Add tests


01 Jul, 2022 1 commit
- higher atol to avoid flaky trainer test failure (#17979) · 664688b9
  Yih-Dar authored Jul 01, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
30 Jun, 2022 1 commit
- skip some ipex tests until it works with torch 1.12 (#17964) · fe140464
  Yih-Dar authored Jun 30, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
28 Jun, 2022 1 commit
- Fix `test_number_of_steps_in_training_with_ipex` (#17889) · f717d47f
  Yih-Dar authored Jun 28, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
21 Jun, 2022 1 commit

Prepare transformers for v0.8.0 huggingface-hub release (#17716) · 6a5272b2

Lysandre Debut authored Jun 21, 2022



* Prepare CI for v0.8.0

* pin hfh (revert before merge)

* Revert "pin hfh (revert before merge)"

This reverts commit a0103140e1c77b810ffcb735192968bc03be3e1f.

* Test rc3

* Test latest rc

* Unpin to the RC
Co-authored-by: Sylvain Gugger <Sylvain.gugger@gmail.com>


20 Jun, 2022 1 commit
- deprecate is_torch_bf16_available (#17738) · a2d34b7c
  Stas Bekman authored Jun 20, 2022
```
* deprecate is_torch_bf16_available

* address suggestions
```
14 Jun, 2022 1 commit

Extend Transformers Trainer Class to Enable PyTorch Torchscript for Inference (#17153) · 3b29c9fd

jianan-gu authored Jun 14, 2022



* add jit mode option and model wrap

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refine code

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add ut and refine code

* code refine

* refine code

* add inference doc

* Update src/transformers/trainer.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* add cpu inference performance doc

* Update perf_infer_cpu.mdx

* Update perf_infer_cpu.mdx

* Update performance.mdx

* Update _toctree.yml

* refine jit func naming

* Update _toctree.yml

* Delete perf_infer_gpu_one.mdx

* Update perf_infer_cpu.mdx

* Update docs/source/en/perf_infer_cpu.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* add none check before jit

* Update docs/source/en/perf_infer_cpu.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/en/perf_infer_cpu.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>


08 Jun, 2022 1 commit

Extend Transformers Trainer Class to Enable CPU AMP and Integrate Intel Extension for PyTorch (#17138) · 34097b33

jianan-gu authored Jun 08, 2022

* init PR

* fix import ipex

* minor fix on bf16

* refine optimizer

* refine args notes

* refine code

* refine ipex optimize args

* refine half_precision_backend

* black format

* isort format

* isort format files

* flake8 format

* doc builder format

* refine codes

* remove jit and optim bits

* black preview format

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refine code

* refine notes

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* code refine

* add ipex ut

* add performance cpu doc

* link to the cpu doc from main perf doc

* install ipex into CI's docker

* Update perf_train_cpu.mdx

* Update docs/source/en/perf_train_cpu.mdx
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update perf_train_cpu.mdx

* Update perf_train_cpu.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>


25 May, 2022 1 commit

Support compilation via Torchdynamo, AOT Autograd, NVFuser (#17308) · 897a8dd8

Animesh Jain authored May 25, 2022



* Support compilation via Torchdynamo, AOT Autograd, NVFuser

* Address comments

* Lint

* Stas comments - missing quality test

* Lintere

* Quality test

* Doc lint

* Reset CUDA peak mem

* Add CustomTrainer

* require a single gpu
Co-authored-by: Stas Bekman <stas@stason.org>


18 May, 2022 1 commit
- [tests] fix copy-n-paste error (#17312) · 3601aa8f
  Stas Bekman authored May 18, 2022
```
* [tests] fix copy-n-paste error

* fix
```
16 May, 2022 1 commit
- Make TrainerHyperParameterSigOptIntegrationTest slow test (#17288) · 66b3e106
  Yih-Dar authored May 16, 2022
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
11 May, 2022 2 commits

Ensure tensors are at least 1d for pad and concat (#17179) · 47412c7d

Antoni Baum authored May 11, 2022

* Ensure tensors are at least 1d for pad and concat

* Compatibility

* Fix

* Fix

* Add test

* Retrigger CI

* Consistency with master

* Retrigger CI


Remove unnecessary columns for all dataset types in `Trainer` (#17166) · edcc66d2

Antoni Baum authored May 11, 2022

* Remove unneeded columns for IterableDataset

* Add test

* Update trainer tests

* Edit docstring

* Lint

* Apply feedback

* Apply feedback


09 May, 2022 1 commit

Add the auto_find_batch_size capability from Accelerate into Trainer (#17068) · 2fbb2379

Zachary Mueller authored May 09, 2022


Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

- Adds auto_batch_size finder 
- Moves training loop to an inner training loop


03 May, 2022 2 commits
- Fix RNG reload in resume training from epoch checkpoint (#17055) · 1c9fcd0e
  Sylvain Gugger authored May 03, 2022
```
* Fix RNG reload in resume training from epoch checkpoint

* Fix test
```
- Make Trainer compatible with sharded checkpoints (#17053) · a8fa2f91
  Sylvain Gugger authored May 03, 2022
```
* Make Trainer compatible with sharded checkpoints

* Add doc
```
19 Apr, 2022 2 commits

Add support for bitsandbytes (#15622) · 3104036e

Manuel R. Ciosici authored Apr 19, 2022



* Add initial BNB integration

* fixup! Add initial BNB integration

* Add bnb test decorator

* Update Adamw8bit option name

* Use the full bnb package name

* Override bnb for all embedding layers

* Fix package name

* Formatting

* Remove unnecessary import

* Update src/transformers/trainer.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Rename AdamwBNB optimizer option

* Add training test checking that bnb memory utilization is lower

* fix merge

* fix merge; fix + extend new test

* cleanup

* expand bnb

* move all require_* candidates to testing_utils.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>


Some tests misusing assertTrue for comparisons fix (#16771) · a2392415

code-review-doctor authored Apr 19, 2022

* Fix issue avoid-misusing-assert-true found at https://codereview.doctor



* fix tests

* fix tf
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


29 Mar, 2022 1 commit

Avoid accessing .dataset of a DataLoader in Trainer (#16451) · d7c8ce57

Sander Land authored Mar 29, 2022



* Avoid accessing .dataset of a dataloader

* style

* fix

* cleaning up, reverting some misunderstandings

* black

* add train_dataset argument to get_train_dataloader, and fix other instances of length checks

* flake8

* address comments

* fix bug

* cleanup

* add test

* Update tests/trainer/test_trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* under torch

* merge

* stylistic suggestion
Co-authored-by: Sander Land <sander@chatdesk.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


23 Mar, 2022 1 commit

Reorganize file utils (#16264) · 4975002d

Sylvain Gugger authored Mar 23, 2022

* Split file_utils in several submodules

* Fixes

* Add back more objects

* More fixes

* Who exactly decided to import that from there?

* Second suggestion to code with code review

* Revert wrong move

* Fix imports

* Adapt all imports

* Adapt all imports everywhere

* Revert this import, will fix in a separate commit


08 Mar, 2022 1 commit

Seed _get_train_sampler's generator with arg seed to improve reproducibility (#15961) · 5b7dcc73

David Hall authored Mar 08, 2022



* Seed get_train_sampler's generator with arg seed to improve reproducibility

and make the world_size<=1 code path more similar to the others

* move test file into trainer test explicitly

* dumb typo

* make style lint happy

* per discussion, switch to data_seed

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


23 Feb, 2022 1 commit

[Test refactor 1/5] Per-folder tests reorganization (#15725) · 29c10a41

Lysandre Debut authored Feb 23, 2022



* Per-folder tests reorganization
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Stas Bekman <stas@stason.org>
