Commits · 651408a077f842e76e75bfc7d02b8ac38eeb6480 · chenpangpang / transformers

"vscode:/vscode.git/clone" did not exist on "a5e5c92aea1e99cb84d7342bd63826ca6cd884c4"

16 Nov, 2023 1 commit

[`Styling`] stylify using ruff (#27144) · 651408a0

Arthur authored Nov 16, 2023



* try to stylify using ruff

* might need to remove these changes?

* use ruf format andruff check

* use isinstance instead of type comparision

* use # fmt: skip

* use # fmt: skip

* nits

* soem styling changes

* update ci job

* nits isinstance

* more files update

* nits

* more nits

* small nits

* check and format

* revert wrong changes

* actually use formatter instead of checker

* nits

* well docbuilder is overwriting this commit

* revert notebook changes

* try to nuke docbuilder

* style

* fix feature exrtaction test

* remve `indent-width = 4`

* fixup

* more nits

* update the ruff version that we use

* style

* nuke docbuilder styling

* leve the print for detected changes

* nits

* Remove file I/O
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

* style

* nits

* revert notebook changes

* Add # fmt skip when possible

* Add # fmt skip when possible

* Fix

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* NIts

* more fixes

* fix tapas

* Another way to skip

* Recommended way

* Fix two more fiels

* Remove asynch
Remove asynch

---------
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

651408a0

14 Nov, 2023 1 commit

Have seq2seq just use gather (#27025) · 067c4a31

Zach Mueller authored Nov 14, 2023



* Have seq2seq just use gather

* Change

* Reset after

* Make slow

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Clean

* Simplify and just use gather

* Update tests/trainer/test_trainer_seq2seq.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* gather always for seq2seq

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

067c4a31

06 Nov, 2023 1 commit
- enable memory tracker metrics for npu (#27280) · 1ffc4dee
  Hz, Ji authored Nov 06, 2023
  
  1ffc4dee
31 Oct, 2023 2 commits

Safetensors serialization by default (#27064) · 113ebf80

Lysandre Debut authored Oct 31, 2023



* Safetensors serialization by default

* First pass on the tests

* Second pass on the tests

* Third pass on the tests

* Fix TF weight loading from TF-format safetensors

* Specific encoder-decoder fixes for weight crossloading

* Add VisionEncoderDecoder fixes for TF too

* Change filename test for pt-to-tf

* One missing fix for TFVisionEncoderDecoder

* Fix the other crossload test

* Support for flax + updated tests

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Sanchit's comments

* Sanchit's comments 2

* Nico's comments

* Fix tests

* cleanup

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

113ebf80

[FEAT] Add Neftune into transformers Trainer (#27141) · 309a9066

Younes Belkada authored Oct 31, 2023



* add v1 neftune

* use `unwrap_model` instead

* add test + docs

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* more details

* fixup

* Update docs/source/en/main_classes/trainer.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor a bit

* more elaborated test

* fix unwrap issue

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

309a9066

30 Oct, 2023 2 commits
- Device agnostic trainer testing (#27131) · 5bbf6712
  Hz, Ji authored Oct 31, 2023
  
  5bbf6712
- [`Trainer` / `GC`] Add `gradient_checkpointing_kwargs` in trainer and training arguments (#27068) · 5fbed2d7
  Younes Belkada authored Oct 30, 2023
```
* add `gradient_checkpointing_kwargs` in trainer and training arguments

* add comment

* add test - currently failing

* now tests pass
```
  5fbed2d7
26 Oct, 2023 1 commit

Save TB logs as part of push_to_hub (#27022) · 34a64064

Zach Mueller authored Oct 26, 2023

* Support runs/

* Upload runs folder as part of push to hub

* Add a test

* Add to test deps

* Update with proposed solution from Slack

* Ensure that repo gets deleted in tests

34a64064

12 Sep, 2023 1 commit

enable optuna multi-objectives feature (#25969) · 8f609ab9

Wang, Yi authored Sep 13, 2023



* enable optuna multi-objectives feature
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update hpo doc

* update docstring
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* extend direction to List[str] type
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* Update src/transformers/integrations/integration_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

8f609ab9

05 Sep, 2023 1 commit

Patch with accelerate xpu (#25714) · 70a98024

Abhilash Majumder authored Sep 05, 2023

* patch with accelerate xpu

* patch with accelerate xpu

* formatting

* fix tests

* revert ruff unrelated fixes

* revert ruff unrelated fixes

* revert ruff unrelated fixes

* fix test

* review fixes

* review fixes

* black fixed

* review commits

* review commits

* style fix

* use pytorch_utils

* revert markuplm test

70a98024

01 Sep, 2023 1 commit
- Revert frozen training arguments (#25903) · be0e189b
  Zach Mueller authored Sep 01, 2023
```
* Revert frozen training arguments

* TODO
```
  be0e189b
15 Aug, 2023 1 commit

Make training args fully immutable (#25435) · ca514992

Zach Mueller authored Aug 15, 2023

* Make training args fully immutable

* Working tests, PyTorch

* In test_trainer

* during testing

* Use proper dataclass way

* Fix test

* Another one

* Fix tf

* Lingering slow

* Exception

* Clean

ca514992

07 Aug, 2023 1 commit

Migrate Trainer from `Repository` to `upload_folder` (#25095) · baf1daa5

Sylvain Gugger authored Aug 07, 2023



* First draft

* Deal with progress bars

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Address review comments

* Forgot one

* Pin hf_hub

* Add argument for push all and fix tests

* Fix tests

* Address review comments

---------
Co-authored-by: Lucain <lucainp@gmail.com>

baf1daa5

24 Jul, 2023 1 commit
- Add dispatch_batches to training arguments (#25038) · 3b734f50
  Zach Mueller authored Jul 24, 2023
```
* Dispatch batches

* Copy items
```
  3b734f50
18 Jul, 2023 1 commit
- add ascend npu accelerator support (#24879) · 9c875839
  statelesshz authored Jul 18, 2023
```
* Add Ascend NPU accelerator support

* fix style warining
```
  9c875839
12 Jul, 2023 1 commit

Fix pad across processes dim in trainer and not being able to set the timeout (#24775) · 02842855

Zach Mueller authored Jul 12, 2023



* dim, and rm copy

* Don't rm copy for now

* Oops

* pad index

* Should be a working test

* Tickle down ddp timeout

* Put fix back in now that testing locally is done

* Better comment specifying timeout
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

02842855

27 Jun, 2023 1 commit

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15

22 Jun, 2023 1 commit

Refactor hyperparameter search backends (#24384) · b6295b26

Alex Hall authored Jun 22, 2023

* Refactor hyperparameter search backends

* Simpler refactoring without abstract base class

* black

* review comments:
specify name in class
use methods instead of callable class attributes
name constant better

* review comments: safer bool checking, log multiple available backends

* test ALL_HYPERPARAMETER_SEARCH_BACKENDS vs HPSearchBackend in unit test, not module. format with black.

* copyright

b6295b26

12 Jun, 2023 1 commit
- 🚨🚨🚨 Replace DataLoader logic for Accelerate in Trainer, remove unneeded tests 🚨🚨🚨 (#24028) · ebd94b0f
  Zach Mueller authored Jun 12, 2023
```
* Working integration

* Fix failing test

* Revert label host logic

* Bring it back!
```
  ebd94b0f
24 May, 2023 1 commit

Paged Optimizer + Lion Optimizer for Trainer (#23217) · 796162c5

Tim Dettmers authored May 24, 2023



* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

796162c5

28 Apr, 2023 1 commit

Add Trainer support for ReduceLROnPlateau (#23010) · 9b435204

Maxime Méloux authored Apr 28, 2023



* Add Trainer support for ReduceLROnPlateau

Fixes #16503

* Remove training argument and add default instance

---------
Co-authored-by: mmeloux <maxime.meloux@loria.fr>

9b435204

17 Apr, 2023 1 commit

Introduce `PartialState` as the device handler in the `Trainer` (#22752) · 03462875

Zachary Mueller authored Apr 17, 2023



* Use accelerate for device management

* Add accelerate to setup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

03462875

12 Apr, 2023 1 commit
- [tests] switch to torchrun (#22712) · 1306b7d3
  Stas Bekman authored Apr 12, 2023
  
  1306b7d3
04 Apr, 2023 1 commit

Implemented safetensors checkpoints save/load for Trainer (#22498) · 871598be

Viktor Scherbakov authored Apr 04, 2023



* implemented safetensors save/load

* remove duplicated file

* added tests

* more tests

* style fix

* fix tf tests

* change to list comprehension
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* review fixes + safe load for sharded checkpoint

* style fix

* remove rogue import

* remove partial to avoid undefined exception

* use naming alias instead of safetensors.torch

* fix safe sharding in tests

* grammar
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* update docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* update docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* minor corrections

* style

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

871598be

16 Mar, 2023 1 commit

(#22204) · 5110e574

Yih-Dar authored Mar 16, 2023



* py38 + torch 2

* increment cache versions

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5110e574

10 Mar, 2023 1 commit
- handle numpy inputs in whole word mask data collator (#22032) · 2f4cdd97
  Dean Wyatte authored Mar 10, 2023
  
  2f4cdd97
09 Mar, 2023 1 commit

Remove set_access_token usage + fail tests if FutureWarning (#22051) · 923110b7

Lucain authored Mar 09, 2023



* Remove set_access_token usage + fail tests if FutureWarning

* do not fail on FutureWarning in CI

---------
Co-authored-by: testbot <lucainp@hf.co>

923110b7

28 Feb, 2023 1 commit
- Fix flaky test for log level (#21776) · b29e2dca
  Sylvain Gugger authored Feb 28, 2023
```
* Fix flaky test for log level

* Fix other flaky test
```
  b29e2dca
23 Feb, 2023 1 commit
- Skip test_log_level for now · aa3787c8
  ydshieh authored Feb 23, 2023
  
  aa3787c8
22 Feb, 2023 2 commits
- Respect documentation on passive log level (#21700) · b19d64d8
  Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
  b19d64d8
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
07 Feb, 2023 1 commit
- Fix epoch number when resuming training (#21478) · cc840752
  Sylvain Gugger authored Feb 06, 2023
  
  cc840752
06 Feb, 2023 1 commit

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

18 Jan, 2023 2 commits

Add AWS Neuron torchrun support (#20806) · c59d71b2

jeffhataws authored Jan 18, 2023

* Add XLA torchrun support

* Clarify that currently DDP doesn't work with torch.distributed XLA backend yet

* Enable DDP with torchrun and XLA (now available in PT-XLA 1.13)

* Add check for AWS Neuron availability and AWS Neuron specific compiler flag

* Change the new test's name to TestTrainerDistributedNeuronCore

* Remove "assert" and replace raised exception

* Remove compiler flag as it is optional. If needed, will be another PR.

* Use TORCHELASTIC_RUN_ID to determine whether torchrun is used

c59d71b2

Adapt repository creation to latest hf_hub (#21158) · 05e72aa0

Sylvain Gugger authored Jan 18, 2023

* Adapt repository creation to latest hf_hub

* Update all examples

* Fix other tests, add Flax examples

* Address review comments

05e72aa0

12 Jan, 2023 1 commit

Fix past CI (#20967) · b3a0aad3

Yih-Dar authored Jan 12, 2023



* Fix for Past CI

* make style

* clean up

* unindent 2 blocks
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

b3a0aad3

20 Dec, 2022 1 commit
- fix typo output not ouput in bitsandbytes trainer test (#20839) · 7ef3f19c
  Thomas-MMJ authored Dec 19, 2022
```
fix typo output not ouput

typo was causing an error on pytest collection
```
  7ef3f19c
30 Nov, 2022 1 commit
- Repurpose torchdynamo training args towards torch._dynamo (#20498) · 08b46218
  Sylvain Gugger authored Nov 30, 2022
```
* Repurpose torchdynamo training args towards torch._dynamo

* Add doc
```
  08b46218
25 Nov, 2022 1 commit
- [AnyPrecisionAdamW] test fix (#20454) · a547d5bd
  Stas Bekman authored Nov 25, 2022
  
  a547d5bd
18 Nov, 2022 1 commit

Add AnyPrecisionAdamW optimizer (#18961) · 84c9cc6d

atturaioe authored Nov 18, 2022

* Add AnyPrecisionAdamW optimizer

* Add optim_args argument to TrainingArgs

* Add tests for AnyPrecisionOptimizer

* Change AnyPrecisionAdam default params to float32

* Move default_anyprecision_kwargs in trainer test

* Rename AnyPrecisionAdamW

84c9cc6d