Commits · 7252e8d9374b3088215c94b9f82904e22010fac0 · chenpangpang / transformers

22 Jan, 2024 1 commit

Avoid root logger's level being changed (#28638) · d336c56d

Yih-Dar authored Jan 22, 2024



* avoid root logger's level being changed

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d336c56d

10 Jan, 2024 2 commits
- Support `DeepSpeed` when using auto find batch size (#28088) · 6015d0ad
  Zach Mueller authored Jan 10, 2024
```
Fixup test
```
  6015d0ad
- Skip now failing test in the Trainer tests (#28421) · a777f525
  Zach Mueller authored Jan 10, 2024
```
* Fix test

* Skip
```
  a777f525
20 Dec, 2023 1 commit

move code to Trainer.evaluate to enable use of that function with multiple datasets (#27844) · 769a9542

peter-sk authored Dec 20, 2023



* move code to Trainer.evaluate to enable use of that function with multiple datasets

* test

* update doc string

* and a tip

* forgot the type

---------
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>

769a9542

13 Dec, 2023 1 commit

Fix bug with rotating checkpoints (#28009) · 93766251

Zach Mueller authored Dec 13, 2023

* Fix bug

* Write test

* Keep back old modification for grad accum steps

* Whitespace...

* Whitespace again

* Race condition

* Wait for everyone

93766251

11 Dec, 2023 1 commit
- Fix test for auto_find_batch_size on multi-GPU (#27947) · 44127ec6
  Zach Mueller authored Dec 11, 2023
```
* Fix test for multi-GPU

* WIth CPU handle
```
  44127ec6
08 Dec, 2023 2 commits

Allow `resume_from_checkpoint` to handle `auto_find_batch_size` (#27568) · 6757ed28

Zach Mueller authored Dec 08, 2023



* Fuffill request

* Add test

* Better test

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Better test

* Better test

* MOre comments

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6757ed28

fix: non-atomic checkpoint save (#27820) · 4c5ed1d0
Jonathon Belotti authored Dec 08, 2023

4c5ed1d0

28 Nov, 2023 1 commit
- Fixed passing scheduler-specific kwargs via TrainingArguments lr_scheduler_kwargs (#27595) · 2ca73e5e
  Charbel Abi Daher authored Nov 28, 2023
```
* Fix passing scheduler-specific kwargs through TrainingArguments `lr_scheduler_kwargs`

* Added test for lr_scheduler_kwargs
```
  2ca73e5e
21 Nov, 2023 1 commit
- dvclive callback: warn instead of fail when logging non-scalars (#27608) · 8eb9e29d
  Dave Berenbaum authored Nov 21, 2023
```
* dvclive callback: warn instead of fail when logging non-scalars

* tests: log lr as scalar
```
  8eb9e29d
16 Nov, 2023 1 commit

[`Styling`] stylify using ruff (#27144) · 651408a0

Arthur authored Nov 16, 2023



* try to stylify using ruff

* might need to remove these changes?

* use ruf format andruff check

* use isinstance instead of type comparision

* use # fmt: skip

* use # fmt: skip

* nits

* soem styling changes

* update ci job

* nits isinstance

* more files update

* nits

* more nits

* small nits

* check and format

* revert wrong changes

* actually use formatter instead of checker

* nits

* well docbuilder is overwriting this commit

* revert notebook changes

* try to nuke docbuilder

* style

* fix feature exrtaction test

* remve `indent-width = 4`

* fixup

* more nits

* update the ruff version that we use

* style

* nuke docbuilder styling

* leve the print for detected changes

* nits

* Remove file I/O
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

* style

* nits

* revert notebook changes

* Add # fmt skip when possible

* Add # fmt skip when possible

* Fix

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* More `  # fmt: skip` usage

* NIts

* more fixes

* fix tapas

* Another way to skip

* Recommended way

* Fix two more fiels

* Remove asynch
Remove asynch

---------
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>

651408a0

14 Nov, 2023 1 commit

Have seq2seq just use gather (#27025) · 067c4a31

Zach Mueller authored Nov 14, 2023



* Have seq2seq just use gather

* Change

* Reset after

* Make slow

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Clean

* Simplify and just use gather

* Update tests/trainer/test_trainer_seq2seq.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* gather always for seq2seq

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

067c4a31

06 Nov, 2023 1 commit
- enable memory tracker metrics for npu (#27280) · 1ffc4dee
  Hz, Ji authored Nov 06, 2023
  
  1ffc4dee
31 Oct, 2023 2 commits

Safetensors serialization by default (#27064) · 113ebf80

Lysandre Debut authored Oct 31, 2023



* Safetensors serialization by default

* First pass on the tests

* Second pass on the tests

* Third pass on the tests

* Fix TF weight loading from TF-format safetensors

* Specific encoder-decoder fixes for weight crossloading

* Add VisionEncoderDecoder fixes for TF too

* Change filename test for pt-to-tf

* One missing fix for TFVisionEncoderDecoder

* Fix the other crossload test

* Support for flax + updated tests

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Sanchit's comments

* Sanchit's comments 2

* Nico's comments

* Fix tests

* cleanup

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

113ebf80

[FEAT] Add Neftune into transformers Trainer (#27141) · 309a9066

Younes Belkada authored Oct 31, 2023



* add v1 neftune

* use `unwrap_model` instead

* add test + docs

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* more details

* fixup

* Update docs/source/en/main_classes/trainer.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor a bit

* more elaborated test

* fix unwrap issue

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

309a9066

30 Oct, 2023 2 commits
- Device agnostic trainer testing (#27131) · 5bbf6712
  Hz, Ji authored Oct 31, 2023
  
  5bbf6712
- [`Trainer` / `GC`] Add `gradient_checkpointing_kwargs` in trainer and training arguments (#27068) · 5fbed2d7
  Younes Belkada authored Oct 30, 2023
```
* add `gradient_checkpointing_kwargs` in trainer and training arguments

* add comment

* add test - currently failing

* now tests pass
```
  5fbed2d7
26 Oct, 2023 1 commit

Save TB logs as part of push_to_hub (#27022) · 34a64064

Zach Mueller authored Oct 26, 2023

* Support runs/

* Upload runs folder as part of push to hub

* Add a test

* Add to test deps

* Update with proposed solution from Slack

* Ensure that repo gets deleted in tests

34a64064

12 Sep, 2023 1 commit

enable optuna multi-objectives feature (#25969) · 8f609ab9

Wang, Yi authored Sep 13, 2023



* enable optuna multi-objectives feature
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update hpo doc

* update docstring
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* extend direction to List[str] type
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* Update src/transformers/integrations/integration_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

8f609ab9

05 Sep, 2023 1 commit

Patch with accelerate xpu (#25714) · 70a98024

Abhilash Majumder authored Sep 05, 2023

* patch with accelerate xpu

* patch with accelerate xpu

* formatting

* fix tests

* revert ruff unrelated fixes

* revert ruff unrelated fixes

* revert ruff unrelated fixes

* fix test

* review fixes

* review fixes

* black fixed

* review commits

* review commits

* style fix

* use pytorch_utils

* revert markuplm test

70a98024

01 Sep, 2023 1 commit
- Revert frozen training arguments (#25903) · be0e189b
  Zach Mueller authored Sep 01, 2023
```
* Revert frozen training arguments

* TODO
```
  be0e189b
15 Aug, 2023 1 commit

Make training args fully immutable (#25435) · ca514992

Zach Mueller authored Aug 15, 2023

* Make training args fully immutable

* Working tests, PyTorch

* In test_trainer

* during testing

* Use proper dataclass way

* Fix test

* Another one

* Fix tf

* Lingering slow

* Exception

* Clean

ca514992

07 Aug, 2023 1 commit

Migrate Trainer from `Repository` to `upload_folder` (#25095) · baf1daa5

Sylvain Gugger authored Aug 07, 2023



* First draft

* Deal with progress bars

* Update src/transformers/utils/hub.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Address review comments

* Forgot one

* Pin hf_hub

* Add argument for push all and fix tests

* Fix tests

* Address review comments

---------
Co-authored-by: Lucain <lucainp@gmail.com>

baf1daa5

24 Jul, 2023 1 commit
- Add dispatch_batches to training arguments (#25038) · 3b734f50
  Zach Mueller authored Jul 24, 2023
```
* Dispatch batches

* Copy items
```
  3b734f50
18 Jul, 2023 1 commit
- add ascend npu accelerator support (#24879) · 9c875839
  statelesshz authored Jul 18, 2023
```
* Add Ascend NPU accelerator support

* fix style warining
```
  9c875839
12 Jul, 2023 1 commit

Fix pad across processes dim in trainer and not being able to set the timeout (#24775) · 02842855

Zach Mueller authored Jul 12, 2023



* dim, and rm copy

* Don't rm copy for now

* Oops

* pad index

* Should be a working test

* Tickle down ddp timeout

* Put fix back in now that testing locally is done

* Better comment specifying timeout
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

02842855

27 Jun, 2023 1 commit

Fix TypeError: Object of type int64 is not JSON serializable (#24340) · 239ace15

Xiaoli Wang authored Jun 27, 2023

* Fix TypeError: Object of type int64 is not JSON serializable

* Convert numpy.float64 and numpy.int64 to float and int for json serialization

* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py

* * make style

239ace15

22 Jun, 2023 1 commit

Refactor hyperparameter search backends (#24384) · b6295b26

Alex Hall authored Jun 22, 2023

* Refactor hyperparameter search backends

* Simpler refactoring without abstract base class

* black

* review comments:
specify name in class
use methods instead of callable class attributes
name constant better

* review comments: safer bool checking, log multiple available backends

* test ALL_HYPERPARAMETER_SEARCH_BACKENDS vs HPSearchBackend in unit test, not module. format with black.

* copyright

b6295b26

12 Jun, 2023 1 commit
- 🚨🚨🚨 Replace DataLoader logic for Accelerate in Trainer, remove unneeded tests 🚨🚨🚨 (#24028) · ebd94b0f
  Zach Mueller authored Jun 12, 2023
```
* Working integration

* Fix failing test

* Revert label host logic

* Bring it back!
```
  ebd94b0f
24 May, 2023 1 commit

Paged Optimizer + Lion Optimizer for Trainer (#23217) · 796162c5

Tim Dettmers authored May 24, 2023



* Added lion and paged optimizers and made original tests pass.

* Added tests for paged and lion optimizers.

* Added and fixed optimizer tests.

* Style and quality checks.

---------
Co-authored-by: younesbelkada <younesbelkada@gmail.com>

796162c5

28 Apr, 2023 1 commit

Add Trainer support for ReduceLROnPlateau (#23010) · 9b435204

Maxime Méloux authored Apr 28, 2023



* Add Trainer support for ReduceLROnPlateau

Fixes #16503

* Remove training argument and add default instance

---------
Co-authored-by: mmeloux <maxime.meloux@loria.fr>

9b435204

17 Apr, 2023 1 commit

Introduce `PartialState` as the device handler in the `Trainer` (#22752) · 03462875

Zachary Mueller authored Apr 17, 2023



* Use accelerate for device management

* Add accelerate to setup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

03462875

12 Apr, 2023 1 commit
- [tests] switch to torchrun (#22712) · 1306b7d3
  Stas Bekman authored Apr 12, 2023
  
  1306b7d3
04 Apr, 2023 1 commit

Implemented safetensors checkpoints save/load for Trainer (#22498) · 871598be

Viktor Scherbakov authored Apr 04, 2023



* implemented safetensors save/load

* remove duplicated file

* added tests

* more tests

* style fix

* fix tf tests

* change to list comprehension
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* review fixes + safe load for sharded checkpoint

* style fix

* remove rogue import

* remove partial to avoid undefined exception

* use naming alias instead of safetensors.torch

* fix safe sharding in tests

* grammar
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* update docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* update docs
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* minor corrections

* style

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

871598be

16 Mar, 2023 1 commit

(#22204) · 5110e574

Yih-Dar authored Mar 16, 2023



* py38 + torch 2

* increment cache versions

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5110e574

10 Mar, 2023 1 commit
- handle numpy inputs in whole word mask data collator (#22032) · 2f4cdd97
  Dean Wyatte authored Mar 10, 2023
  
  2f4cdd97
09 Mar, 2023 1 commit

Remove set_access_token usage + fail tests if FutureWarning (#22051) · 923110b7

Lucain authored Mar 09, 2023



* Remove set_access_token usage + fail tests if FutureWarning

* do not fail on FutureWarning in CI

---------
Co-authored-by: testbot <lucainp@hf.co>

923110b7

28 Feb, 2023 1 commit
- Fix flaky test for log level (#21776) · b29e2dca
  Sylvain Gugger authored Feb 28, 2023
```
* Fix flaky test for log level

* Fix other flaky test
```
  b29e2dca
23 Feb, 2023 1 commit
- Skip test_log_level for now · aa3787c8
  ydshieh authored Feb 23, 2023
  
  aa3787c8
22 Feb, 2023 1 commit
- Respect documentation on passive log level (#21700) · b19d64d8
  Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
  b19d64d8