"tests/models/led/test_modeling_tf_led.py" did not exist on "d0b3797a3be095f74659341ed396cc8bccff96f6"
- 26 Jun, 2024 1 commit
amyeroberts authored
* Skip tests properly
* [test_all]
* Add 'reason' as kwarg for skipTest
* [test_all] Fix up
* [test_all]
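For context, `unittest` already names its skip argument `reason`, so the change lines up the library's helpers with a call like the following (a minimal sketch, not the PR's code):

```python
import unittest

class ExampleTest(unittest.TestCase):
    def test_backend_feature(self):
        # unittest.TestCase.skipTest takes the reason as its only
        # argument, so it can be passed by keyword.
        self.skipTest(reason="feature not supported on this backend")

if __name__ == "__main__":
    unittest.main()
```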
- 17 Jun, 2024 1 commit
Bastien Le Chenadec authored
* Support multiple validation datasets when dataloader_persistent_workers=True
* Test support of multiple validation datasets
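This builds on the `Trainer`'s support for a dict of evaluation datasets. A self-contained sketch (the toy model and datasets are illustrative, not from the commit):

```python
import torch
from torch import nn
from torch.utils.data import Dataset
from transformers import Trainer, TrainingArguments

class ToyDataset(Dataset):
    """Tiny regression dataset standing in for a real validation set."""
    def __init__(self, n=16):
        self.x = torch.randn(n, 4)
        self.y = torch.randn(n, 1)
    def __len__(self):
        return len(self.x)
    def __getitem__(self, i):
        return {"x": self.x[i], "labels": self.y[i]}

class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 1)
    def forward(self, x, labels=None):
        logits = self.linear(x)
        loss = nn.functional.mse_loss(logits, labels) if labels is not None else None
        return {"loss": loss, "logits": logits}

if __name__ == "__main__":
    args = TrainingArguments(
        output_dir="out",
        eval_strategy="epoch",
        num_train_epochs=1,
        dataloader_num_workers=2,
        dataloader_persistent_workers=True,  # the case this commit fixes
        report_to=[],
    )
    val_a, val_b = ToyDataset(), ToyDataset()
    trainer = Trainer(
        model=ToyModel(),
        args=args,
        train_dataset=ToyDataset(),
        eval_dataset={"val_a": val_a, "val_b": val_b},  # one loader per dataset
    )
    trainer.train()  # metrics are logged as eval_val_a_* and eval_val_b_*
```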
- 07 Jun, 2024 1 commit
조준래 authored
* Implement JSON dump conversion for torch_dtype in TrainingArguments
* Add unit test for converting torch_dtype in TrainingArguments to JSON
* move unit test for converting torch_dtype into TrainerIntegrationTest class
* reformatting using ruff
* convert dict_torch_dtype_to_str to private method _dict_torch_dtype_to_str
Co-authored-by: jun.4 <jun.4@kakaobrain.com>
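A sketch of what the conversion enables; `MyArguments` and its `model_init_kwargs` field are hypothetical stand-ins for a subclass carrying a `torch.dtype` inside a dict-valued field:

```python
from dataclasses import dataclass, field

import torch
from transformers import TrainingArguments

@dataclass
class MyArguments(TrainingArguments):
    # Hypothetical dict-valued field holding a torch.dtype, the case
    # that used to break JSON serialization.
    model_init_kwargs: dict = field(default_factory=dict)

args = MyArguments(output_dir="out",
                   model_init_kwargs={"torch_dtype": torch.float16})
# to_dict()/to_json_string() now walk nested dicts and emit "float16".
print(args.to_json_string())
```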
- 21 May, 2024 3 commits
Zach Mueller authored
* Enforce saving at end of training
* Fix test
* Rework test
* Fixup tests
* Update comment based on sourab feedback
* Clean
Mohit Sharma authored
* add fix
* update import
* updated dicts and comments
* remove prints
* Update testing_utils.py
Younes Belkada authored
* add V1 - adalomo not working yet
* add todo docs + refactor from comments
* adjust LR
* add docs
* add more elaborated test
* Apply suggestions from code review
* fix
* push
* add accelerate check
* fix DDP case
* Apply suggestions from code review
* fix
* init kwargs
* safely add attribute
* revert to enum logic
* Update src/transformers/trainer.py
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
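Enabling the integration is a one-liner on the arguments side; a sketch assuming the `lomo-optim` backend package is installed:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    optim="adalomo",      # the optimizer name this PR wires up
    learning_rate=1e-3,
)
```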
- 20 May, 2024 1 commit
Zach Mueller authored
* Introduce configured_state
* Include note on tuning
* Allow for users to have defined a state already
* Include tests
* Add note on hpam tune
* Guard a bit better
* Update src/transformers/training_args.py
* Update src/transformers/training_args.py
* Finish rebase
* Finish rebase
* Guard carefully
* Fixup test
* Refactor
* Fin refactor
* Comment
* Update wrt feedback
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
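The idea, sketched under the assumption that the knob landed as the `use_configured_state` key of `accelerator_config`: create the distributed state yourself, then tell `TrainingArguments` to reuse it instead of resetting it.

```python
from accelerate import PartialState
from transformers import TrainingArguments

# User-defined state, created before the TrainingArguments.
state = PartialState()

args = TrainingArguments(
    output_dir="out",
    accelerator_config={"use_configured_state": True},  # reuse `state`
)
```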
- 13 May, 2024 1 commit
fxmarty authored
* update to ROCm 6.0.2 and test MI300
* add callers for mi300
* update dockerfile
* fix trainer tests
* remove apex
* style
* Update tests/trainer/test_trainer_seq2seq.py
* Update tests/trainer/test_trainer_seq2seq.py
* Update tests/trainer/test_trainer_seq2seq.py
* Update tests/trainer/test_trainer_seq2seq.py
* update to torch 2.3
* add workflow dispatch target
* we may need branches: mi300-ci after all
* nit
* fix docker build
* nit
* add check runner
* remove docker-gpu
* fix issues
* fix
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- 06 May, 2024 2 commits
Nate Cibik authored
* Added cache clearing for GPU efficiency.
* Added cache clearing for GPU efficiency.
* Added batch_eval_metrics capability
* Ran make fixup
* Fixed bug
* Fixed whitespace issue
* Fixed outdated condition
* Updated docstrings with instructions for batch_eval_metrics. Updated end of dataloader logic
* Added first version of batch_eval_metrics Trainer test
* Fixed batch_eval_metrics Trainer tests for both eval and predict
* Fixed batch_eval_metrics behavior for new Trainer variables
* Fixed batch_eval_metrics Trainer tests
* Ran fixup
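A sketch of the batch_eval_metrics flow (not the commit's test code): `compute_metrics` now runs on every batch and is told via `compute_result` when it sees the final batch, so metrics can be accumulated without holding all predictions in memory. Inputs arrive as torch tensors in this mode; a single logits tensor is assumed below.

```python
from transformers import TrainingArguments

state = {"correct": 0, "seen": 0}

def compute_metrics(eval_pred, compute_result=False):
    logits, labels = eval_pred.predictions, eval_pred.label_ids
    preds = logits.argmax(dim=-1)            # assumes one logits tensor
    state["correct"] += (preds == labels).sum().item()
    state["seen"] += labels.numel()
    if compute_result:                        # final batch: emit and reset
        accuracy = state["correct"] / max(state["seen"], 1)
        state.update(correct=0, seen=0)
        return {"accuracy": accuracy}

# Pass compute_metrics and these args to Trainer as usual.
args = TrainingArguments(output_dir="out", batch_eval_metrics=True)
```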
Clara Pohland authored
* Trainer: load checkpoint model with multiple adapters
* Trainer._load_from_checkpoint support multiple active adapters
* PeftModel.set_adapter does not support multiple adapters yet
* Trainer._load_from_checkpoint test multiple adapters
Co-authored-by: Clara Luise Pohland <clara-luise.pohland@telekom.de>
- 19 Apr, 2024 1 commit
Marc Sun authored
* Use unwrap with the one in accelerate
* oops
* update unwrap
* fix
* wording
* raise error instead
* comment
* doc
* Update src/transformers/modeling_utils.py
* style
* put else
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
- 18 Apr, 2024 1 commit
Zach Mueller authored
* Alias
* Note alias
* Tests and src
* Rest
* Clean
* Change typing?
* Fix tests
* Deprecation versions
- 16 Apr, 2024 1 commit
Zach Mueller authored
* Raise relevant err
* Use type instead
- 31 Mar, 2024 1 commit
Zach Mueller authored
* Start rework
* Fix failing test
* Include max
* Update src/transformers/trainer.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
- 28 Mar, 2024 2 commits
Yu Chin Fabian Lim authored
* add gradient_accumulation_kwargs to AcceleratorConfig
* add suggestions from @muellerzr to docstrings, new behavior and tests
* Documentation suggestions from @muellerz
* addressed @muellerzr comments regarding tests and test utils
* moved accelerate version to top of file
* @muellerzr's variable fix
* address @amyeroberts. fix tests and docstrings
* address @amyeroberts additional suggestions
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
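A sketch of the new pass-through; `sync_with_dataloader` is one of accelerate's `GradientAccumulationPlugin` options (the number of steps itself still comes from `gradient_accumulation_steps`):

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    gradient_accumulation_steps=4,
    accelerator_config={
        # forwarded to accelerate's GradientAccumulationPlugin
        "gradient_accumulation_kwargs": {"sync_with_dataloader": False},
    },
)
```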
Christopher Keibel authored
* add functions to get number of params which require grad, get optimizer group for parameters and get learning rates of param groups to trainer.py
* add tests and raise ValueError when optimizer is None
* add second layer to test and freeze its weights
* check if torch is available before running tests
* use decorator to check if torch is available
* fix test indentation
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
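A sketch of the three helpers, assuming a `trainer` like the toy one earlier in this log; the optimizer-dependent ones raise `ValueError` until an optimizer exists (e.g. once training has started):

```python
# Count of parameters with requires_grad=True.
n_trainable = trainer.get_num_trainable_parameters()

# One learning rate per optimizer param group (requires an optimizer).
lrs = trainer.get_learning_rates()

# The optimizer group a given parameter belongs to (requires an optimizer).
group = trainer.get_optimizer_group(next(trainer.model.parameters()))
```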
- 26 Mar, 2024 2 commits
Yanyi Liu authored
* Add cosine_with_min_lr scheduler
* Update error message for missing min_lr or min_lr_rate
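Usage sketch: the schedule decays to a floor rather than zero, with the floor given either as an absolute `min_lr` or as `min_lr_rate`, a fraction of the peak rate (omitting both raises the error this commit reworded):

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    learning_rate=5e-5,
    lr_scheduler_type="cosine_with_min_lr",
    lr_scheduler_kwargs={"min_lr_rate": 0.1},  # floor at 10% of the peak LR
)
```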
Jonathan Flynn authored
* add warnings if training args differ from checkpoint args stored in trainer_state.json
* run formatting and styling
* add a test
* format and styling
Co-authored-by: Jonathan Flynn <jonl.flynn@guardian.co.uk>
- 19 Mar, 2024 1 commit
Younes Belkada authored
* add galore v1
* add import
* add tests and doc
* fix doctest
* forward contrib credits from discussions
* forward contrib credits from discussions
* Apply suggestions from code review
* fix failing tests
* switch to `optim_target_modules` and clarify docs
* more clarification
* enhance lookup logic
* update a test to add peak memory
* add regex, all-linear and single string support
* add layer-wise optimization through DummyOptimizers and LRSchedulers
* forward contrib credits from discussions and original idea
* add a section about DDP not supported in layerwise
* Update src/transformers/trainer.py
* fix self
* check only if layer_wise
* Update src/transformers/training_args.py
* oops
* make use of intervals
* clarify comment
* add matching tests
* GaLoRe -> GaLore
* move to `get_scheduler`
* add note on docs
* add a warning
* adapt a bit the docs
* update docstring
* support original API
* Update docs/source/en/trainer.md
* slightly refactor
* Update docs/source/en/trainer.md
* Update src/transformers/training_args.py
* fix args parsing and add tests
* remove warning for regex
* fix type hint
* add note about extra args
* make `is_regex` return optional
Co-authored-by: Maxime <maximegmd@users.noreply.github.com>
Co-authored-by: Wing Lian <winglian@users.noreply.github.com>
Co-authored-by: Zach Mueller <muellerzr@gmail.com>
Co-authored-by: hiyouga <hiyouga@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Matthew Douglas <38992547+matthewdouglas@users.noreply.github.com>
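A sketch of the resulting API, assuming the `galore-torch` backend package is installed: pick a `galore_*` optimizer and say which modules to project.

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    optim="galore_adamw",   # layer-wise variant: "galore_adamw_layerwise"
    # Matched against module names; regex strings and "all-linear" also work.
    optim_target_modules=["attn", "mlp"],
    learning_rate=2e-4,
)
```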
- 08 Mar, 2024 1 commit
Fanli Lin authored
[tests] use the correct `n_gpu` in `TrainerIntegrationTest::test_train_and_eval_dataloaders` for XPU (#29307)
* fix n_gpu
* fix style
- 04 Mar, 2024 1 commit
Zach Mueller authored
Fully revert atomic checkpointing
- 01 Mar, 2024 1 commit
Zach Mueller authored
* Fix deprecated arg issue
* Trainer check too
* Check for dict or dataclass
* Simplify, make config always AcceleratorConfig
* Upstream to Trainer
- 20 Feb, 2024 2 commits
Younes Belkada authored
* handle peft + compiled models
* add tests
* fixup
* adapt from suggestions
* clarify comment
Younes Belkada authored
* add RMSProp to Trainer
* revert some change
* Update src/transformers/trainer.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
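On the user side this is just another `optim` value; a minimal sketch:

```python
from transformers import TrainingArguments

# "rmsprop" maps to torch.optim.RMSprop inside the Trainer.
args = TrainingArguments(output_dir="out", optim="rmsprop")
```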
- 16 Feb, 2024 2 commits
Zach Mueller authored
* Fix trainer test
* Update tests/trainer/test_trainer.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Lysandre Debut authored
* Script & Manual edition
* Update
- 14 Feb, 2024 3 commits
Younes Belkada authored
FIX [`Trainer` / tags]: Fix trainer + tags when users do not pass `"tags"` to `trainer.push_to_hub()` (#29009)
* fix trainer tags
* add test
Zach Mueller authored
* Introduce acceleratorconfig dataclass
* Extra second warn
* Move import
* Try moving import under is_accelerate_available
* Quality
* Apply suggestions from code review
* Clean
* Remove to_kwargs
* Change version
* Improve tests by including dispatch and split batches
* Improve reliability
* Update tests/trainer/test_trainer.py
* Fixup tests and review nits
* Make tests pass
* protect import
* Protect import
* Empty-Commit
* Make training_args.to_dict handle the AcceleratorConfig
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
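A sketch of the dataclass in use; the import path is an assumption based on where current releases keep it, and a plain dict with the same keys also works:

```python
from transformers import TrainingArguments
from transformers.trainer_pt_utils import AcceleratorConfig  # assumed location

args = TrainingArguments(
    output_dir="out",
    accelerator_config=AcceleratorConfig(
        split_batches=True,      # split each fetched batch across processes
        dispatch_batches=False,  # let every process iterate its own loader
    ),
)
```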
Huazhong Ji authored
Co-authored-by: unit_test <test@unit.com>
- 22 Jan, 2024 1 commit
Yih-Dar authored
* avoid root logger's level being changed
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- 10 Jan, 2024 2 commits
Zach Mueller authored
Fixup test
Zach Mueller authored
* Fix test
* Skip
- 20 Dec, 2023 1 commit
peter-sk authored
* move code to Trainer.evaluate to enable use of that function with multiple datasets
* test
* update doc string
* and a tip
* forgot the type
Co-authored-by: Prof. Peter Schneider-Kamp <jps@ordbogen.com>
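After this move, the dict handling lives in `evaluate()` itself; a sketch reusing the toy `trainer` and datasets from earlier in this log:

```python
# Each dataset is evaluated in turn; its metrics get a per-key prefix.
metrics = trainer.evaluate(eval_dataset={"val_a": val_a, "val_b": val_b})
print(metrics["eval_val_a_loss"], metrics["eval_val_b_loss"])
```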
- 11 Dec, 2023 1 commit
Zach Mueller authored
* Fix test for multi-GPU
* With CPU handle
- 08 Dec, 2023 2 commits
Zach Mueller authored
* Fulfill request
* Add test
* Better test
* Apply suggestions from code review
* Better test
* Better test
* More comments
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Jonathon Belotti authored
- 28 Nov, 2023 1 commit
Charbel Abi Daher authored
* Fix passing scheduler-specific kwargs through TrainingArguments `lr_scheduler_kwargs`
* Added test for lr_scheduler_kwargs
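With the fix, scheduler-specific arguments reach the scheduler factory; for example, `num_cycles` for the cosine-with-restarts schedule:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    lr_scheduler_type="cosine_with_restarts",
    lr_scheduler_kwargs={"num_cycles": 3},  # forwarded to the scheduler
)
```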
- 21 Nov, 2023 1 commit
Dave Berenbaum authored
* dvclive callback: warn instead of fail when logging non-scalars
* tests: log lr as scalar
- 16 Nov, 2023 1 commit
Arthur authored
* try to stylify using ruff
* might need to remove these changes?
* use ruff format and ruff check
* use isinstance instead of type comparison
* use # fmt: skip
* use # fmt: skip
* nits
* some styling changes
* update ci job
* nits isinstance
* more files update
* nits
* more nits
* small nits
* check and format
* revert wrong changes
* actually use formatter instead of checker
* nits
* well docbuilder is overwriting this commit
* revert notebook changes
* try to nuke docbuilder
* style
* fix feature extraction test
* remove `indent-width = 4`
* fixup
* more nits
* update the ruff version that we use
* style
* nuke docbuilder styling
* leave the print for detected changes
* nits
* Remove file I/O
* style
* nits
* revert notebook changes
* Add # fmt: skip when possible
* Add # fmt: skip when possible
* Fix
* More `# fmt: skip` usage
* More `# fmt: skip` usage
* More `# fmt: skip` usage
* Nits
* more fixes
* fix tapas
* Another way to skip
* Recommended way
* Fix two more files
* Remove async
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>
- 06 Nov, 2023 1 commit
Hz, Ji authored