1. 12 Jan, 2024 1 commit
  2. 11 Jan, 2024 1 commit
    • Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      search for `evaluate\.load\((.*?)\)` and replace it with
      `evaluate.load($1, cache_dir=model_args.cache_dir)`. After applying it, I
      manually cleaned up all modified files, with `ruff` serving as useful
      guidance. During the process, I removed one existing usage of the
      `cache_dir` parameter in a script that did not have a corresponding
      `--cache-dir` argument declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
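      A minimal sketch of the change applied across the example scripts: the
      script's cache directory is passed through to `evaluate.load()` so metric
      files land in the local cache rather than the global one. The "accuracy"
      metric and the literal path are illustrative; the real scripts pass
      `model_args.cache_dir`.

      ```python
      import evaluate

      cache_dir = "./cache"  # illustrative; the scripts use model_args.cache_dir

      # Before: metric files were written to the global Hugging Face cache.
      # metric = evaluate.load("accuracy")

      # After: metric files are stored under the script's local cache directory.
      metric = evaluate.load("accuracy", cache_dir=cache_dir)
      ```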
  3. 13 Dec, 2023 1 commit
  4. 11 Dec, 2023 1 commit
    • fix no sequence length models error (#27522) · 4850aaba
      Adam Louly authored
      * fix no sequence length models error
      
      * block size check
      
      ---------
      
      Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
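      A minimal sketch, assuming a run_clm-style script, of the kind of block-size
      guard this commit describes: handling models whose config does not expose a
      usable maximum sequence length. The names (`max_position_embeddings`, the
      1024 fallback) follow common Transformers conventions but are illustrative
      here, not the exact upstream diff.

      ```python
      from types import SimpleNamespace


      def pick_block_size(tokenizer_max_length: int, config, fallback: int = 1024) -> int:
          # Some configs define no max_position_embeddings at all, which would
          # otherwise crash the comparison against the tokenizer's limit.
          max_pos = getattr(config, "max_position_embeddings", None)
          if max_pos is None:
              return min(tokenizer_max_length, fallback)
          if tokenizer_max_length > max_pos:
              # Tokenizers without a real limit report a huge sentinel value.
              return min(fallback, max_pos)
          return tokenizer_max_length


      print(pick_block_size(int(1e30), SimpleNamespace()))                              # -> 1024
      print(pick_block_size(int(1e30), SimpleNamespace(max_position_embeddings=2048)))  # -> 1024
      ```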
  5. 30 Nov, 2023 1 commit
  6. 27 Nov, 2023 1 commit
  7. 20 Nov, 2023 1 commit
  8. 17 Nov, 2023 1 commit
  9. 15 Nov, 2023 3 commits
  10. 09 Nov, 2023 3 commits
  11. 08 Nov, 2023 3 commits
  12. 02 Nov, 2023 1 commit
  13. 31 Oct, 2023 1 commit
  14. 30 Oct, 2023 1 commit
  15. 29 Oct, 2023 1 commit
  16. 27 Oct, 2023 1 commit
  17. 24 Oct, 2023 1 commit
  18. 23 Oct, 2023 1 commit
  19. 12 Oct, 2023 1 commit
  20. 11 Oct, 2023 1 commit
  21. 10 Oct, 2023 1 commit
  22. 04 Oct, 2023 1 commit
    • refactor: change default block_size (#26229) · 6015f91a
      Phuc Van Phan authored
      * refactor: change default block_size
      
      * fix: return tf to origin
      
      * fix: change files to origin
      
      * rebase
      
      * refactor: add min block_size to files
      
      * reformat: add min block_size for run_clm tf
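      A minimal sketch of why a minimum block_size check matters in a
      run_clm-style script: grouping texts into blocks of size 0 or 1 produces
      degenerate training examples, so the requested value is clamped. The floor
      value and function name are illustrative, not the library's exact code.

      ```python
      def clamp_block_size(requested: int, model_max_length: int, minimum: int = 2) -> int:
          # Reject block sizes too small to form a next-token prediction pair.
          if requested < minimum:
              raise ValueError(f"block_size must be at least {minimum}, got {requested}")
          # Never exceed what the tokenizer/model can actually handle.
          return min(requested, model_max_length)


      print(clamp_block_size(512, 1024))   # -> 512
      print(clamp_block_size(4096, 1024))  # -> 1024
      ```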
  23. 03 Oct, 2023 1 commit
  24. 28 Sep, 2023 1 commit
  25. 12 Sep, 2023 1 commit
  26. 11 Sep, 2023 2 commits
  27. 05 Sep, 2023 2 commits
  28. 04 Sep, 2023 1 commit
  29. 01 Sep, 2023 1 commit
  30. 23 Aug, 2023 1 commit
  31. 21 Aug, 2023 1 commit
  32. 15 Aug, 2023 1 commit
    • Make training args fully immutable (#25435) · ca514992
      Zach Mueller authored
      * Make training args fully immutable
      
      * Working tests, PyTorch
      
      * In test_trainer
      
      * during testing
      
      * Use proper dataclass way
      
      * Fix test
      
      * Another one
      
      * Fix tf
      
      * Lingering slow
      
      * Exception
      
      * Clean
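      A minimal sketch, assuming a frozen-after-init approach, of how an
      arguments dataclass can be made immutable so that later code (such as a
      trainer) cannot silently mutate it. The `_frozen` flag and class name are
      illustrative, not the actual TrainingArguments implementation.

      ```python
      from dataclasses import FrozenInstanceError, dataclass, field


      @dataclass
      class ImmutableArgsSketch:
          learning_rate: float = 5e-5
          num_train_epochs: int = 3
          _frozen: bool = field(default=False, init=False, repr=False)

          def __post_init__(self):
              # Derived values would be computed here; afterwards the instance is sealed.
              self._frozen = True

          def __setattr__(self, name, value):
              if getattr(self, "_frozen", False):
                  raise FrozenInstanceError(f"cannot assign to field {name!r} after init")
              super().__setattr__(name, value)


      args = ImmutableArgsSketch(learning_rate=3e-5)
      try:
          args.learning_rate = 1e-4  # tests would build a new instance instead
      except FrozenInstanceError as err:
          print(err)
      ```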