Commits · 2418c64a1cfe317bb8d238d3670693799276d16d · chenpangpang / transformers

01 Feb, 2024 1 commit
- [docs] fix some bugs about parameter description (#28806) · d98591a1
  zspo authored Feb 02, 2024
```
Co-authored-by: p_spozzhang <p_spozzhang@tencent.com>
```
  d98591a1
30 Jan, 2024 1 commit

Matt authored Jan 30, 2024



* Pin torch to <2.2.0

* Pin torchvision and torchaudio as well

* Playing around with versions to see if this helps

* twiddle something to restart the CI

* twiddle it back

* Try changing the natten version

* make fixup

* Revert "Try changing the natten version"

This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.

* make fixup

* fix fix fix

* fix fix fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

74c9cfea

29 Jan, 2024 1 commit
- Fix input data file extension in examples (#28741) · 39fa4009
  Klaus Hipp authored Jan 29, 2024
  
  39fa4009
26 Jan, 2024 1 commit
- [docs] Fix datasets in guides (#28715) · abe0289e
  Steven Liu authored Jan 26, 2024
```
* change datasets

* fix
```
  abe0289e
22 Jan, 2024 2 commits
- Fix lr_scheduler in no_trainer training scripts (#27872) · deb2b590
  bofeng huang authored Jan 22, 2024
```
* Fix lr_scheduler

* Fix lr scheduler
```
  deb2b590
- Fix id2label assignment in run_classification.py (#28590) · f0acf7b6
  jheitmann authored Jan 22, 2024
  
  f0acf7b6
19 Jan, 2024 1 commit
- v4.38.dev.0 · b2748a6e
  Amy Roberts authored Jan 19, 2024
  
  b2748a6e
18 Jan, 2024 2 commits

Making CTC training example more general (#28582) · 772307be

Yoach Lacombe authored Jan 18, 2024



* add w2v2bert compatibility

* Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

772307be

Add new meta w2v2-conformer BERT-like model (#28165) · d2cdefb9

Yoach Lacombe authored Jan 18, 2024



* first commit

* correct default value non causal

* update config and modeling code

* update converting checkpoint

* clean modeling and fix tests

* make style

* add new config parameters to docstring

* fix copied from statements

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* make position_embeddings_type docstrings clearer

* clean converting script

* remove function not used

* clean modeling file

* apply suggestion for test file + add convert script to not_doctested

* modify tests according to review - cleaner logic and more tests

* Apply nit suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add checker of valid position embeddings type

* instantiate new layer norm layer with the right eps

* fix freeze_feature_encoder since it can be None in some cases

* add test same output in convert script

* restore wav2vec2conformer and add new model

* create processor and FE + clean

* add new model code

* fix convert script and set default config parameters

* correct model id paths

* make style

* make fix-copies and cleaning files

* fix copied from statements

* complete .md and fix copies

* clean convert script argument defaults

* fix config parameters docstrings

* fix config docstring

* add copied from and enrich FE tests

* fix copied from and repo-consistency

* add autotokenizer

* make test input length shorter and change docstring code

* fix docstrings and copied from

* add add_adapter to ASR training example

* make testing of adapters more robust

* adapt to multi adapter layers

* refactor input_values->input_features and remove w2v2-bert feature extractor

* remove pretraining model

* remove deprecated features and useless lines

* add copied from and ignore statements to modeling tests

* remove pretraining model #2

* change import in convert script

* change default in convert script

* update readme and remove useless line

* Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor BERT to Bert for consistency

* remove useless ignore copy statement

* add persistent to buffer in rotary

* add eps in LayerNorm init and remove copied from

* add adapter activation parameters and add copied from statements

* Fix copied statements and add unittest.skip reasons

* add copied statement in test_processor

* refactor processor

* make style

* replace numpy random by torch rand

* remove expected output CTC

* improve converting script with processor class

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* remove gumbel class

* remove tests related to previously deleted class

* Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* correct typos

* remove unused parameters

* update processor to take both text and audio

* update checkpoints

* update expected output and add ctc expected output

* add label_attention_mask

* replace pt with np in processor tests

* fix typo

* revert to behaviour with labels_attention_mask

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

d2cdefb9

16 Jan, 2024 1 commit

Remove `task` arg in `load_dataset` in image-classification example (#28408) · 0cdcd7a2

regisss authored Jan 16, 2024

* Remove `task` arg in `load_dataset` in image-classification example

* Manage case where "train" is not in dataset

* Add new args to manage image and label column names

* Similar to audio-classification example

* Fix README

* Update tests

0cdcd7a2

15 Jan, 2024 2 commits
- improve dev setup comments and hints (#28495) · ff86bc36
  Timothy Cronin authored Jan 15, 2024
```
* improve dev setup comments and hints

* fix tests for new dev setup hints
```
  ff86bc36
- Don't set `finetuned_from` if it is a local path (#28482) · 64bdbd88
  Yih-Dar authored Jan 15, 2024
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  64bdbd88
12 Jan, 2024 1 commit
- TF: purge `TFTrainer` (#28483) · 4fb3d3a0
  Joao Gante authored Jan 12, 2024
  
  4fb3d3a0
11 Jan, 2024 1 commit

Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15

Alex Hedges authored Jan 11, 2024

While using `run_clm.py`,[^1] I noticed that some files were being added
to my global cache, not the local cache. I set the `cache_dir` parameter
for the one call to `evaluate.load()`, which partially solved the
problem. I figured that while I was fixing the one script upstream, I
might as well fix the problem in all other example scripts that I could.

There are still some files being added to my global cache, but this
appears to be a bug in `evaluate` itself. This commit at least moves
some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation:
`evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
all modified files with `ruff` serving as useful guidance. During the
process, I removed one existing usage of the `cache_dir` parameter in a
script that did not have a corresponding `--cache-dir` argument
declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
v4.34.1 of the library. For the original code, see the following URL:
https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.

95091e15
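
The regex-based transformation described in the commit message above can be reproduced roughly as follows. This is a minimal sketch, not the author's actual tooling: the helper name `add_cache_dir` is hypothetical, and the replacement uses Python's `\1` group syntax in place of the `$1` shown in the message.

```python
import re

# Pattern taken from the commit message; the replacement is the same idea
# written with Python's \1 backreference instead of $1.
PATTERN = re.compile(r"evaluate\.load\((.*?)\)")
REPLACEMENT = r"evaluate.load(\1, cache_dir=model_args.cache_dir)"


def add_cache_dir(source: str) -> str:
    """Append cache_dir=... to each evaluate.load(...) call (hypothetical helper)."""
    return PATTERN.sub(REPLACEMENT, source)


before = 'metric = evaluate.load("accuracy")'
print(add_cache_dir(before))
# prints: metric = evaluate.load("accuracy", cache_dir=model_args.cache_dir)
```

Because the non-greedy group stops at the first closing parenthesis, a call with nested parentheses such as `evaluate.load(name())` is rewritten incorrectly, which is consistent with the manual clean-up pass the message mentions.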

13 Dec, 2023 1 commit
- Dev version · 3ed3e319
  Lysandre authored Dec 13, 2023
  
  3ed3e319
11 Dec, 2023 1 commit

fix no sequence length models error (#27522) · 4850aaba

Adam Louly authored Dec 11, 2023

* fix no sequence length models error

* block size check

---------

Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>

4850aaba

30 Nov, 2023 1 commit
- uses dvclive_test mode in examples/pytorch/test_accelerate_examples.py (#27763) · fe41647a
  Dave Berenbaum authored Nov 30, 2023
  
  fe41647a
27 Nov, 2023 1 commit

docs: replace torch.distributed.run by torchrun (#27528) · ce315081

Peter Pan authored Nov 28, 2023



* docs: replace torch.distributed.run by torchrun

 `transformers` now officially supports PyTorch >= 1.10.
 The entrypoint `torchrun` is present from 1.10 onwards.
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>

* Update src/transformers/trainer.py

with @ArthurZucker's suggestion
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ce315081

20 Nov, 2023 1 commit

[examples] fix loading jsonl with load dataset in run translation example (#26924) · f31af392

Mathias Nielsen authored Nov 20, 2023



* Renamed variable extension to builder_name

* If builder name is jsonl, change to json to align with load_dataset

* Apply suggestions from code review
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

---------
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

f31af392
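
The jsonl handling named in the commit above could look roughly like this. It is a sketch under assumptions: `builder_name_for` is a hypothetical helper, not a function from the example script, and it assumes the builder name is derived from the data file's extension and that JSON Lines files are read by the `json` builder.

```python
from pathlib import Path


def builder_name_for(data_file: str) -> str:
    """Derive a dataset builder name from a file extension (hypothetical helper)."""
    builder_name = Path(data_file).suffix.lstrip(".")
    # Assumption: there is no separate "jsonl" builder, so JSON Lines
    # files are routed to the "json" builder instead.
    if builder_name == "jsonl":
        builder_name = "json"
    return builder_name


for path in ["train.jsonl", "validation.json", "test.csv"]:
    print(path, "->", builder_name_for(path))
```

The returned name would then be passed as the first argument to `load_dataset`, together with the data files.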

17 Nov, 2023 1 commit
- Broken links fixed related to datasets docs (#27569) · ffbcfc01
  V.Prasanna kumar authored Nov 18, 2023
```
fixed the broken links that belong to the datasets library docs in transformers
```
  ffbcfc01
15 Nov, 2023 3 commits

Incorrect setting for num_beams in translation and summarization examples (#27519) · 2e72bbab

Matt authored Nov 15, 2023



* Remove the torch main_process_first context manager from TF examples

* Correctly set num_beams=1 in our examples, and add a guard in GenerationConfig.validate()

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

2e72bbab

Fixing the failure of models without max_position_embeddings attribute. (#27499) · e6522e49

Adam Louly authored Nov 15, 2023

fix max pos issue

Co-authored-by: Adam Louly <adamlouly@microsoft.com@orttrainingdev9.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>

e6522e49

Fix wav2vec2 params (#27515) · a85ea4b1
Zach Mueller authored Nov 15, 2023
```
Fix test
```
a85ea4b1

09 Nov, 2023 3 commits
- Final fix of the accelerate installation issue (#27408) · c8b6052f
  Yih-Dar authored Nov 09, 2023
```
* fix

* [test-all] commit

* fix

* [test-all] commit

* [test-all] commit

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  c8b6052f
- Adds dvclive callback (#27352) · 791ec370
  Dave Berenbaum authored Nov 09, 2023
```
* dvclive trainer callback

* style fixes

* dvclive link fixes
```
  791ec370
- Change thresh in test (#27378) · e9adb0c9
  Zach Mueller authored Nov 09, 2023
```
Change thresh
```
  e9adb0c9
08 Nov, 2023 3 commits
- Remove unused param from example script tests (#27354) · 845aa832
  Zach Mueller authored Nov 08, 2023
```
Unused param
```
  845aa832
- Fix example tests from failing (#27353) · efa57cb2
  Zach Mueller authored Nov 08, 2023
```
* Fix example tests from failing

* Change thresh
```
  efa57cb2
- moving example of benchmarking to legacy dir (#27337) · b6dbfee0
  Hz, Ji authored Nov 08, 2023
```
move example of benchmarking to legacy
```
  b6dbfee0
02 Nov, 2023 1 commit
- Dev version · bc78fd12
  Lysandre authored Nov 02, 2023
  
  bc78fd12
31 Oct, 2023 1 commit
- Unify warning styles for better readability (#27184) · 25e6e941
  Dong-geon Lee authored Nov 01, 2023
  
  25e6e941
30 Oct, 2023 1 commit
- make tests of pytorch_example device agnostic (#27081) · cd19b193
  Hz, Ji authored Oct 30, 2023
  
  cd19b193
29 Oct, 2023 1 commit
- [Typo fix] flag config in WANDB (#27130) · 722e9364
  Gema Parreño authored Oct 29, 2023
```
typo fix flag config
```
  722e9364
27 Oct, 2023 1 commit
- Provide alternative when warning on use_auth_token (#27105) · 66b088fa
  Lucain authored Oct 27, 2023
  
  66b088fa
24 Oct, 2023 1 commit

Normalize only if needed (#26049) · e2d6d5ce

Michal Jamroz authored Oct 24, 2023



* Normalize only if needed

* Update examples/pytorch/image-classification/run_image_classification.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* if else in one line

* within block

* one more place, sorry for mess

* import order

* Update examples/pytorch/image-classification/run_image_classification.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update examples/pytorch/image-classification/run_image_classification_no_trainer.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e2d6d5ce

23 Oct, 2023 1 commit
- fix logit-to-multi-hot conversion in example (#26936) · f71c9ccf
  YQ authored Oct 23, 2023
```
* fix logit to multi-hot conversion

* add comments

* typo
```
  f71c9ccf
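
For readers unfamiliar with the logit-to-multi-hot conversion that the commit above fixes, here is a minimal pure-Python sketch. It is not the code from the example script: `logits_to_multi_hot` is a hypothetical name, and the threshold of 0 is an assumption (a logit of 0 corresponds to a sigmoid probability of 0.5).

```python
def logits_to_multi_hot(logits, threshold=0.0):
    """Turn per-label logits into 0/1 indicators (hypothetical helper).

    Each label is decided independently, so a row can contain several 1s
    or none at all, unlike an argmax that always picks exactly one label.
    """
    return [[1 if value > threshold else 0 for value in row] for row in logits]


batch = [[2.3, -1.0, 0.4], [-0.2, -3.1, -0.5]]
print(logits_to_multi_hot(batch))
# prints: [[1, 0, 1], [0, 0, 0]]
```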
12 Oct, 2023 1 commit
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
11 Oct, 2023 1 commit
- Fix checkpoint path in `no_trainer` scripts (#26733) · 1d6a8474
  Zach Mueller authored Oct 11, 2023
```
checkpoint path
```
  1d6a8474
10 Oct, 2023 1 commit
- Fix source_prefix default value (#26654) · 3eceaa36
  jheitmann authored Oct 10, 2023
  
  3eceaa36
04 Oct, 2023 1 commit

refactor: change default block_size (#26229) · 6015f91a

Phuc Van Phan authored Oct 04, 2023

* refactor: change default block_size

* fix: return tf to origin

* fix: change files to origin

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* rebase

* refactor: add min block_size to files

* reformat: add min block_size for run_clm tf

6015f91a
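
Several commits in this log touch how the language-modeling examples choose `block_size` (the default value, a minimum against the model limit, and models without `max_position_embeddings`). The sketch below illustrates one way such a guard might work. It is not the code from those commits: `choose_block_size` and `DEFAULT_BLOCK_SIZE` are hypothetical names, and the cap of 1024 is an assumption.

```python
from types import SimpleNamespace

DEFAULT_BLOCK_SIZE = 1024  # assumed cap, not taken from the commits


def choose_block_size(requested, tokenizer_max_length, config):
    """Pick a block size within tokenizer and model limits (hypothetical helper)."""
    # Some configs have no max_position_embeddings attribute; fall back to the cap.
    max_pos = getattr(config, "max_position_embeddings", DEFAULT_BLOCK_SIZE)
    if requested is not None:
        # An explicit request is only capped by what the tokenizer can handle.
        return min(requested, tokenizer_max_length)
    block_size = min(tokenizer_max_length, DEFAULT_BLOCK_SIZE)
    if max_pos > 0:
        block_size = min(block_size, max_pos)
    return block_size


# A model limited to 512 positions, and one whose config lacks the attribute.
print(choose_block_size(None, 10**30, SimpleNamespace(max_position_embeddings=512)))
print(choose_block_size(None, 10**30, SimpleNamespace()))
```

Using `getattr` with a default keeps the check from raising `AttributeError` on configs that do not define a maximum sequence length.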