Commits · 721ee783ca1b62bcbe85c96bfa5af5ef4a414de5 · chenpangpang / transformers

02 Feb, 2024 1 commit

[Docs] Fix spelling and grammar mistakes (#28825) · 721ee783

Klaus Hipp authored Feb 02, 2024

* Fix typos and grammar mistakes in docs and examples

* Fix typos in docstrings and comments

* Fix spelling of `tokenizer` in model tests

* Remove erroneous spaces in decorators

* Remove extra spaces in Markdown link texts

721ee783

01 Feb, 2024 1 commit
- [docs] fix some bugs about parameter description (#28806) · d98591a1
  zspo authored Feb 02, 2024
```
Co-authored-by: p_spozzhang <p_spozzhang@tencent.com>
```
  d98591a1
19 Jan, 2024 1 commit
- v4.38.dev.0 · b2748a6e
  Amy Roberts authored Jan 19, 2024
  
  b2748a6e
18 Jan, 2024 2 commits

Making CTC training example more general (#28582) · 772307be

Yoach Lacombe authored Jan 18, 2024



* add w2v2bert compatibility

* Update examples/pytorch/speech-recognition/run_speech_recognition_ctc.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

772307be

Add new meta w2v2-conformer BERT-like model (#28165) · d2cdefb9

Yoach Lacombe authored Jan 18, 2024



* first commit

* correct default value non causal

* update config and modeling code

* update converting checkpoint

* clean modeling and fix tests

* make style

* add new config parameters to docstring

* fix copied from statements

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* make position_embeddings_type docstrings clearer

* clean converting script

* remove function not used

* clean modeling file

* apply suggestion for test file + add convert script to not_doctested

* modify tests according to review - cleaner logic and more tests

* Apply nit suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add checker of valid position embeddings type

* instantiate new layer norm layer with the right eps

* fix freeze_feature_encoder since it can be None in some cases

* add test same output in convert script

* restore wav2vec2conformer and add new model

* create processor and FE + clean

* add new model code

* fix convert script and set default config parameters

* correct model id paths

* make style

* make fix-copies and cleaning files

* fix copied from statements

* complete .md and fixe copies

* clean convert script argument defaults

* fix config parameters docstrings

* fix config docstring

* add copied from and enrich FE tests

* fix copied from and repo-consistency

* add autotokenizer

* make test input length shorter and change docstring code

* fix docstrings and copied from

* add add_adapter to ASR training example

* make testing of adapters more robust

* adapt to multi adapter layers

* refactor input_values->input_features and remove w2v2-bert feature extractor

* remove pretraining model

* remove depreciated features and useless lines

* add copied from and ignore statements to modeling tests

* remove pretraining model #2

* change import in convert script

* change default in convert script

* update readme and remove useless line

* Update tests/models/wav2vec2_bert/test_processor_wav2vec2_bert.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* refactor BERT to Bert for consistency

* remove useless ignore copy statement

* add persistent to buffer in rotary

* add eps in LayerNorm init and remove copied from

* add adapter activation parameters and add copied from statements

* Fix copied statements and add unitest.skip reasons

* add copied statement in test_processor

* refactor processor

* make style

* replace numpy random by torch rand

* remove expected output CTC

* improve converting script with processor class

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* remove gumbel class

* remove tests related to previously deleted class

* Update src/transformers/models/wav2vec2_bert/configuration_wav2vec2_bert.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* correct typos

* remove uused parameters

* update processor to takes both text and audio

* update checkpoints

* update expected output and add ctc expected output

* add label_attention_mask

* replace pt with np in processor tests

* fix typo

* revert to behaviour with labels_attention_mask

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

d2cdefb9

11 Jan, 2024 1 commit

Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15

Alex Hedges authored Jan 11, 2024

While using `run_clm.py`,[^1] I noticed that some files were being added
to my global cache, not the local cache. I set the `cache_dir` parameter
for the one call to `evaluate.load()`, which partially solved the
problem. I figured that while I was fixing the one script upstream, I
might as well fix the problem in all other example scripts that I could.

There are still some files being added to my global cache, but this
appears to be a bug in `evaluate` itself. This commit at least moves
some of the files into the local cache, which is better than before.

To create this PR, I made the following regex-based transformation:
`evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
all modified files with `ruff` serving as useful guidance. During the
process, I removed one existing usage of the `cache_dir` parameter in a
script that did not have a corresponding `--cache-dir` argument
declared.

[^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
v4.34.1 of the library. For the original code, see the following URL:
https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.

95091e15

13 Dec, 2023 1 commit
- Dev version · 3ed3e319
  Lysandre authored Dec 13, 2023
  
  3ed3e319
27 Nov, 2023 1 commit

docs: replace torch.distributed.run by torchrun (#27528) · ce315081

Peter Pan authored Nov 28, 2023



* docs: replace torch.distributed.run by torchrun

 `transformers` now officially support pytorch >= 1.10.
 The entrypoint `torchrun`` is present from 1.10 onwards.
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>

* Update src/transformers/trainer.py

with @ArthurZucker's suggestion
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Signed-off-by: Peter Pan <Peter.Pan@daocloud.io>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

ce315081

17 Nov, 2023 1 commit
- Broken links fixed related to datasets docs (#27569) · ffbcfc01
  V.Prasanna kumar authored Nov 18, 2023
```
fixed the broken links belogs to dataset library of transformers
```
  ffbcfc01
02 Nov, 2023 1 commit
- Dev version · bc78fd12
  Lysandre authored Nov 02, 2023
  
  bc78fd12
31 Oct, 2023 1 commit
- Unify warning styles for better readability (#27184) · 25e6e941
  Dong-geon Lee authored Nov 01, 2023
  
  25e6e941
27 Oct, 2023 1 commit
- Provide alternative when warning on use_auth_token (#27105) · 66b088fa
  Lucain authored Oct 27, 2023
  
  66b088fa
12 Oct, 2023 1 commit
- Add many missing spaces in adjacent strings (#26751) · 40ea9ab2
  Tom Aarsen authored Oct 12, 2023
```
Add missing spaces in adjacent strings
```
  40ea9ab2
03 Oct, 2023 1 commit
- v4.35.0.dev0 · bd620591
  Lysandre authored Oct 03, 2023
  
  bd620591
05 Sep, 2023 1 commit
- Fix typo (#25966) · 404ff8fc
  Susnato Dhar authored Sep 05, 2023
```
* Update feature_extraction_clap.py

* changed all lenght to length
```
  404ff8fc
04 Sep, 2023 1 commit
- v4.34.dev.0 · d8e13b3e
  Lysandre authored Sep 04, 2023
  
  d8e13b3e
21 Aug, 2023 1 commit
- v4.33.0.dev0 · 5c67682b
  Sylvain Gugger authored Aug 21, 2023
  
  5c67682b
07 Aug, 2023 1 commit

Allow `trust_remote_code` in example scripts (#25248) · 14510938

Jackmin801 authored Aug 07, 2023

* pytorch examples

* pytorch mim no trainer

* cookiecutter

* flax examples

* missed line in pytorch run_glue

* tensorflow examples

* tensorflow run_clip

* tensorflow run_mlm

* tensorflow run_ner

* tensorflow run_clm

* pytorch example from_configs

* pytorch no trainer examples

* Revert "tensorflow run_clip"

This reverts commit 261f86ac1f1c9e05dd3fd0291e1a1f8e573781d5.

* fix: duplicated argument

14510938

02 Aug, 2023 1 commit

Add `token` arugment in example scripts (#25172) · 149cb0cc

Yih-Dar authored Aug 02, 2023



* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

149cb0cc

28 Jul, 2023 1 commit

Update `use_auth_token` -> `token` in example scripts (#25167) · d53b8ad7

Yih-Dar authored Jul 28, 2023



* pytorch examples

* tensorflow examples

* flax examples

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d53b8ad7

20 Jul, 2023 1 commit
- Change logic for logging in the examples (#24956) · aa1b09c5
  Zach Mueller authored Jul 20, 2023
```
Change logic
```
  aa1b09c5
17 Jul, 2023 1 commit
- 4.32.0.dev0 · e9ad5130
  Sylvain Gugger authored Jul 17, 2023
  
  e9ad5130
14 Jun, 2023 1 commit

Add MMS CTC Fine-Tuning (#24281) · 1609a436

Patrick von Platen authored Jun 15, 2023

* Add mms ctc fine tuning

* make style

* More fixes that are needed

* make fix-copies

* make draft for README

* add new file

* move to new file

* make style

* make style

* add quick test

* make style

* make style

1609a436

07 Jun, 2023 1 commit
- v4.31.0.dev0 · ba695c1e
  Sylvain Gugger authored Jun 07, 2023
  
  ba695c1e
10 May, 2023 1 commit
- CTC example: updated trainer parameters to save tokenizer (#23243) · 91f4c84a
  Maria Khalusova authored May 10, 2023
```
trainer parameters changed to save tokenizer in addition to feature_extractor
```
  91f4c84a
09 May, 2023 1 commit
- v4.30.0.dev0 · a0c0a782
  Sylvain Gugger authored May 09, 2023
  
  a0c0a782
13 Apr, 2023 1 commit
- v4.29.0.dev0 · 888c4a2a
  Sylvain Gugger authored Apr 12, 2023
  
  888c4a2a
05 Apr, 2023 1 commit

Sync preprocesses before loading the processor at run_speech_recognition_ctc.py (#21926) · d5239bab

Mikel Penagarikano authored Apr 05, 2023

* Update run_speech_recognition_ctc.py

Make sure all processes wait until data is saved before loading the processor from the output_dit

* Make sure all processes wait until data is saved before loading the processor from the output_dit

* Update run_speech_recognition_ctc.py

* Update run_speech_recognition_seq2seq.py

d5239bab

14 Mar, 2023 1 commit
- v4.28.0.dev0 · ebdb185b
  Sylvain Gugger authored Mar 14, 2023
  
  ebdb185b
08 Mar, 2023 1 commit

[examples/speech-recognition] Add SpecAugment to run_speech_recognition_seq2seq.py (#21942) · 6192549c

bofeng huang authored Mar 08, 2023



* Add specaugment to run_speech_recognition_seq2seq.py

* Remove useless argument: text_column

* Fix quality

* Update return_attention_mask condition

* Update specaugment arguments only for whisper models

* Remove SpecAugment arguments from ModelArguments, only leave default values for simplicity

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update apply_spec_augment only for whisper models

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Rename return_attention_mask to forward_attention_mask to avoid confusion with wav2vec2 models

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

6192549c

22 Feb, 2023 1 commit
- Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
  Aaron Gokaslan authored Feb 22, 2023
  
  5e8c8eb5
09 Feb, 2023 1 commit

fix typo in run_speech_recognition_ctc.py (#21528) · b31cee67

lee1jun authored Feb 09, 2023

Update run_speech_recognition_ctc.py

There should be `# limitations under the License` line at the end of the documentation section.

b31cee67

06 Feb, 2023 1 commit

Update quality tooling for formatting (#21480) · 6f79d264

Sylvain Gugger authored Feb 06, 2023

* Result of black 23.1

* Update target to Python 3.7

* Switch flake8 to ruff

* Configure isort

* Configure isort

* Apply isort with line limit

* Put the right black version

* adapt black in check copies

* Fix copies

6f79d264

23 Jan, 2023 1 commit
- v4.27.0.dev0 · 7119bb05
  Sylvain Gugger authored Jan 23, 2023
  
  7119bb05
07 Dec, 2022 1 commit
- run_speech_recognition_seq2seq.py: add cache_dir param to dataset (#20540) · 0526a075
  Emmanuel Schmidbauer authored Dec 07, 2022
  
  0526a075
06 Dec, 2022 1 commit
- Fix link to speech encoder decoder model in speech recognition readme (#20633) · f821bea0
  Francisco Kurucz authored Dec 06, 2022
  
  f821bea0
01 Dec, 2022 1 commit
- v4.26.0.dev0 · 60d1f31b
  Sylvain Gugger authored Dec 01, 2022
  
  60d1f31b
18 Nov, 2022 1 commit
- [ASR Examples] Update README for Whisper (#20230) · c29a2f7c
  Sanchit Gandhi authored Nov 18, 2022
```
* [ASR Examples] Update README for seq2seq

* add language info

* add training results

* re-word
```
  c29a2f7c
14 Nov, 2022 1 commit

[Examples] Generalise Seq2Seq ASR to handle Whisper (#19519) · af1a7c8c

Sanchit Gandhi authored Nov 14, 2022

* merge conflicts

* bos and eos in datacollator

* (temp) hardcode removal of attention mask

* freeze encoder

* actually freeze encoder

* set max length / num beams according to gen kwargs

* (temp) fix tests

* don't pop attn mask

* override return attention mask config from Hub

* Hub configs updated 🤗

* final fixes

* update type annotations

* backward comp

af1a7c8c

04 Nov, 2022 1 commit
- Update README.md (#20063) · 3502c202
  bhuang authored Nov 04, 2022
  
  3502c202