Commits · eca77f4719531ecaabe9ec6b2dee6075a391d98a · chenpangpang / transformers

23 Mar, 2022 1 commit

Updates the default branch from master to main (#16326) · eca77f47

Lysandre Debut authored Mar 23, 2022



* Updates the default branch from master to main

* Links from `master` to `main`

* Typo

* Update examples/flax/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

12 Mar, 2022 1 commit

[Deepspeed] add support for bf16 mode (#14569) · 580dd87c

Stas Bekman authored Mar 11, 2022



* [WIP] add support for bf16 mode

* prep for bf16

* prep for bf16

* fix; zero2/bf16 is ok

* check bf16 is available

* test fixes

* enable zero3_bf16

* config files

* docs

* split stage_dtype; merge back to non-dtype-specific config file

* fix doc

* cleanup

* cleanup

* bfloat16 => bf16 to match the PR changes

* s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/

* test fixes/skipping

* move

* fix

* Update docs/source/main_classes/deepspeed.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* backticks

* cleanup

* cleanup

* cleanup

* new version

* add note about grad accum in bf16
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

09 Feb, 2022 1 commit
- Upgrade black to version ~=22.0 (#15565) · 7732d0fe
  Lysandre Debut authored Feb 09, 2022
```
* Upgrade black to version ~=22.0

* Check copies

* Fix code
```
10 Jan, 2022 1 commit

[Wav2Vec2 Speech Event] Add speech event v2 (#15083) · d72343d2

Patrick von Platen authored Jan 10, 2022

* up

* up

* up

* up

* up

* up

* improve

* up

* up

* Update src/transformers/trainer.py

* up

* up

* up

11 Nov, 2021 1 commit
- fix --gradient_checkpointing (#13964) · 77262ef7
  Stas Bekman authored Nov 11, 2021
  
22 Oct, 2021 1 commit
- Add missing --validation_split_percentage data args (#14119) · 05a2afc2
  Antonio Carlos Falcão Petri authored Oct 22, 2021
  
14 Oct, 2021 1 commit
- up (#14008) · 7fb2a8b3
  Patrick von Platen authored Oct 14, 2021
  
08 Aug, 2021 1 commit
- Update README.md · 24cbf6bc
  Patrick von Platen authored Aug 08, 2021
  
30 Jul, 2021 1 commit

fix typo in gradient_checkpointing arg (#12855) · 5c673efa

21jun authored Jul 30, 2021

help for `ModelArguments.gradient_checkpointing` should be
"If True, use gradient checkpointing to save memory
at the expense of slower backward pass."
not "Whether to freeze the feature extractor layers of the model."
(which is duplicated from `freeze_feature_extractor` arg)
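
A minimal sketch of the corrected arguments, assuming the example script declares them as dataclass fields with `help` metadata. The help strings come from the commit message above; the defaults and the `duplicated_help` helper are hypothetical and only illustrate how a copy-pasted help text like this one could be caught:

```python
from dataclasses import dataclass, field, fields
from typing import Optional


@dataclass
class ModelArguments:
    # Defaults here are illustrative assumptions, not taken from the commit.
    freeze_feature_extractor: Optional[bool] = field(
        default=True,
        metadata={"help": "Whether to freeze the feature extractor layers of the model."},
    )
    gradient_checkpointing: Optional[bool] = field(
        default=False,
        metadata={
            "help": "If True, use gradient checkpointing to save memory "
            "at the expense of slower backward pass."
        },
    )


def help_texts(cls):
    """Map each dataclass field name to its help string."""
    return {f.name: f.metadata.get("help", "") for f in fields(cls)}


def duplicated_help(cls):
    """Return (first_field, later_field) pairs that share the same help string."""
    seen = {}
    dupes = []
    for name, text in help_texts(cls).items():
        if text in seen:
            dupes.append((seen[text], name))
        else:
            seen[text] = name
    return dupes
```

With the fix applied, `duplicated_help(ModelArguments)` returns an empty list; before the fix, both fields would have carried the `freeze_feature_extractor` text.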

23 Jul, 2021 1 commit
- [tests] fix logging_steps requirements (#12860) · 98364ea7
  Stas Bekman authored Jul 23, 2021
  
15 Jul, 2021 1 commit

[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748) · 2e9fb13f

Patrick von Platen authored Jul 15, 2021



* fix_torch_device_generate_test

* remove @

* start adding tests

* correct wav2vec2 pretraining

* up

* up
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

25 Jun, 2021 1 commit
- remove extra white space from log format (#12360) · 4a872cae
  Stas Bekman authored Jun 25, 2021
  
14 Jun, 2021 1 commit
- [style] consistent nn. and nn.functional: part 4 `examples` (#12156) · 88e84186
  Stas Bekman authored Jun 14, 2021
```
* consistent nn. and nn.functional: p4 examples

* restore
```
09 Jun, 2021 2 commits

Wav2Vec2 Pretraining (#11306) · d472bd7b

Anton Lozhkov authored Jun 09, 2021



* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wandb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

sync LayerDrop for Wav2Vec2Encoder + tests (#12076) · d14e0af2
Stas Bekman authored Jun 09, 2021

08 Jun, 2021 1 commit

[Deepspeed Wav2vec2] integration (#11638) · 11d86d3d

Stas Bekman authored Jun 08, 2021

* wip

* wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044

* cleanup

* workaround

* working 5/8 modes

* solve fp32 distributed zero3

* style

* sync

* sync

* rework

* deprecation

* cleanup

* https://github.com/microsoft/DeepSpeed/pull/1044 pr was merged

* clean up

* add a guide

* more prose

* more prose

* fix

* more prose

* sub_group_size was too big

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor

* bug fix

* make the true check explicit

* new deepspeed release
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

12 May, 2021 1 commit
- remove defaults to None if optional (#11703) · 77f4c46b
  Philip May authored May 12, 2021
  
14 Apr, 2021 1 commit
- Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
  Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```
30 Mar, 2021 1 commit
- fix md file to avoid evaluation crash (#10962) · e031162a
  Yih-Dar authored Mar 30, 2021
  
22 Mar, 2021 2 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849) · 29904a96
  Qiushi Pan authored Mar 22, 2021
```
Fix typo.
```
- push (#10846) · 0f226f78
  Patrick von Platen authored Mar 22, 2021
  
21 Mar, 2021 4 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 82b8d8c7
  Suraj Patil authored Mar 21, 2021
  
- Update FINE_TUNE_XLSR_WAV2VEC2.md · af6125ff
  Patrick von Platen authored Mar 21, 2021
  
- small improvements for wav2vec2 info script (#10829) · 5aaf6e14
  Patrick von Platen authored Mar 21, 2021
  
- add doc for Local machine (#10828) · 68b55885
  Suraj Patil authored Mar 21, 2021
  
19 Mar, 2021 3 commits
- wav2vec doc tweaks (#10808) · 1438c487
  Julien Chaumond authored Mar 19, 2021
```
* wording/typos tweaks

* Make model upload instructions simpler
```
- Update FINE_TUNE_XLSR_WAV2VEC2.md · b9570a81
  Patrick von Platen authored Mar 19, 2021
  
- [XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806) · e8968bd0
  Patrick von Platen authored Mar 19, 2021
```
* finish

* fix

* fix

* fix

* fix
```
18 Mar, 2021 6 commits

Update FINE_TUNE_XLSR_WAV2VEC2.md · 2ae67822
Patrick von Platen authored Mar 19, 2021

Update FINE_TUNE_XLSR_WAV2VEC2.md · 68a32159
Patrick von Platen authored Mar 19, 2021

Update FINE_TUNE_XLSR_WAV2VEC2.md · 03df3fbc
Patrick von Platen authored Mar 19, 2021

Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786) · e84adbed

Patrick von Platen authored Mar 19, 2021

* upload

* upload fine-tuning script

* improve

* adapt

* Apply suggestions from code review

* correct

* upload

* finalize

* remove @

* correct typos

add run_common_voice script (#10767) · 5f19c07a

Suraj Patil authored Mar 18, 2021

* add initial script

* finish script

* add shell script example

* accept chars_to_ignore as cl arg

* align the script with other example scripts

* add torchaudio dep

wav2vec2: support datasets other than LibriSpeech (#10581) · af8afdc8

Mohamed El-Geish authored Mar 18, 2021

* wav2vec2: support datasets other than LibriSpeech

* Formatting run_asr.py to pass code quality test

* bundled orthography options and added verbose logs

* fixing a typo in timit fine-tuning script

* update comment for clarity

* resize_lm_head and load custom vocab from file

* adding a max_duration_in_seconds filter

* do not assign `duration_filter` lambda, use a def

* log untransliterated text as well

* fix base model for arabic

* fix duration filter when target_sr is not set

* drop duration_in_seconds when unneeded

* script for wav2vec2-large-lv60-timit-asr

* fix for "tha" in arabic corpus (huggingface#10581)

* adding more options to work with common_voice

* PR feedback (huggingface#10581)

* small README change

05 Mar, 2021 1 commit
- fix run seq2seq (#10547) · 395ffcd7
  Patrick von Platen authored Mar 05, 2021
  
01 Mar, 2021 1 commit

Add Fine-Tuning for Wav2Vec2 (#10145) · 0234de84

Patrick von Platen authored Mar 01, 2021



* add encode labels function to tokenizer

* start adding finetuning

* init dropout

* upload

* correct convert script

* apply changes

* fix second typo

* make first dummy training run

* adapt convert script

* push config for comparison

* remove conf

* finish training

* adapt data collator

* add research folder

* update according to fairseq feedback

* some minor corrections

* refactor masking indices a bit

* some minor changes

* clean tokenizer

* finish clean-up

* remove previous logic

* update run script

* correct training

* finish changes

* finish model

* correct bug

* fix training a bit more

* add some tests

* finish gradient checkpointing

* finish example

* correct gradient checkpointing

* improve tokenization method

* revert changes in tokenizer

* revert general change

* adapt fine-tuning

* update

* save intermediate test

* Update README.md

* finish finetuning

* delete conversion script

* Update src/transformers/models/wav2vec2/configuration_wav2vec2.py

* Update src/transformers/models/wav2vec2/processing_wav2vec2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* finish wav2vec2 script

* finish wav2vec2 fine-tuning

* finalize test

* correct test

* adapt tests

* finish

* remove test file
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
