Commits · 372ab9cd6d07a0a9dc4b181b1cce3331be3c5cc1 · chenpangpang / transformers

09 Jun, 2021 3 commits

rm require_version_examples (#12088) · 61e19198
Stas Bekman authored Jun 09, 2021

61e19198

Wav2Vec2 Pretraining (#11306) · d472bd7b

Anton Lozhkov authored Jun 09, 2021



* Working quantizer forward

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Working quantizer forward

* Clean up unused model parts, test reproducibility

* Remove custom outputs from the shared ones

* correct conversion

* correct bug

* add first pretrain script

* save intermediate

* static shapes

* save intermediate

* finish first pretrain script version

* more refactor

* remove wanddb

* refactor more

* improve test

* correct perplexity compute bug

* finish model implementation

* add to docs

* finish docs

* finish pretraining script

* finish pretraining script

* remove wandb

* finish PR for merge

* finish config

* finish

* make deepspeed work

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply suggestions

* fix flaky test
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d472bd7b

sync LayerDrop for Wav2Vec2Encoder + tests (#12076) · d14e0af2
Stas Bekman authored Jun 09, 2021

d14e0af2

08 Jun, 2021 3 commits

[Deepspeed Wav2vec2] integration (#11638) · 11d86d3d

Stas Bekman authored Jun 08, 2021

* wip

* wip - but working with https://github.com/microsoft/DeepSpeed/pull/1044

* cleanup

* workaround

* working 5/8 modes

* solve fp32 distributed zero3

* style

* sync

* sync

* rework

* deprecation

* cleanup

* https://github.com/microsoft/DeepSpeed/pull/1044

 pr was merged

* clean up

* add a guide

* more prose

* more prose

* fix

* more prose

* sub_group_size was too big

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* refactor

* bug fix

* make the true check explicit

* new deepspeed release
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

11d86d3d

Replace legacy tensor.Tensor with torch.tensor/torch.empty (#12027) · f5eec0d8
Mario Šaško authored Jun 08, 2021
```
* Replace legacy torch.Tensor constructor with torch.{tensor, empty}

* Remove torch.Tensor in examples
```
f5eec0d8

updated the original RAG implementation to be compatible with latest Pytorch-Lightning (#11806) · e33085d6

Shamane Siri authored Jun 09, 2021

* updated the original RAG implementation to be compatible with the latest PL version

* updated the requirements.txt file

* execute make style

* code quality test

* code quality

* conflix resolved in requirement.txt

* code quality

* changed the MyDDP class name to CustomDDP

e33085d6

02 Jun, 2021 1 commit

Bump urllib3 from 1.25.8 to 1.26.5 in /examples/research_projects/lxmert (#11983) · 6db3a87d

dependabot[bot] authored Jun 02, 2021

Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/1.25.8...1.26.5

)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

6db3a87d

01 Jun, 2021 1 commit

RAG-2nd2end-revamp (#11893) · 9ec0f01b

Shamane Siri authored Jun 01, 2021



* initial

* code quality test

* code quality

* added test functions in test_modeling_rag.py and test_retrieval_rag.py to test end2end retreiver

* minor change in test_modeling_rag

* fixed tests

* Update examples/research_projects/rag-end2end-retriever/README.md

typo corrected as suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update examples/research_projects/rag-end2end-retriever/finetune_rag.py

type change suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update src/transformers/models/rag/retrieval_rag.py

Adding this change as mentioned by lhoestq.
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* completed the minor changes suggested by the reviewers
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

9ec0f01b

12 May, 2021 1 commit
- remove defaults to None if optional (#11703) · 77f4c46b
  Philip May authored May 12, 2021
  
  77f4c46b
10 May, 2021 2 commits
- Update requirements.txt (#11634) · 1a0b4178
  Quentin Lhoest authored May 10, 2021
  
  1a0b4178
- [Examples] Fix invalid links after reorg (#11650) · 7e406f4a
  Tommy Chiang authored May 10, 2021
  
  7e406f4a
30 Apr, 2021 1 commit
- Update README.md (#11489) · 58c789e3
  Manuel Romero authored Apr 30, 2021
```
Add link to code
```
  58c789e3
26 Apr, 2021 2 commits

Variable Correction for Consistency in Distillation Example (#11444) · 0661abc5

Jaimeen Ahn authored Apr 27, 2021

As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively,  the correction makes the example work

0661abc5

make style (#11442) · 32dbb2d9
Patrick von Platen authored Apr 26, 2021

32dbb2d9

14 Apr, 2021 2 commits
- Close open files to suppress ResourceWarning (#11240) · f25444cb
  Sudharsan S T authored Apr 14, 2021
```
Co-authored-by: Sudharsan Thirumalai <sudharsan.t@sprinklr.com>
```
  f25444cb
- Save the Wav2Vec2 processor before training starts (#10910) · 653076ca
  Nithin Holla authored Apr 14, 2021
```
Co-authored-by: nithin19 <nithin@amberscript.com>
```
  653076ca
07 Apr, 2021 1 commit
- fix: The 'warn' method is deprecated (#11105) · c9035e45
  Stas Bekman authored Apr 07, 2021
```
* The 'warn' method is deprecated

* fix test
```
  c9035e45
05 Apr, 2021 1 commit
- s|Pretrained|PreTrained| (#11048) · 3d39226a
  Stas Bekman authored Apr 04, 2021
  
  3d39226a
02 Apr, 2021 1 commit
- fixed typo: logging instead of logger (#11025) · 335c0ca3
  versis authored Apr 02, 2021
  
  335c0ca3
30 Mar, 2021 1 commit
- fix md file to avoid evaluation crash (#10962) · e031162a
  Yih-Dar authored Mar 30, 2021
  
  e031162a
29 Mar, 2021 1 commit

[vulnerability] dep fix (#10954) · 05c966f2

Stas Bekman authored Mar 29, 2021

Fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pygments/open

@LysandreJik

05c966f2

26 Mar, 2021 1 commit

[vulnerability] fix dependency (#10914) · 3c27d246

Stas Bekman authored Mar 26, 2021

this PR fixes https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/PyYAML/open

3c27d246

22 Mar, 2021 4 commits

[vulnerability] in example deps fix (#10817) · 8fb46718

Stas Bekman authored Mar 22, 2021

Takes care of:
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/jinja2/open



@LysandreJik
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

8fb46718

Bump jinja2 from 2.11.2 to 2.11.3 in /examples/research_projects/lxmert (#10818) · dbfe3795

dependabot[bot] authored Mar 22, 2021

Bumps [jinja2](https://github.com/pallets/jinja) from 2.11.2 to 2.11.3.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/master/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/2.11.2...2.11.3

)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

dbfe3795

Update FINE_TUNE_XLSR_WAV2VEC2.md (#10849) · 29904a96
Qiushi Pan authored Mar 22, 2021
```
Fix typo.
```
29904a96
push (#10846) · 0f226f78
Patrick von Platen authored Mar 22, 2021

0f226f78

21 Mar, 2021 4 commits
- Update FINE_TUNE_XLSR_WAV2VEC2.md · 82b8d8c7
  Suraj Patil authored Mar 21, 2021
  
  82b8d8c7
- Update FINE_TUNE_XLSR_WAV2VEC2.md · af6125ff
  Patrick von Platen authored Mar 21, 2021
  
  af6125ff
- small improvements for wav2vec2 info script (#10829) · 5aaf6e14
  Patrick von Platen authored Mar 21, 2021
  
  5aaf6e14
- add doc for Local machine (#10828) · 68b55885
  Suraj Patil authored Mar 21, 2021
  
  68b55885
19 Mar, 2021 4 commits
- wav2vec doc tweaks (#10808) · 1438c487
  Julien Chaumond authored Mar 19, 2021
```
* wording/typos tweaks

* Make model upload instructions simpler
```
  1438c487
- Update FINE_TUNE_XLSR_WAV2VEC2.md · b9570a81
  Patrick von Platen authored Mar 19, 2021
  
  b9570a81
- [XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806) · e8968bd0
  Patrick von Platen authored Mar 19, 2021
```
* finish

* fix

* fix

* fix

* fix
```
  e8968bd0
- addressing vulnerability report in research project deps (#10802) · 427ea3fe
  Stas Bekman authored Mar 18, 2021
```
Following up on a security alert:
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pillow/open
```
  427ea3fe
18 Mar, 2021 6 commits

Update FINE_TUNE_XLSR_WAV2VEC2.md · 2ae67822
Patrick von Platen authored Mar 19, 2021

2ae67822
Update FINE_TUNE_XLSR_WAV2VEC2.md · 68a32159
Patrick von Platen authored Mar 19, 2021

68a32159
Update FINE_TUNE_XLSR_WAV2VEC2.md · 03df3fbc
Patrick von Platen authored Mar 19, 2021

03df3fbc

Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786) · e84adbed

Patrick von Platen authored Mar 19, 2021

* upload

* upload fine-tuning script

* improve

* adapt

* Apply suggestions from code review

* correct

* upload

* finalize

* remove @

* correct typos

e84adbed

add run_common_voice script (#10767) · 5f19c07a

Suraj Patil authored Mar 18, 2021

* add initial script

* finish script

* add shell script example

* accept chars_to_ignor as cl arg

* align the script with other example scripts

* add torchaudio dep

5f19c07a

wav2vec2: support datasets other than LibriSpeech (#10581) · af8afdc8

Mohamed El-Geish authored Mar 18, 2021

* wav2vec2: support datasets other than LibriSpeech

* Formatting run_asr.py to pass code quality test

* bundled orthography options and added verbose logs

* fixing a typo in timit fine-tuning script

* update comment for clarity

* resize_lm_head and load custom vocab from file

* adding a max_duration_in_seconds filter

* do not assign `duration_filter` lambda, use a def

* log untransliterated text as well

* fix base model for arabic

* fix duration filter when target_sr is not set

* drop duration_in_seconds when unneeded

* script for wav2vec2-large-lv60-timit-asr

* fix for "tha" in arabic corpus (huggingface#10581)

* adding more options to work with common_voice

* PR feedback (huggingface#10581)

* small README change

af8afdc8