Commits · 5aaf6e1460861ce86a285219c57a743de6cf481d · chenpangpang / transformers

21 Mar, 2021 3 commits

small improvements for wav2vec2 info script (#10829) · 5aaf6e14
Patrick von Platen authored Mar 21, 2021

5aaf6e14

Add new community notebook - wav2vec2 with GPT (#10794) · be87b842

Eric Lam authored Mar 21, 2021



* Add new community notebook - wav2vec2 with GPT

* Update:community.md, new nb add
* feat: notebook of wav2vec xlsr ctc decoding with gpt logit adjustment
* Update: Wav2vec2 CTC decoding with gpt2 adjustment

* Update docs/source/community.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>

be87b842

add doc for Local machine (#10828) · 68b55885
Suraj Patil authored Mar 21, 2021

68b55885

19 Mar, 2021 9 commits

Sort init import (#10801) · 21e86f99

Sylvain Gugger authored Mar 19, 2021



* Initial script

* Add script to properly sort imports in init.

* Add to the CI

* Update utils/custom_init_isort.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Separate scripts that change content from quality

* Move class_mapping_update to style_checks
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

21e86f99

wav2vec doc tweaks (#10808) · 1438c487
Julien Chaumond authored Mar 19, 2021
```
* wording/typos tweaks

* Make model upload instructions simpler
```
1438c487
Update FINE_TUNE_XLSR_WAV2VEC2.md · b9570a81
Patrick von Platen authored Mar 19, 2021

b9570a81

Add transformers id to hub requests (#10811) · f2b744f6

Philipp Schmid authored Mar 19, 2021

* add uuid.hext to user_agent

* add log

* changed order of it

* renamed as session id

* renamed variable

* reverted naming of the const

f2b744f6

Expand a bit the presentation of examples (#10799) · 946400fb

Sylvain Gugger authored Mar 19, 2021



* Expand a bit the presentation of examples

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

946400fb

[Example] Updating Question Answering examples for Predict Stage (#10792) · fd1d9f1a
Bhadresh Savani authored Mar 19, 2021
```
* added prediction stage and eval fix

* style correction

* removed extra lines
```
fd1d9f1a
[XLSR-Wav2Vec2 Info doc] Add a couple of lines (#10806) · e8968bd0
Patrick von Platen authored Mar 19, 2021
```
* finish

* fix

* fix

* fix

* fix
```
e8968bd0

fix backend tokenizer args override: key mismatch (#10686) · 117dba99

Théo Matussière authored Mar 19, 2021



* fix backend tokenizer args override: key mismatch

* no touching the docs

* fix mpnet

* add mpnet to test

* fix test
Co-authored-by: theo <theo@matussie.re>

117dba99

addressing vulnerability report in research project deps (#10802) · 427ea3fe

Stas Bekman authored Mar 18, 2021

Following up on a security alert:
https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/Pillow/open

427ea3fe

18 Mar, 2021 14 commits

Update FINE_TUNE_XLSR_WAV2VEC2.md · 2ae67822
Patrick von Platen authored Mar 19, 2021

2ae67822
Update FINE_TUNE_XLSR_WAV2VEC2.md · 68a32159
Patrick von Platen authored Mar 19, 2021

68a32159
Update FINE_TUNE_XLSR_WAV2VEC2.md · 03df3fbc
Patrick von Platen authored Mar 19, 2021

03df3fbc

Add XLSR-Wav2Vec2 Fine-Tuning README.md (#10786) · e84adbed

Patrick von Platen authored Mar 19, 2021

* upload

* upload fine-tuning script

* improve

* adapt

* Apply suggestions from code review

* correct

* upload

* finalize

* remove @

* correct typos

e84adbed

Document v4.4.2 · dcebe254
Sylvain Gugger authored Mar 18, 2021

dcebe254
Fix distributed evaluation (#10795) · 008672e6
Sylvain Gugger authored Mar 18, 2021
```
* Fix distributed evaluation

* Use logger
```
008672e6

[examples/seq2seq/README.md] fix t5 examples (#10734) · 9352b515

Stas Bekman authored Mar 18, 2021

* [examples/seq2seq] fix t5 examples

This PR:
* fixes T5 examples to include `--source_prefix` - it's **not** optional. If you give it a try you will see that you get 10x worse bleu scores w/o it. w/ `27.6849`, w/ `2.374`
* added a normal translation example w/o the peculiarities of MBart and T5
* reduces the default max samples to 50 so it's much faster to test quickly

summarization seems to be broken for t5 score-wise: https://github.com/huggingface/transformers/issues/10733

@sgugger

* specify explicitly the t5 models requiring the special handling

* one more

* update the t5 summarization example to use cnn_dailymail

* move max*samples into the top level README.md

* better wording

* better wording

9352b515

from_pretrained: check that the pretrained model is for the right model architecture (#10586) · 094afa51

Vimarsh Chaturvedi authored Mar 18, 2021



* Added check to ensure model name passed to from_pretrained and model are the same

* Added test to check from_pretrained throws assert error when passed an incompatiable model name

* Modified assert in from_pretrained with f-strings. Modified test to ensure desired assert message is being generated

* Added check to ensure config and model has model_type

* Fix FlauBERT heads

Co-authored-by: vimarsh chaturvedi <vimarsh chaturvedi>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

094afa51

[file_utils] do not gobble certain kinds of requests.ConnectionError (#10235) · 4f3e93cf

Julien Chaumond authored Mar 18, 2021



* do not gobble certain kinds of requests.ConnectionError

* Apply review comments
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

4f3e93cf

Fix bug in input check for LengthGroupSampler (#10783) · ce9724e1

James Thomin authored Mar 18, 2021

This commit fixes a bug in the LengthGroupSampler where if
model_input_name is not set, the default value is None instead of
"input_ids"

ce9724e1

add run_common_voice script (#10767) · 5f19c07a

Suraj Patil authored Mar 18, 2021

* add initial script

* finish script

* add shell script example

* accept chars_to_ignor as cl arg

* align the script with other example scripts

* add torchaudio dep

5f19c07a

wav2vec2: support datasets other than LibriSpeech (#10581) · af8afdc8

Mohamed El-Geish authored Mar 18, 2021

* wav2vec2: support datasets other than LibriSpeech

* Formatting run_asr.py to pass code quality test

* bundled orthography options and added verbose logs

* fixing a typo in timit fine-tuning script

* update comment for clarity

* resize_lm_head and load custom vocab from file

* adding a max_duration_in_seconds filter

* do not assign `duration_filter` lambda, use a def

* log untransliterated text as well

* fix base model for arabic

* fix duration filter when target_sr is not set

* drop duration_in_seconds when unneeded

* script for wav2vec2-large-lv60-timit-asr

* fix for "tha" in arabic corpus (huggingface#10581)

* adding more options to work with common_voice

* PR feedback (huggingface#10581)

* small README change

af8afdc8

[Flax] Adapt Flax models to new structure (#9484) · 0b98ca36

Patrick von Platen authored Mar 18, 2021



* Create modeling_flax_eletra with code copied from modeling_flax_bert

* Add ElectraForMaskedLM and ElectraForPretraining

* Add modeling test for Flax electra and fix naming and arg in Flax Electra model

* Add documentation

* Fix code style

* Create modeling_flax_eletra with code copied from modeling_flax_bert

* Add ElectraForMaskedLM and ElectraForPretraining

* Add modeling test for Flax electra and fix naming and arg in Flax Electra model

* Add documentation

* Fix code style

* Fix code quality

* Adjust tol in assert_almost_equal due to very small difference between model output, ranging 0.0010 - 0.0016

* Remove redundant ElectraPooler

* save intermediate

* adapt

* correct bert flax design

* adapt roberta as well

* finish roberta flax

* finish

* apply suggestions

* apply suggestions
Co-authored-by: Chris Nguyen <anhtu2687@gmail.com>

0b98ca36

Add support for detecting intel-tensorflow version (#10781) · 5c0bf397
Funtowicz Morgan authored Mar 18, 2021
```
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
```
5c0bf397

17 Mar, 2021 11 commits

Smmp batch not divisible by microbatches fix (#10778) · 0282e24e

Mansi Mane authored Mar 17, 2021



* Added debug prints

* Added config

* Added prints

* Added prints

* Added extra samples to SequentialDistributedSampler

* Added extra samples to SequentialDistributedSampler

Updated SequentialDistributedSampler call

* Added deubg prints

* Removed extra prints

* Making predicitons and labels multiple of batchsize

* updated number of microbatches

* Removed extra prints

* Made start_remainder similar to DistributedSamplerWithLoop

* Minor spacing update

* Added debug prints

Added config

Added prints

Added prints

* Added extra samples to SequentialDistributedSampler

Updated SequentialDistributedSampler call

Added extra samples to SequentialDistributedSampler

Added deubg prints

Removed extra prints

Making predicitons and labels multiple of batchsize

updated number of microbatches

Removed extra prints

Squashing redundant commits

* Made start_remainder similar to DistributedSamplerWithLoop

Minor spacing update

Made start_remainder similar to DistributedSamplerWithLoop

* Test and styling

* Rename test
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

0282e24e

Check copies blackify (#10775) · 40b049c7

Sylvain Gugger authored Mar 17, 2021

* Apply black before checking copies

* Fix for class methods

* Deal with lonely brackets

* Remove debug and add forward changes

* Separate copies and fix test

* Add black as a test dependency

40b049c7

[examples] document resuming (#10776) · 39373919

Stas Bekman authored Mar 17, 2021



* document resuming in examples

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* put trainer code last, adjust notes
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

39373919

[Issue template] need to update/extend who to tag (#10728) · 85a114ef

Stas Bekman authored Mar 17, 2021

* [Issue template] need to update/extend who to tag

1. need to update who to tag for `tensorflow`
2. also requesting to add someone to tag for models hub issues - perhaps separate sub-entries for UI and code - e.g. I don't know who to tag for broken models: https://github.com/huggingface/transformers/issues/10726

Thanks.

* model hub instructions

* s/jplu/LysandreJik/

85a114ef

make failure to find a resume checkpoint fatal + tests (#10777) · 3318c246
Stas Bekman authored Mar 17, 2021

3318c246
[DeepSpeed] improve checkpoint loading code plus tests (#10760) · cd8c93f7
Stas Bekman authored Mar 17, 2021
```
* deepspeed checkpoint loading code plus tests

* style

* style
```
cd8c93f7
[DeepSpeed] simplify init (#10762) · 01c7fb04
Stas Bekman authored Mar 17, 2021

01c7fb04
small improvements (#10773) · 0486ccdd
Patrick von Platen authored Mar 17, 2021

0486ccdd
Fix URLs · d7e0d59b
Sylvain Gugger authored Mar 17, 2021

d7e0d59b

[doc] [testing] extend the pytest -k section with more examples (#10761) · 8715d20c

Stas Bekman authored Mar 17, 2021

* [doc] [testing] extend -k section

This PR adds more examples on using `pytest -k` - I always forget that I want to use `-k A OR B` when I want several tests - I keep trying AND and it doesn't match any.

* style

8715d20c

up (#10771) · f20d75a1
Patrick von Platen authored Mar 17, 2021

f20d75a1

16 Mar, 2021 3 commits

[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464) · c83fbc5f

Cheng Li authored Mar 16, 2021



* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* update

* make init_deepspeed support config dict

* fix docstring formatting

* clean up trainer's comments

* add new tests

* fix type

* composit argparse doesn't work

* style

* add a new test, rename others

* document new functionality

* complete tests, add docs

* style

* correct level

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add new methods to the doc

* must tell DS we are using a non-native optimizer

* add protection against cpu_offload + HF optimizer combo

* fix the cli overrides

* sync docs + tests

* restore AdamW

* better docs

* need new version

* no longer needed

* remove outdate information

* refactor duplicated code
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c83fbc5f

Patches full import failure when sentencepiece is not installed (#10752) · c2324844
Lysandre Debut authored Mar 16, 2021
```
* Patches full import failure when sentencepiece is not installed

* Dummies :)
```
c2324844
Docs for v4.4.1 · 73fe4089
Lysandre authored Mar 16, 2021

73fe4089