Commits · 9352b5151a5c34507b684adc3f8d1194db62a0d9 · chenpangpang / transformers

18 Mar, 2021 4 commits

[examples/seq2seq/README.md] fix t5 examples (#10734) · 9352b515

Stas Bekman authored Mar 18, 2021

* [examples/seq2seq] fix t5 examples

This PR:
* fixes T5 examples to include `--source_prefix` - it's **not** optional. If you give it a try you will see that you get 10x worse bleu scores w/o it. w/ `27.6849`, w/ `2.374`
* added a normal translation example w/o the peculiarities of MBart and T5
* reduces the default max samples to 50 so it's much faster to test quickly

summarization seems to be broken for t5 score-wise: https://github.com/huggingface/transformers/issues/10733

@sgugger

* specify explicitly the t5 models requiring the special handling

* one more

* update the t5 summarization example to use cnn_dailymail

* move max*samples into the top level README.md

* better wording

* better wording

9352b515

[file_utils] do not gobble certain kinds of requests.ConnectionError (#10235) · 4f3e93cf

Julien Chaumond authored Mar 18, 2021



* do not gobble certain kinds of requests.ConnectionError

* Apply review comments
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

4f3e93cf

add run_common_voice script (#10767) · 5f19c07a

Suraj Patil authored Mar 18, 2021

* add initial script

* finish script

* add shell script example

* accept chars_to_ignor as cl arg

* align the script with other example scripts

* add torchaudio dep

5f19c07a

wav2vec2: support datasets other than LibriSpeech (#10581) · af8afdc8

Mohamed El-Geish authored Mar 18, 2021

* wav2vec2: support datasets other than LibriSpeech

* Formatting run_asr.py to pass code quality test

* bundled orthography options and added verbose logs

* fixing a typo in timit fine-tuning script

* update comment for clarity

* resize_lm_head and load custom vocab from file

* adding a max_duration_in_seconds filter

* do not assign `duration_filter` lambda, use a def

* log untransliterated text as well

* fix base model for arabic

* fix duration filter when target_sr is not set

* drop duration_in_seconds when unneeded

* script for wav2vec2-large-lv60-timit-asr

* fix for "tha" in arabic corpus (huggingface#10581)

* adding more options to work with common_voice

* PR feedback (huggingface#10581)

* small README change

af8afdc8

17 Mar, 2021 2 commits

[examples] document resuming (#10776) · 39373919

Stas Bekman authored Mar 17, 2021



* document resuming in examples

* fix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* put trainer code last, adjust notes
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

39373919

[DeepSpeed] improve checkpoint loading code plus tests (#10760) · cd8c93f7
Stas Bekman authored Mar 17, 2021
```
* deepspeed checkpoint loading code plus tests

* style

* style
```
cd8c93f7

16 Mar, 2021 3 commits

[Deepspeed] Allow HF optimizer and scheduler to be passed to deepspeed (#10464) · c83fbc5f

Cheng Li authored Mar 16, 2021



* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* pass hf optimizer and scheduler to deepspeed if not specified in ds config

* update

* make init_deepspeed support config dict

* fix docstring formatting

* clean up trainer's comments

* add new tests

* fix type

* composit argparse doesn't work

* style

* add a new test, rename others

* document new functionality

* complete tests, add docs

* style

* correct level

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add new methods to the doc

* must tell DS we are using a non-native optimizer

* add protection against cpu_offload + HF optimizer combo

* fix the cli overrides

* sync docs + tests

* restore AdamW

* better docs

* need new version

* no longer needed

* remove outdate information

* refactor duplicated code
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c83fbc5f

Development on v4.5.0dev0 · 1b5ce1e6
Lysandre authored Mar 16, 2021

1b5ce1e6
Release v4.4.0 · c988db5a
Lysandre authored Mar 16, 2021

c988db5a

15 Mar, 2021 4 commits

independent training / eval with local files (#10710) · 87d685b8
Russell Klopfer authored Mar 15, 2021
```
* independent training / eval with local files

* remove redundant assert
```
87d685b8

Add minimum version check in examples (#10724) · 4c379daf

Sylvain Gugger authored Mar 15, 2021

* Add minimum version check in examples

* Style

* No need for new line maybe?

* Add helpful comment

4c379daf

zero-shot pipeline multi_class -> multi_label (#10727) · 966ba081
Joe Davison authored Mar 15, 2021

966ba081

split seq2seq script into summarization & translation (#10611) · 6f840990

Théo Matussière authored Mar 15, 2021



* split seq2seq script, update docs

* needless diff

* fix readme

* remove test diff

* s/summarization/translation
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* cr

* fix arguments & better mbart/t5 refs

* copyright
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* reword readme
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* s/summarization/translation

* short script names

* fix tests

* fix isort, include mbart doc

* delete old script, update tests

* automate source prefix

* automate source prefix for translation

* s/translation/trans
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* fix script name (short version)

* typos
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* exact parameter
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* remove superfluous source_prefix calls in docs

* rename scripts & warn for source prefix

* black

* flake8
Co-authored-by: theo <theo@matussie.re>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

6f840990

12 Mar, 2021 1 commit
- AdamW is now supported by default (#9624) · 4c32f9f2
  Stas Bekman authored Mar 12, 2021
  
  4c32f9f2
11 Mar, 2021 2 commits
- Specify minimum version for sacrebleu (#10662) · 9fbb4cdc
  Lysandre Debut authored Mar 11, 2021
  
  9fbb4cdc
- Update README.md (#10647) · 27d9e05c
  ArvidYin authored Mar 11, 2021
```
correct spell error: 'nether'
```
  27d9e05c
10 Mar, 2021 2 commits
- Add new GLUE example with no Trainer. (#10555) · efb5c0a4
  Sylvain Gugger authored Mar 10, 2021
```
* Add new GLUE example with no Trainer.

* Style

* Address review comments
```
  efb5c0a4
- Fixes an issue in `text-classification` where MNLI eval/test datasets are not... · 6f52fce6
  Allen Wang authored Mar 09, 2021
```
Fixes an issue in `text-classification` where MNLI eval/test datasets are not being preprocessed. (#10621)

* Fix MNLI tests

* Linter fix
```
  6f52fce6
09 Mar, 2021 1 commit
- Fairscale FSDP fix model save (#10596) · 0d909f6b
  Sylvain Gugger authored Mar 09, 2021
```
* Hotfix fairscale FSDP

* Evaluation works

* Save on process zero
```
  0d909f6b
08 Mar, 2021 4 commits
- [examples tests on multigpu] resolving require_torch_non_multi_gpu_but_fix_me (#10561) · f284089e
  Stas Bekman authored Mar 08, 2021
```
* batch 1

* this is tpu

* deebert attempt

* the rest
```
  f284089e
- Added max_sample_ arguments (#10551) · dfd16af8
  Bhadresh Savani authored Mar 09, 2021
```
* reverted changes of logging and saving metrics

* added max_sample arguments

* fixed code

* white space diff

* reformetting code

* reformatted code
```
  dfd16af8
- [examples tests] various fixes (#10584) · 917f1045
  Stas Bekman authored Mar 08, 2021
```
* fix sharded ddp enum

* test fixes

* stronger validation + apex breaks other tests
```
  917f1045
- fix nltk lookup (#10585) · e6ce636e
  Stas Bekman authored Mar 07, 2021
  
  e6ce636e
06 Mar, 2021 1 commit

offline mode for firewalled envs (#10407) · 88a951e3

Stas Bekman authored Mar 05, 2021



* offline mode start

* add specific values

* fix fallback

* add test

* better values check and range

* test that actually works

* document the offline mode

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more strict check

* cleaner test

* pt-only test

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

88a951e3

05 Mar, 2021 1 commit
- fix run seq2seq (#10547) · 395ffcd7
  Patrick von Platen authored Mar 05, 2021
  
  395ffcd7
04 Mar, 2021 3 commits
- Not always consider a local model a checkpoint in run_glue (#10517) · a5bd40b7
  Sylvain Gugger authored Mar 04, 2021
  
  a5bd40b7
- Revert "Not always consider a local model a checkpoint in run_glue" · 745ea78d
  Sylvain Gugger authored Mar 04, 2021
```
This reverts commit f3660613.
```
  745ea78d
- Not always consider a local model a checkpoint in run_glue · f3660613
  Sylvain Gugger authored Mar 04, 2021
  
  f3660613
01 Mar, 2021 1 commit

Add Fine-Tuning for Wav2Vec2 (#10145) · 0234de84

Patrick von Platen authored Mar 01, 2021



* add encode labels function to tokenizer

* start adding finetuning

* init dropout

* upload

* correct convert script

* apply changes

* fix second typo

* make first dummy training run

* adapt convert script

* push confg for comparison

* remove conf

* finish training

* adapt data collator

* add research folder

* update according to fairseq feedback

* some minor corrections

* refactor masking indices a bit

* some minor changes

* clean tokenizer

* finish clean-up

* remove previous logic

* update run script

* correct training

* finish changes

* finish model

* correct bug

* fix training a bit more

* add some tests

* finish gradient checkpointing

* finish example

* correct gradient checkpointing

* improve tokenization method

* revert changes in tokenizer

* revert general change

* adapt fine-tuning

* update

* save intermediate test

* Update README.md

* finish finetuning

* delete conversion script

* Update src/transformers/models/wav2vec2/configuration_wav2vec2.py

* Update src/transformers/models/wav2vec2/processing_wav2vec2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* finish wav2vec2 script

* finish wav2vec2 fine-tuning

* finalize test

* correct test

* adapt tests

* finish

* remove test file
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

0234de84

27 Feb, 2021 3 commits
- updated logging and saving metrics (#10436) · aca6288f
  Bhadresh Savani authored Feb 27, 2021
```
* updated logging and saving metrics

* space removal
```
  aca6288f
- [run_seq2seq.py] restore functionality: saving to test_generations.txt (#10428) · f52a1589
  Stas Bekman authored Feb 27, 2021
```
This PR restores the original functionality that for some reason was modified.

Fixes: https://github.com/huggingface/transformers/issues/10381

@sgugger
```
  f52a1589
- [examples] better model example (#10427) · ee04b698
  Stas Bekman authored Feb 26, 2021
```
* refactors

* typo
```
  ee04b698
25 Feb, 2021 3 commits

Fix run_glue evaluation when model has a label correspondence (#10401) · 17b6e0d4
Sylvain Gugger authored Feb 25, 2021

17b6e0d4

Add support for ZeRO-2/3 and ZeRO-offload in fairscale (#10354) · 9d14be5c

Sylvain Gugger authored Feb 25, 2021



* Ass support for ZeRO-2/3 and ZeRO-offload in fairscale

* Quality

* Rework from review comments

* Add doc

* Apply suggestions from code review
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

9d14be5c

[PretrainedFeatureExtractor] + Wav2Vec2FeatureExtractor, Wav2Vec2Processor,... · cb38ffcc

Patrick von Platen authored Feb 25, 2021

[PretrainedFeatureExtractor] + Wav2Vec2FeatureExtractor, Wav2Vec2Processor, Wav2Vec2Tokenizer (#10324)

* push to show

* small improvement

* small improvement

* Update src/transformers/feature_extraction_utils.py

* Update src/transformers/feature_extraction_utils.py

* implement base

* add common tests

* make all tests pass for wav2vec2

* make padding work & add more tests

* finalize feature extractor utils

* add call method to feature extraction

* finalize feature processor

* finish tokenizer

* finish general processor design

* finish tests

* typo

* remove bogus file

* finish docstring

* add docs

* finish docs

* small fix

* correct docs

* save intermediate

* load changes

* apply changes

* apply changes to doc

* change tests

* apply surajs recommend

* final changes

* Apply suggestions from code review

* fix typo

* fix import

* correct docstring

cb38ffcc

24 Feb, 2021 1 commit

[Trainer/Deepspeed] handle get_last_lr() before first step() (#10362) · 3437d121

Stas Bekman authored Feb 23, 2021

* handle get_last_lr() before first step()

* abstract away the lr getting logic

* cleanup

* add test

* move to utils

3437d121

23 Feb, 2021 1 commit
- Fix broken examples/seq2seq/README.md markdown (#10344) · 23e87c27
  Akmal authored Feb 23, 2021
  
  23e87c27
22 Feb, 2021 3 commits
- [trainer] add Trainer methods for metrics logging and saving (#10266) · 622a8c59
  Stas Bekman authored Feb 22, 2021
```
* make logging and saving trainer built-in

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  622a8c59
- [Trainer] implement gradient_accumulation_steps support in DeepSpeed integration (#10310) · eab0afc1
  Stas Bekman authored Feb 22, 2021
```
* implement gradient_accumulation_steps support in DeepSpeed integration

* typo

* cleanup

* cleanup
```
  eab0afc1
- defensive programming + expand/correct README (#10295) · f991daed
  Stas Bekman authored Feb 22, 2021
  
  f991daed