Commits · db9dd09cf9d8f5de9a5293ec16e7b3d0c01dcbbb · chenpangpang / transformers

30 Apr, 2021 8 commits

Adding `AutomaticSpeechRecognitionPipeline`. (#11337) · db9dd09c

Nicolas Patry authored Apr 30, 2021



* Adding `AutomaticSpeechRecognitionPipeline`.

- Because we added everything to enable this pipeline, we probably
should add it to `transformers`.
- This PR tries to limit the scope and focuses only on the pipeline part
(what should go in, and out).
- The tests are very specific for S2T and Wav2vec2 to make sure both
architectures are supported by the pipeline. We don't use the mixin for
tests right now, because that requires more work in the `pipeline`
function (will be done in a follow up PR).
- Unsure about the "helper" function `ffmpeg_read`. It makes a lot of
  sense from a user perspective, it does not add any additional
dependencies (as in hard dependency, because users can always use their
own load mechanism). Meanwhile, it feels slightly clunky to have so much
optional preprocessing.
- The pipeline is not done to support streaming audio right now.

Future work:

- Add `automatic-speech-recognition` as a `task`. And add the
FeatureExtractor.from_pretrained within `pipeline` function.
- Add small models within tests
- Add the Mixin to tests.
- Make the logic between ForCTC vs ForConditionalGeneration better.

* Update tests/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Adding docs + main import + type checking + LICENSE.

* Doc style !.

* Fixing TYPE_HINT.

* Specifying waveform shape in the docs.

* Adding asserts + specify in the documentation the shape of the input
np.ndarray.

* Update src/transformers/pipelines/automatic_speech_recognition.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Adding require to tests + move the `feature_extractor` doc.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

db9dd09c

T5 Gradient Checkpointing (#11353) · 76116f47

CeShine Lee authored Apr 30, 2021

* Implement gradient checkpoinging for T5Stack

* A bit more robust type checking

* Add `gradient_checkpointing` to T5Config

* Formatting

* Set requires_grad only when training

* None return value will only cause problems when training

* Change the output tuple according to `use_cache`

* Enable gradient checkpointing for the decoder

Squashed commit of the following:

commit 658bdd0bd1215353a8770f558bda2ea69a0ad0c7
Author: Ceshine Lee <shuanck@gmail.com>
Date:   Sat Apr 24 14:08:17 2021 +0800

    Only set `require_grad` for gradient checkpointing

commit acaeee6b2e675045fb28ce2176444c1d63e908bd
Author: Ceshine Lee <shuanck@gmail.com>
Date:   Sat Apr 24 13:59:35 2021 +0800

    Make gradient checkpointing work with the decoder

* Formatting

76116f47

Update README.md (#11489) · 58c789e3
Manuel Romero authored Apr 30, 2021
```
Add link to code
```
58c789e3
make style (#11520) · 022a1e9e
Patrick von Platen authored Apr 30, 2021

022a1e9e

add sp_model_kwargs to unpickle of xlm roberta tok (#11430) · e0db8276

Philip May authored Apr 30, 2021

add test for pickle

simplify test

fix test code style

add missing pickle import

fix test

fix test

fix test

e0db8276

correct the dimension comment of matrix multiplication (#11494) · b43e3f93
Frederik Bode authored Apr 30, 2021
```
Co-authored-by: Frederik Bode <frederik@paperbox.ai>
```
b43e3f93
Pin HuggingFace Hub dependency (#11502) · f37f2adb
Lysandre Debut authored Apr 30, 2021

f37f2adb
Patch notification service · 60d5bda4
Lysandre authored Apr 30, 2021

60d5bda4

29 Apr, 2021 4 commits

Split checkpoint from model_name_or_path in examples (#11492) · b29eb247
Sylvain Gugger authored Apr 29, 2021
```
* Split checkpoint from model_name_or_path in examples

* Address review comments

* Address review comments
```
b29eb247
solved coefficient issue for the TF version of gelu_fast (#11514) · d6ec54ba
Michael Benayoun authored Apr 29, 2021
```
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
d6ec54ba
Reformat to make code clearer in tokenizer call (#11497) · ad1f7bef
Sylvain Gugger authored Apr 29, 2021
```
* Reformat to make code clearer

* Reformat to make code clearer
```
ad1f7bef

[Flax] Add docstrings & model outputs (#11498) · f748bd42

Patrick von Platen authored Apr 29, 2021



* add attentions & hidden states

* add model outputs + docs

* finish docs

* finish tests

* finish impl

* del @

* finish

* finish

* correct test

* apply sylvains suggestions

* Update src/transformers/models/bert/modeling_flax_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* simplify more
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f748bd42

28 Apr, 2021 3 commits

fix #1149 (#11493) · 3f6add8b
Hamel Husain authored Apr 28, 2021

3f6add8b

Update `PreTrainedTokenizerBase` to check/handle batch length for `text_pair` parameter (#11486) · c0eb218a

Hamel Husain authored Apr 28, 2021



* Update tokenization_utils_base.py

* add assertion

* check batch len

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add error message
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c0eb218a

Update min versions in README and add Flax (#11472) · 2d27900b
Sylvain Gugger authored Apr 28, 2021
```
* Update min versions in README and add Flax

* Adapt index
```
2d27900b

27 Apr, 2021 3 commits

fix docs for decoder_input_ids (#11466) · 8d43c71a
Suraj Patil authored Apr 27, 2021
```
* fix docs for decoder_input_ids

* revert the changes for bart and mbart
```
8d43c71a

Finish Making Quick Tour respect the model object (#11467) · 7ceff67e

Hamel Husain authored Apr 27, 2021



* finish quicktour

* fix import

* fix print

* explain config default better

* Update docs/source/quicktour.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7ceff67e

update QuickTour docs to reflect model output object (#11462) · 88ac60f7
Hamel Husain authored Apr 26, 2021
```
* update docs to reflect model output object

* run make style`
```
88ac60f7

26 Apr, 2021 20 commits
- Remove max length beam scorer (#11378) · 741d48f5
  Ashwin Geet D'Sa authored Apr 27, 2021
```
* removed max_len

* removed max_length from BeamSearchScorer

* correct max length

* finish

* del vim

* finish & add test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  741d48f5
- [Deepspeed] ZeRO-Infinity integration plus config revamp (#11418) · bc2571e6
  Stas Bekman authored Apr 26, 2021
```
* adding Z-inf

* revamp config process

* up version requirement

* wip

* massive rewrite

* cleanup

* cleanup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* consistent json commas

* act on suggestions

* leave this feature for 0.3.16

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  bc2571e6
- Variable Correction for Consistency in Distillation Example (#11444) · 0661abc5
  Jaimeen Ahn authored Apr 27, 2021
```
As the error comes from the inconsistency of variable meaning number of gpus in parser and its actual usage in the train.py script, 'gpus' and 'n_gpu' respectively,  the correction makes the example work
```
  0661abc5
- [Examples] Fixes inconsistency around eval vs val and predict vs test (#11380) · 1d30ec95
  Bhadresh Savani authored Apr 26, 2021
```
* added changes for uniformity

* modified files

* corrected typo

* fixed qa scripts

* fix typos

* fixed predict typo in qa no trainer

* fixed test file

* reverted trainer changes

* reverted trainer changes in custom exmaples

* updated readme

* added changes in deepspeed test

* added changes for predict and eval
```
  1d30ec95
- Give each test a different repo name (#11453) · 7959d835
  Sylvain Gugger authored Apr 26, 2021
  
  7959d835
- Style · b03b2a65
  Sylvain Gugger authored Apr 26, 2021
  
  b03b2a65
- make sure to test against the local checkout (#11437) · ce11318e
  Stas Bekman authored Apr 26, 2021
  
  ce11318e
- [docs] fix invalid class name (#11438) · a753cafd
  Stas Bekman authored Apr 26, 2021
```
* fix invalid class name

* proper ref

* proper ref
```
  a753cafd
- Clarify description of the is_split_into_words argument (#11449) · 6715e3b6
  Kostas Stathoulopoulos authored Apr 26, 2021
```
* Improve documentation for is_split_into_words argument

* Change description wording
```
  6715e3b6
- Pass along seed to DistributedSampler (#11406) · ab2cabb9
  Sylvain Gugger authored Apr 26, 2021
```
* Pass along seed to DistributedSampler

* Add seed to DistributedLengthGroupedSampler
```
  ab2cabb9
- fix some typos in docs, comments, logging/errors (#11432) · b24ead87
  LSinev authored Apr 26, 2021
  
  b24ead87
- docs(examples): fix link to TPU launcher script (#11427) · e3e70f95
  Amine Abdaoui authored Apr 26, 2021
  
  e3e70f95
- Add basic support for FP16 in SageMaker model parallelism (#11407) · d7633a4e
  Sylvain Gugger authored Apr 26, 2021
```
* Add FP16 support for SageMaker MP

* Add print debugs

* Squeeze

* Remove debug statements

* Add defensive check

* Typo
```
  d7633a4e
- TF BART models - Add `cross_attentions` to model output and fix... · 38a716cd
  Daniel Stancl authored Apr 26, 2021
```
TF BART models - Add `cross_attentions` to model output and fix cross-attention head masking (#10699)

* Add cross_attn_head_mask to BART

* Fix cross_attentions in TFBart-like models

* This commit enables returning of `cross_attentions`
for TFBart-like models

* It also fixes attention head masking in cross-attenion module

* Update TF model templates

* Fix missing , in TF model templates

* Fix typo: congig -> config
```
  38a716cd
- Pin black to 21.4b0 · 4bd6b54f
  Sylvain Gugger authored Apr 26, 2021
  
  4bd6b54f
- With style · c1625b32
  Sylvain Gugger authored Apr 26, 2021
  
  c1625b32
- Pin black to 20.8.b1 · 4b72cfd9
  Sylvain Gugger authored Apr 26, 2021
  
  4b72cfd9
- make style (#11442) · 32dbb2d9
  Patrick von Platen authored Apr 26, 2021
  
  32dbb2d9
- add pooling layer support (#11439) · 04ab2ca6
  Vasudev Gupta authored Apr 26, 2021
  
  04ab2ca6
- updating the checkpoint for GPT2ForSequence Classification to one with classification head (#11434) · 30f06589
  abiolaTresor authored Apr 26, 2021
  
  30f06589
25 Apr, 2021 2 commits

EncoderDecoderConfigs should not create new objects (#11300) · 35cd8eed

cronoik authored Apr 25, 2021



* removes the creation of separate config objects and uses the existing ones instead+overwrite resize_token_embeddings from parent class because it is not working for the EncoderDecoderModel

* rollback to current version of the huggingface master branch

* reworked version that ties the encoder and decoder config of the parent encoderdecoder instance

* overwrite of resize_token_embeddings throws an error now

* review comment suggestion
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* implemented warning in case encoderdecoder is created with differing configs of encoderdecoderconfig and decoderconfig or encoderconfig

* added test to avoid diverging configs of wrapper class and wrapped classes

* Update src/transformers/models/encoder_decoder/modeling_encoder_decoder.py

* make style
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

35cd8eed

Add head_mask, decoder_head_mask, cross_head_mask to ProphetNet (#9964) · f45cb66b

Daniel Stancl authored Apr 25, 2021

* Add head_mask & decoder_head_mask + some corrections

* Fix head masking for N-grams

* Enable test_headmasking for encoder and decod

* Fix one typo regarding in modeling_propgetnet.py

* Enable test_headmasking for ProphetNetStandaloneDecoderModelTest
and ProphetNetStandaloneEncoderModelTest in test_modeling_prophetnet.py

* make style

* Fix cross_head_mask

* Fix attention head mask naming

* `cross_head_mask` -> `cross_attn_head_mask`

* `cross_layer_head_mask` -> `cross_attn_layer_head_mask`

* Still need to merge #10605 to master to pass the tests

f45cb66b