- 19 Feb, 2021 9 commits
-
Pengcheng He authored
* Integrate DeBERTa v2 (the 1.5B model surpassed human performance on SuperGLUE); add DeBERTa v2 900M and 1.5B models
* DeBERTa-v2
* Fix v2 model loading issue (#10129)
* Doc members
* Update src/transformers/models/deberta/modeling_deberta.py
* Address Sylvain's comments
* Address Patrick's comments
* Style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Julien Plu authored
* Fix AMP and XLA * Remove useless var
-
Julien Plu authored
* Fix AMP * Apply style * Remove unused import
-
Julien Plu authored
-
Julien Plu authored
* Fix XLA * Rework cast * Apply style
-
Julien Plu authored
* Fix AMP * Trigger CI * Rework cast
-
Julien Plu authored
* Fix AMP * Rework cast * Apply style
-
Stas Bekman authored
* implement --fp16_full_eval
* Apply suggestions from code review
* style
* add test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
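A minimal sketch of how the new flag is typically wired up; it assumes the flag is exposed as a `TrainingArguments` field of the same name and that a `model` and `eval_dataset` are defined elsewhere in the user's script:

```python
from transformers import Trainer, TrainingArguments

# Assumption: --fp16_full_eval maps to the TrainingArguments field below;
# `model` and `eval_dataset` are placeholders defined elsewhere.
args = TrainingArguments(
    output_dir="out",
    per_device_eval_batch_size=8,
    fp16_full_eval=True,  # run evaluation/prediction entirely in fp16 to save GPU memory
)
trainer = Trainer(model=model, args=args, eval_dataset=eval_dataset)
metrics = trainer.evaluate()
```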
-
Stas Bekman authored
-
- 18 Feb, 2021 4 commits
-
Stas Bekman authored
* memory tracker metrics
* go back to eval for somewhat consistency
* handle no-gpu case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py
* rename method
* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
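A hedged sketch of how the memory-tracker metrics surface to users: the commit only says they are reported alongside the usual evaluation metrics with the `eval_` prefix, so the exact key names below are assumptions, and `trainer` is assumed to be an existing `Trainer` instance:

```python
# `trainer` is assumed to be a configured transformers.Trainer.
metrics = trainer.evaluate()

# Memory-tracker entries share the eval_ prefix with the other evaluation metrics;
# a key name like "eval_mem_gpu_alloc_delta" is an assumption, not from the commit.
memory_metrics = {k: v for k, v in metrics.items() if "_mem_" in k}
print(memory_metrics)
```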
-
Julien Plu authored
* rework savedmodel slow test * Improve savedmodel tests * Remove useless content
-
Julien Plu authored
-
Julien Plu authored
* Fix XLA and AMP * Fix AMP and XLA * Apply style * Apply Patrick's comment
-
- 17 Feb, 2021 4 commits
-
Julien Plu authored
* Fix XLA and AMP * Apply style * Remove useless cast
-
Julien Plu authored
* Fix Flaubert and XLM * Remove useless cast * Tiny fix * Tiny fix
-
Julien Plu authored
* Update BART
* Update Blenderbot
* Update BlenderbotSmall
* Update Marian
* Update MBart
* Update MBart
* Update Pegasus
* Update template
* Fix Marian and Pegasus
* Apply style
* Default initializer
* Default initializer
* Default initializer
* Remove int32 casts
* Fix template
* Remove more cast
-
Daniel Stancl authored
* Fix head_mask and decoder_head_mask in TFT5 models
* Enable test_headmasking both for the TFT5 tester and the TFT5EncoderOnly tester
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
- 16 Feb, 2021 2 commits
-
Sylvain Gugger authored
-
Julien Plu authored
-
- 15 Feb, 2021 5 commits
-
Lysandre Debut authored
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
-
Julien Plu authored
-
Suraj Patil authored
* add tokenizer for mBART-50
* update tokenizers
* make src_lang and tgt_lang optional
* update tokenizer test
* add setter
* update docs
* update conversion script
* update docs
* update conversion script
* update tokenizer
* update test
* update docs
* doc
* address Sylvain's suggestions
* fix test
* fix formatting
* nits
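A minimal usage sketch of the new tokenizer; the class name `MBart50TokenizerFast`, the `facebook/mbart-large-50` checkpoint, and the language codes are assumptions about the final API:

```python
from transformers import MBart50TokenizerFast

# src_lang / tgt_lang are optional at construction time and can be changed later
# through the setters mentioned above (attribute names assumed).
tokenizer = MBart50TokenizerFast.from_pretrained(
    "facebook/mbart-large-50", src_lang="en_XX", tgt_lang="ro_RO"
)
batch = tokenizer("UN Chief Says There Is No Plan to Stop War", return_tensors="pt")

tokenizer.src_lang = "fr_XX"  # switch source language on an existing tokenizer
```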
-
Julien Plu authored
* Add check-ops script
* Finish to implement check_tf_ops and start the test
* Make the test mandatory only for BERT
* Update tf_ops folder
* Remove useless classes
* Add the ONNX test for GPT2 and BART
* Add a onnxruntime slow test + better opset flexibility
* Fix test + apply style
* fix tests
* Switch min opset from 12 to 10
* Update src/transformers/file_utils.py
* Fix GPT2
* Remove extra shape_list usage
* Fix GPT2
* Address Morgan's comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Nicolas Patry authored
Fixes #10168
-
- 13 Feb, 2021 1 commit
-
Nicolas Patry authored
* Conversion from slow to fast for BPE spm vocabs contained an error.
  - There is only one test currently (tokenizers + slow) that used the modified path, and it's Reformer, which does not contain any ids modification, so the bug was silent until now.
  - The real issue is that the vocab variable was overloaded by SentencePieceExtractor, leading to slow-specific vocab oddities being completely ignored.
  - The bug was reported here: https://github.com/huggingface/transformers/issues/9518
  - Ran the complete tokenization test suite with slow tokenizers without error (`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`).
* Remove rebase error.
* Adding the fixture.
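For context, a hedged sketch of the slow-to-fast conversion path this fix touches; the import location of `convert_slow_tokenizer` and the Reformer checkpoint name are assumptions:

```python
from transformers import ReformerTokenizer
from transformers.convert_slow_tokenizer import convert_slow_tokenizer

# Load a sentencepiece-based slow tokenizer and convert it to a `tokenizers` backend;
# this conversion path is where the vocab overload described above lived.
slow = ReformerTokenizer.from_pretrained("google/reformer-crime-and-punishment")
fast_backend = convert_slow_tokenizer(slow)
print(fast_backend.encode("Hello world").ids)
```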
-
- 12 Feb, 2021 2 commits
-
Julien Chaumond authored
-
Julien Chaumond authored
* [hf_api] delete deprecated methods and tests cc @lhoestq * Update test_hf_api.py
-
- 11 Feb, 2021 1 commit
-
Patrick von Platen authored
* save intermediate
* finish batch the same as fairseq
* add normalization
* fix batched input
* add better comment
* Update src/transformers/models/wav2vec2/modeling_wav2vec2.py
* add nice docstring
* add tokenizer tests
* make all slow tests pass
* finish PR
* correct import
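A hedged sketch of the batched usage this work enables; the class and checkpoint names (`Wav2Vec2Tokenizer`, `Wav2Vec2ForCTC`, `facebook/wav2vec2-base-960h`) are assumptions, and the random arrays stand in for real 16 kHz waveforms:

```python
import torch
from transformers import Wav2Vec2ForCTC, Wav2Vec2Tokenizer

tokenizer = Wav2Vec2Tokenizer.from_pretrained("facebook/wav2vec2-base-960h")
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base-960h")

# Two raw waveforms of different lengths; the tokenizer pads (and, per the commit,
# normalizes) them into a single batch, matching fairseq's behaviour.
speech = [torch.randn(16000).numpy(), torch.randn(24000).numpy()]
inputs = tokenizer(speech, return_tensors="pt", padding=True)

logits = model(inputs.input_values).logits
predicted_ids = torch.argmax(logits, dim=-1)
print(tokenizer.batch_decode(predicted_ids))
```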
-
- 10 Feb, 2021 3 commits
-
Suraj Patil authored
* add forced logits processors
* delete adjust_logits method
* add forced_eos_token_id argument in config
* add tests for forced logits processors
* update gen utils tests
* add forced option to tf generate
* remove adjust_logits method from tf models
* update adjust_logits for marian
* delete _force_token_id_to_be_generated method
* style
* import warnings
* pass max_length to _get_logits_processor
* set forced_eos_token_id to None
* set forced attributes in conf utils
* typo
* fix rag generate
* add forced_eos_token_id in rag config
* remove force_bos_token_to_be_generated from BartConfig
* remove _force_token_ids_generation from FSMT
* nit
* fix negative constant
* apply suggestions from code review
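A hedged sketch of the new config-driven behaviour: setting `forced_eos_token_id` on the config makes `generate` force EOS at `max_length` through the new logits processor. The BART checkpoint name is an assumption:

```python
from transformers import BartForConditionalGeneration, BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-large-cnn")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")

# The config attribute replaces the old adjust_logits_during_generation /
# force_bos_token_to_be_generated machinery removed above.
model.config.forced_eos_token_id = model.config.eos_token_id

inputs = tokenizer("My friends are cool but they eat too many carbs.", return_tensors="pt")
summary_ids = model.generate(inputs.input_ids, max_length=20)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```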
-
Julien Plu authored
* Fix test * Remove commented test * Fix name * Apply style * Fix check copies * Remove prints * Restore boolean * Fix reshape
-
Lysandre Debut authored
-
- 09 Feb, 2021 3 commits
-
abhishek thakur authored
-
Daniel Stancl authored
* Add head masking to TF LED * Add head_mask to Longformer + one doc piece to LED * Fix integration tests
-
Patrick von Platen authored
* add wav2vec2CTC and deprecate for maskedlm * remove from docs
-
- 08 Feb, 2021 6 commits
-
sandip authored
-
demSd authored
* claiming this issue * Integration test for BertGeneration (Encoder and Decoder) * fix code quality
-
Patrick von Platen authored
-
Julien Plu authored
* Refacto BERT
* Restore all the concerned models
* Remove print
* Update template
* Apply Sylvain's and Morgan's comments
* Fix cast
* Put the cast inside call
* Remove cond in ebds
* Fix funnel
* Restore previous dot product (attention_scores) computation
* Add ConvBERT and BART
* Make all the S2S models ONNX compliant
* Fix test
* Fix check copies
-
Julien Plu authored
* Disable temporarily too slow tests * Fix style * Fix template
-
Nicolas Patry authored
* Cleaning up `ConversationalPipeline` to support more than DialoGPT.
  Currently ConversationalPipeline is heavily biased towards DialoGPT, which is the default model for this pipeline. This PR proposes changes to move the DialoGPT-specific modifications into tokenizer-specific behavior wherever possible, by creating a `_build_conversation_input_ids` function that takes a conversation as input and returns a list of ints corresponding to the tokens. It feels natural to put it there because all models probably have different strategies to build input_ids from the full conversation, and it's the tokenizer's job to transform strings into tokens (and vice versa). If `_build_conversation_input_ids` is missing, the previous behavior is used, so nothing breaks so far (except for Blenderbot, where it's a fix). See the sketch after this entry.
  This PR also contains a fix for too-long inputs. There used to be dead code for trying to limit the size of incoming input. The introduced fix is that we limit within `_build_conversation_input_ids` to `tokenizer.model_max_length`. It corresponds to the intent of the removed dead code and is actually better because it corresponds to `model_max_length`, which is different from `max_length` (a default parameter for `generate`).
  - Removed the `history` logic from the Conversation, as it's no longer relevant: the tokenization logic has been moved to the tokenizer, the tokenizer cannot save any cache, and the conversation cannot know what is relevant or not. It's also not usable from Blenderbot because the input_ids are not append-only (the EOS token is always at the end).
  - Added an `iter_texts` method on `Conversation` because all the code was littered with some form of this iteration over past inputs / generated_responses.
* Removing torch mention in types.
* Adding type checking to `_build_conversation_input_ids`.
* Fixing import in strings.
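An illustrative sketch (not the library's exact code) of what a tokenizer-side `_build_conversation_input_ids` can look like for a DialoGPT-style model: it is meant to live on a tokenizer class, builds `input_ids` from the whole conversation, and caps them at `tokenizer.model_max_length` as described above:

```python
from typing import List

def _build_conversation_input_ids(self, conversation) -> List[int]:
    # `conversation.iter_texts()` yields (is_user, text) pairs over past user
    # inputs and generated responses, per the iter_texts method added in this PR.
    input_ids = []
    for is_user, text in conversation.iter_texts():
        input_ids.extend(self.encode(text, add_special_tokens=False) + [self.eos_token_id])
    # Keep only the most recent tokens so the prompt fits the model's context.
    if len(input_ids) > self.model_max_length:
        input_ids = input_ids[-self.model_max_length:]
    return input_ids
```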
-