- 24 Feb, 2021 1 commit
-
abhishek thakur authored
* convbert conversion test
* fin
* fin
* fin
* clean up tf<->pt conversion
* remove from_pt

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
- 22 Feb, 2021 2 commits
-
Sylvain Gugger authored
* Deprecate prepare_seq2seq_batch
* Fix last tests
* Apply suggestions from code review
* More review comments

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
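At that time, the replacement for a single `prepare_seq2seq_batch` call was to tokenize inputs and labels separately, switching the tokenizer into target mode with a context manager. A toy sketch of that pattern follows; the class and the token strings are made up for illustration and are not the transformers API:

```python
from contextlib import contextmanager

class ToySeq2SeqTokenizer:
    """Illustrates the context-manager pattern that replaced a single
    prepare-batch call: inside the context the tokenizer encodes the
    target side, outside it the source side."""

    def __init__(self):
        self._as_target = False

    @contextmanager
    def as_target_tokenizer(self):
        # Temporarily switch to target-side encoding.
        self._as_target = True
        try:
            yield self
        finally:
            self._as_target = False

    def __call__(self, text):
        prefix = "<tgt>" if self._as_target else "<src>"
        return [prefix] + text.split()

tok = ToySeq2SeqTokenizer()
inputs = tok("hello world")
with tok.as_target_tokenizer():
    labels = tok("bonjour monde")
```

The design keeps one tokenizer object but makes the source/target distinction explicit at each call site, instead of hiding both encodings behind one batch helper.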
-
Julien Plu authored
* AMP
* Add LED
* Apply style
* Fix longformer
-
- 19 Feb, 2021 9 commits
-
Pengcheng He authored
* Integrate DeBERTa v2 (the 1.5B model surpassed human performance on SuperGLUE); add DeBERTa v2 900M and 1.5B models
* DeBERTa-v2
* Fix v2 model loading issue (#10129)
* Doc members
* Update src/transformers/models/deberta/modeling_deberta.py
* Address Sylvain's comments
* Address Patrick's comments
* Style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Julien Plu authored
* Fix AMP and XLA
* Remove useless var
-
Julien Plu authored
* Fix AMP
* Apply style
* Remove unused import
-
Julien Plu authored
-
Julien Plu authored
* Fix XLA
* Rework cast
* Apply style
-
Julien Plu authored
* Fix AMP
* Trigger CI
* Rework cast
-
Julien Plu authored
* Fix AMP
* Rework cast
* Apply style
-
Stas Bekman authored
* implement --fp16_full_eval
* Apply suggestions from code review
* style
* add test

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
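The motivation for `--fp16_full_eval` is memory: running evaluation with weights held in half precision halves the bytes per parameter, at some cost in numerical precision. A minimal sketch of the arithmetic, using only the standard library (the helper function below is hypothetical, not part of the Trainer API):

```python
import struct

# Bytes per value for half ('e') and single ('f') precision floats.
BYTES_FP16 = struct.calcsize("e")  # 2
BYTES_FP32 = struct.calcsize("f")  # 4

def eval_weight_bytes(n_params, fp16_full_eval=False):
    """Approximate bytes needed to hold n_params model weights
    during evaluation, with or without full-fp16 evaluation."""
    return n_params * (BYTES_FP16 if fp16_full_eval else BYTES_FP32)
```

For a hypothetical 350M-parameter model this is roughly 1.4 GB in fp32 versus 0.7 GB in fp16, which is why the flag matters on memory-constrained GPUs.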
-
Stas Bekman authored
-
- 18 Feb, 2021 4 commits
-
Stas Bekman authored
* memory tracker metrics
* go back to eval for somewhat consistency
* handle no-gpu case
* deal with stackable eval calls
* restore callback order
* style
* simplify the API
* add test
* docs
* consistently use eval_ prefix
* improve docs
* Update src/transformers/trainer_utils.py
* rename method
* style

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
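A rough sketch of what a stage-scoped memory tracker reporting under a consistent `eval_` prefix can look like, using only `tracemalloc`. The metric names here are illustrative, and the real tracker in `trainer_utils.py` also handles GPU memory and the no-GPU case:

```python
import tracemalloc

def run_with_eval_memory_metrics(fn):
    """Run an evaluation callable while tracing CPU allocations,
    and report the results under an 'eval_' metric prefix."""
    tracemalloc.start()
    result = fn()
    current, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    metrics = {
        "eval_mem_cpu_alloc": current,  # bytes still allocated after the call
        "eval_mem_cpu_peaked": peak,    # peak bytes allocated during the call
    }
    return result, metrics

# Stand-in for an evaluation loop: allocate a largish list.
result, metrics = run_with_eval_memory_metrics(lambda: [0] * 100_000)
```

Prefixing every key with the stage name (`eval_`) keeps these metrics from colliding with training metrics when both land in the same logs.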
-
Julien Plu authored
* rework savedmodel slow test
* Improve savedmodel tests
* Remove useless content
-
Julien Plu authored
-
Julien Plu authored
* Fix XLA and AMP
* Fix AMP and XLA
* Apply style
* Apply Patrick's comment
-
- 17 Feb, 2021 4 commits
-
Julien Plu authored
* Fix XLA and AMP
* Apply style
* Remove useless cast
-
Julien Plu authored
* Fix Flaubert and XLM
* Remove useless cast
* Tiny fix
* Tiny fix
-
Julien Plu authored
* Update BART
* Update Blenderbot
* Update BlenderbotSmall
* Update Marian
* Update MBart
* Update MBart
* Update Pegasus
* Update template
* Fix Marian and Pegasus
* Apply style
* Default initializer
* Default initializer
* Default initializer
* Remove int32 casts
* Fix template
* Remove more cast
-
Daniel Stancl authored
* Fix head_mask and decoder_head_mask in TFT5 models
* Enable test_headmasking for both the TFT5 tester and the TFT5EncoderOnly tester

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
-
- 16 Feb, 2021 2 commits
-
Sylvain Gugger authored
-
Julien Plu authored
-
- 15 Feb, 2021 5 commits
-
Lysandre Debut authored
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
-
Julien Plu authored
-
Suraj Patil authored
* add tokenizer for mBART-50
* update tokenizers
* make src_lang and tgt_lang optional
* update tokenizer test
* add setter
* update docs
* update conversion script
* update docs
* update conversion script
* update tokenizer
* update test
* update docs
* doc
* address Sylvain's suggestions
* fix test
* fix formatting
* nits
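mBART-50 formats both the source and target sequences as a language code followed by the text and an end-of-sequence token, which is why the tokenizer takes `src_lang` and `tgt_lang`. A toy sketch of that convention; the class name and token strings are invented, and the real tokenizer works on subword ids:

```python
class ToyMBart50Tokenizer:
    """Sketch of mBART-50 formatting: [lang_code] tokens </s> on both
    sides. src_lang/tgt_lang are optional at construction and settable
    afterwards, mirroring the commit's 'make src_lang and tgt_lang
    optional' and 'add setter' steps."""

    def __init__(self, src_lang=None, tgt_lang=None):
        self.src_lang = src_lang
        self.tgt_lang = tgt_lang

    def encode_source(self, text):
        assert self.src_lang is not None, "set src_lang first"
        return [self.src_lang] + text.split() + ["</s>"]

    def encode_target(self, text):
        assert self.tgt_lang is not None, "set tgt_lang first"
        return [self.tgt_lang] + text.split() + ["</s>"]

tok = ToyMBart50Tokenizer(src_lang="en_XX", tgt_lang="ro_RO")
src = tok.encode_source("UN Chief Says There Is No War")
tgt = tok.encode_target("Seful ONU declara ca nu exista razboi")
```

Making the language codes plain attributes lets one tokenizer instance serve several translation directions by reassigning `src_lang`/`tgt_lang` between calls.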
-
Julien Plu authored
* Add check-ops script
* Finish to implement check_tf_ops and start the test
* Make the test mandatory only for BERT
* Update tf_ops folder
* Remove useless classes
* Add the ONNX test for GPT2 and BART
* Add a onnxruntime slow test + better opset flexibility
* Fix test + apply style
* fix tests
* Switch min opset from 12 to 10
* Update src/transformers/file_utils.py
* Fix GPT2
* Remove extra shape_list usage
* Fix GPT2
* Address Morgan's comments

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Nicolas Patry authored
Fixes #10168
-
- 13 Feb, 2021 1 commit
-
Nicolas Patry authored
* Conversion from slow to fast for BPE spm vocabs contained an error.
  - There is only one test currently (tokenizers + slow) that used the modified path, and it's Reformer, which does not contain any id modification, so the bug was silent until now.
  - The real issue is that the vocab variable was overloaded by SentencePieceExtractor, leading to slow-specific vocab oddities being completely ignored.
  - The bug was reported here: https://github.com/huggingface/transformers/issues/9518
  - Ran the complete tokenization test suite with slow without error (`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`)
* Remove rebase error.
* Adding the fixture.
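The shape of this bug can be shown with plain dicts: if the vocab extracted from the SentencePiece model is merged *over* the slow tokenizer's vocab, slow-specific id assignments are silently dropped. All tokens and ids below are invented for illustration:

```python
# Vocab as extracted from the SentencePiece model file.
extracted_vocab = {"<unk>": 0, "hello": 5}

# The slow tokenizer's vocab, which carries its own oddities:
# a remapped <unk> id and an extra <mask> token.
slow_vocab = {"<unk>": 3, "hello": 5, "<mask>": 6}

# Buggy merge order: the extractor's entries win, so the slow
# tokenizer's <unk> remapping is lost.
buggy = {**slow_vocab, **extracted_vocab}

# Fixed merge order: slow-specific entries stay authoritative.
fixed = {**extracted_vocab, **slow_vocab}
```

The silent failure mode described in the commit follows directly: for tokenizers whose slow vocab happens to match the extracted one exactly (like Reformer), both merge orders produce the same dict, so no test caught the difference.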
-
- 12 Feb, 2021 2 commits
-
Julien Chaumond authored
-
Julien Chaumond authored
* [hf_api] delete deprecated methods and tests cc @lhoestq
* Update test_hf_api.py
-
- 11 Feb, 2021 1 commit
-
Patrick von Platen authored
* save intermediate
* finish batch the same as fairseq
* add normalization
* fix batched input
* add better comment
* Update src/transformers/models/wav2vec2/modeling_wav2vec2.py
* add nice docstring
* add tokenizer tests
* make all slow tests pass
* finish PR
* correct import
-
- 10 Feb, 2021 3 commits
-
Suraj Patil authored
* add forced logits processors
* delete adjust_logits method
* add forced_eos_token_id argument in config
* add tests for forced logits processors
* update gen utils tests
* add forced option to tf generate
* remove adjust_logits method from tf models
* update adjust_logits for marian
* delete _force_token_id_to_be_generated method
* style
* import warnings
* pass max_length to _get_logits_processor
* set forced_eos_token_id to None
* set forced attributes in conf utils
* typo
* fix rag generate
* add forced_eos_token_id in rag config
* remove force_bos_token_to_be_generated from BartConfig
* remove _force_token_ids_generation from FSMT
* nit
* fix negative constant
* apply suggestions from code review
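A minimal sketch of the forced-EOS idea behind these processors: at the last position before `max_length`, every score except the forced EOS token's is set to minus infinity, so EOS is the only token that can be sampled. This plain-list version is for illustration only; the real processors operate on batched tensors:

```python
import math

def forced_eos_process(cur_len, max_length, scores, forced_eos_token_id):
    """If the next token is the last one allowed, mask every score
    except the forced EOS token's with -inf."""
    if cur_len == max_length - 1:
        return [
            s if i == forced_eos_token_id else -math.inf
            for i, s in enumerate(scores)
        ]
    return scores

# At the final step, only token id 2 survives.
last = forced_eos_process(9, 10, [0.1, 0.5, 0.2, 0.9], forced_eos_token_id=2)
# Earlier in generation, scores pass through unchanged.
early = forced_eos_process(3, 10, [0.1, 0.5, 0.2, 0.9], forced_eos_token_id=2)
```

Moving this logic out of per-model `adjust_logits` methods and into a shared processor, configured via `forced_eos_token_id`, is exactly the consolidation the commit describes.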
-
Julien Plu authored
* Fix test
* Remove commented test
* Fix name
* Apply style
* Fix check copies
* Remove prints
* Restore boolean
* Fix reshape
-
Lysandre Debut authored
-
- 09 Feb, 2021 3 commits
-
abhishek thakur authored
-
Daniel Stancl authored
* Add head masking to TF LED
* Add head_mask to Longformer + one doc piece to LED
* Fix integration tests
-
Patrick von Platen authored
* add wav2vec2CTC and deprecate for maskedlm
* remove from docs
-
- 08 Feb, 2021 3 commits
-
sandip authored
-
demSd authored
* claiming this issue
* Integration test for BertGeneration (Encoder and Decoder)
* fix code quality
-
Patrick von Platen authored
-