1. 18 Feb, 2021 2 commits
  2. 17 Feb, 2021 4 commits
  3. 16 Feb, 2021 2 commits
  4. 15 Feb, 2021 5 commits
  5. 13 Feb, 2021 1 commit
    • Conversion from slow to fast for BPE spm vocabs contained an error. (#10120) · c9837a0d
      Nicolas Patry authored
      * Conversion from slow to fast for BPE spm vocabs contained an error.
      
      - Only one test currently (tokenizers + slow) exercised the modified path,
      and it is Reformer, which does not contain any id modifications, so the
      bug was silent until now.
      - The real issue is that the `vocab` variable was overwritten by
      `SentencePieceExtractor`, causing slow-tokenizer-specific vocab oddities to be
      completely ignored (see the sketch after this entry).
      - The bug was reported here: https://github.com/huggingface/transformers/issues/9518
      - Ran the complete tokenization test suite with slow tests enabled, without error
      (`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`)
      
      * Remove rebase error.
      
      * Adding the fixture.
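      A minimal, hypothetical sketch of the variable-shadowing pattern described above (names and structure are illustrative, not the actual transformers converter code):

      ```python
      # Hypothetical illustration of the reported bug pattern, not the real converter.
      def build_fast_vocab(slow_vocab, spm_vocab, spm_merges):
          vocab = dict(slow_vocab)      # slow-specific oddities collected here
          vocab["<mask>"] = len(vocab)  # e.g. a slow-only id adjustment (made up)

          # BUG: rebinding `vocab` discards every adjustment made above.
          vocab, merges = spm_vocab, spm_merges
          return vocab, merges          # slow-specific entries are silently lost
      ```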
  6. 12 Feb, 2021 2 commits
  7. 11 Feb, 2021 1 commit
  8. 10 Feb, 2021 3 commits
    • remove adjust_logits_during_generation method (#10087) · c130e67d
      Suraj Patil authored
      * add forced logits processors (sketched after this entry)
      
      * delete adjust_logits method
      
      * add forced_eos_token_id argument in config
      
      * add tests for forced logits processors
      
      * update gen utils tests
      
      * add forced option to tf generate
      
      * remove adjust_logits method from tf models
      
      * update adjust_logits for marian
      
      * delete _force_token_id_to_be_generated method
      
      * style
      
      * import warnings
      
      * pass max_length to _get_logits_processor
      
      * set forced_eos_token_id to None
      
      * set forced attributes in conf utils
      
      * typo
      
      * fix rag generate
      
      * add forced_eos_token_id in rag config
      
      * remove force_bos_token_to_be_generated from BartConfig
      
      * remove _force_token_ids_generation from FSMT
      
      * nit
      
      * fix negative constant
      
      * apply suggestions from code review
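      A minimal sketch of the forced-EOS idea (assumptions: a logits processor called with `(input_ids, scores)` at each generation step, as in transformers' `LogitsProcessor` interface; the class names and details in the PR may differ):

      ```python
      import torch

      class ForcedEOSTokenSketch:
          """Force `eos_token_id` at the final step once `max_length` is reached."""

          def __init__(self, max_length: int, eos_token_id: int):
              self.max_length = max_length
              self.eos_token_id = eos_token_id

          def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
              cur_len = input_ids.shape[-1]
              if cur_len == self.max_length - 1:
                  # Mask every token except EOS so search/sampling must emit it.
                  mask = torch.full_like(scores, float("-inf"))
                  mask[:, self.eos_token_id] = 0.0
                  scores = scores + mask
              return scores
      ```

      With a `forced_eos_token_id` in the config, `generate()` can build such a processor in `_get_logits_processor` instead of every model overriding `adjust_logits_during_generation`.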
    • Fix TF LED/Longformer attentions computation (#10007) · 22a32cf4
      Julien Plu authored
      * Fix test
      
      * Remove commented test
      
      * Fix name
      
      * Apply style
      
      * Fix check copies
      
      * Remove prints
      
      * Restore boolean
      
      * Fix reshape
  9. 09 Feb, 2021 3 commits
  10. 08 Feb, 2021 9 commits
    • Integration test for electra model (#10073) · 263fac71
      sandip authored
    • Implementing the test integration of BertGeneration (#9990) · 3b7e612a
      demSd authored
      * claiming this issue
      
      * Integration test for BertGeneration(Encoder and Decoder)
      
      * fix code quality
    • fix bert2bert test (#10063) · 9e795eac
      Patrick von Platen authored
    • Restore TF embeddings and attention layers to their previous version (#9890) · 31563e05
      Julien Plu authored
      * Refactor BERT
      
      * Restore all the concerned models
      
      * Remove print
      
      * Update template
      
      * Apply Sylvain's and Morgan's comments
      
      * Fix cast
      
      * Put the cast inside call
      
      * Remove conditional in embeddings
      
      * Fix funnel
      
      * Restore previous dot product (attention_scores) computation (see the sketch after this entry)
      
      * Add ConvBERT and BART
      
      * Make all the S2S models ONNX compliant
      
      * Fix test
      
      * Fix check copies
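      For reference, the standard scaled dot-product attention scores that BERT-style layers compute (a generic sketch; the restored TF code may organize the ops differently):

      ```python
      import math
      import tensorflow as tf

      def attention_scores(query: tf.Tensor, key: tf.Tensor, head_dim: int) -> tf.Tensor:
          # (batch, heads, seq_q, d) x (batch, heads, seq_k, d)^T -> (batch, heads, seq_q, seq_k)
          scores = tf.matmul(query, key, transpose_b=True)
          return scores / math.sqrt(head_dim)
      ```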
    • Disable temporarily too slow tests (Longformer/LED) (#10062) · 8bb52bd2
      Julien Plu authored
      * Disable temporarily too slow tests
      
      * Fix style
      
      * Fix template
    • Cleaning up `ConversationalPipeline` to support more than DialoGPT. (#10002) · b1aa4982
      Nicolas Patry authored
      * Cleaning up `ConversationalPipeline` to support more than DialoGPT.
      
      Currently, ConversationalPipeline is heavily biased towards DialoGPT,
      which is the default model for this pipeline.
      
      This PR moves the DialoGPT-specific modifications back into
      tokenizer-specific behavior wherever possible, by creating a
      `_build_conversation_input_ids` method that takes a conversation
      as input and returns a list of ints corresponding to the tokens.
      It feels natural to put it there because models probably all have
      different strategies to build input_ids from the full conversation,
      and it's the tokenizer's job to transform strings into tokens
      (and vice versa). A sketch of such a hook follows this entry.
      
      If `_build_conversation_input_ids` is missing, the previous behavior is
      used, so nothing breaks so far (except for blenderbot, where it's a fix).
      
      This PR also contains a fix for overly long inputs. There used
      to be dead code that tried to limit the size of the incoming input.
      The introduced fix is to truncate
      within `_build_conversation_input_ids` to `tokenizer.model_max_length`.
      This matches the intent of the removed dead code and is actually
      better because it uses `model_max_length`, which is different
      from `max_length` (a default parameter for `generate`).
      
      - Removed the `history` logic from Conversation, as it's no longer
      relevant now that the tokenization logic has been moved to the tokenizer.
      The tokenizer cannot save any cache, and the conversation cannot know
      what is relevant or not.
      It's also not usable from `blenderbot` because the input_ids are
      not append-only (the EOS token is always at the end).
      
      - Added an `iter_texts` method on `Conversation` because all
      the code was littered with some form of this iteration over
      past/generated_responses.
      
      * Removing torch mention in types.
      
      * Adding type checking to `_build_conversation_input_ids`.
      
      * Fixing import in strings.
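      A minimal sketch of such a tokenizer hook, following the DialoGPT-style convention described above (a hypothetical simplification defined on a tokenizer class; the shipped per-tokenizer implementations may differ):

      ```python
      def _build_conversation_input_ids(self, conversation) -> list:
          # Concatenate every turn, each followed by EOS, as DialoGPT-style models expect.
          input_ids = []
          for is_user, text in conversation.iter_texts():
              input_ids.extend(self.encode(text, add_special_tokens=False))
              input_ids.append(self.eos_token_id)
          # Truncate from the left to `tokenizer.model_max_length`, keeping the most
          # recent turns (the fix for overly long inputs described above).
          if len(input_ids) > self.model_max_length:
              input_ids = input_ids[-self.model_max_length:]
          return input_ids
      ```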
    • fix bart tests (#10060) · 9a0399e1
      Patrick von Platen authored
    • Fix slow dpr test (#10059) · d51302cc
      Lysandre Debut authored
      * Correct cast to device
      
      * Comment back the slow test
    • Integration test for FlauBert (#10022) · 12e44af5
      sandip authored
  11. 04 Feb, 2021 4 commits
  12. 03 Feb, 2021 4 commits