Commits · 0b1f552a249540c3e5397b09da723be0f7b79066 · chenpangpang / transformers

"...lm-evaluation-harness.git" did not exist on "5dcbcd63090136ceed0bbc52aaa7dcb1914621e2"

15 Feb, 2021 4 commits

Add AMP for Albert (#10141) · 31b0560a
Julien Plu authored Feb 15, 2021

31b0560a

Suraj Patil authored Feb 15, 2021

* add tokenizer for mBART-50

* update tokenizers

* make src_lang and tgt_lang optional

* update tokenizer test

* add setter

* update docs

* update conversion script

* update docs

* update conversion script

* update tokenizer

* update test

* update docs

* doc

* address Sylvain's suggestions

* fix test

* fix formatting

* nits

6fc940ed

Check TF ops for ONNX compliance (#10025) · c8d3fa0d

Julien Plu authored Feb 15, 2021



* Add check-ops script

* Finish to implement check_tf_ops and start the test

* Make the test mandatory only for BERT

* Update tf_ops folder

* Remove useless classes

* Add the ONNX test for GPT2 and BART

* Add a onnxruntime slow test + better opset flexibility

* Fix test + apply style

* fix tests

* Switch min opset from 12 to 10

* Update src/transformers/file_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Fix GPT2

* Remove extra shape_list usage

* Fix GPT2

* Address Morgan's comments
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c8d3fa0d

Fixing NER pipeline for list inputs. (#10184) · 900daec2
Nicolas Patry authored Feb 15, 2021
```
Fixes #10168
```
900daec2

13 Feb, 2021 1 commit

Conversion from slow to fast for BPE spm vocabs contained an error. (#10120) · c9837a0d

Nicolas Patry authored Feb 13, 2021

* Conversion from slow to fast for BPE spm vocabs contained an error.

- There is only 1 test currently (tokenizers + slow) that used the modified path
and it's reformer, which does not contain any ids modification so the
bug was silent for now.
- The real issue is that vocab variable was overloaded by
SentencePieceExtractor, leading to Slow specific vocab oddities to be
completely ignored
- The bug was reported here https://github.com/huggingface/transformers/issues/9518
- Ran the complete tokenization test suite with slow without error
(`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`)

* Remove rebase error.

* Adding the fixture.

c9837a0d

12 Feb, 2021 2 commits
- [hf_api] delete deprecated methods and tests (2) · 641f418e
  Julien Chaumond authored Feb 12, 2021
  
  641f418e
- [hf_api] delete deprecated methods and tests (#10159) · eed31db9
  Julien Chaumond authored Feb 12, 2021
```
* [hf_api] delete deprecated methods and tests

cc @lhoestq

* Update test_hf_api.py
```
  eed31db9
11 Feb, 2021 1 commit

[Wav2Vec2] Improve Tokenizer & Model for batched inference (#10117) · 495c157d

Patrick von Platen authored Feb 11, 2021

* save intermediate

* finish batch the same as fairseq

* add normalization

* fix batched input

* add better comment

* Update src/transformers/models/wav2vec2/modeling_wav2vec2.py

* add nice docstring

* add tokenizer tests

* make all slow tests pass

* finish PR

* correct import

495c157d

10 Feb, 2021 3 commits

remove adjust_logits_during_generation method (#10087) · c130e67d

Suraj Patil authored Feb 10, 2021

* add forced logits processors

* delete adjust_logits method

* add forced_eos_token_id argument in config

* add tests for forced logits processors

* update gen utils tests

* add forced option to tf generate

* remove adjust_logits method from tf models

* update adjust_logits for marian

* delete _force_token_id_to_be_generated method

* style

* import warnings

* pass max_length to _get_logits_processor

* set forced_eos_token_id to None

* set forced attributes in conf utils

* typo

* fix rag generate

* add forced_eos_token_id in rag config

* remove force_bos_token_to_be_generated from BartConfig

* remove _force_token_ids_generation from FSMT

* nit

* fix negative constant

* apply suggestions from code review

c130e67d

Fix TF LED/Longformer attentions computation (#10007) · 22a32cf4

Julien Plu authored Feb 10, 2021

* Fix test

* Remove commented test

* Fix name

* Apply style

* Fix check copies

* Remove prints

* Restore boolean

* Fix reshape

22a32cf4

Line endings should be LF across repo and not CRLF (#10119) · 0d8e554d
Lysandre Debut authored Feb 10, 2021

0d8e554d

09 Feb, 2021 3 commits
- Fix TFConvBertModelIntegrationTest::test_inference_masked_lm Test (#10104) · 480a9d6b
  abhishek thakur authored Feb 09, 2021
  
  480a9d6b
- Add head_mask and decoder_head_mask to TF LED (#9988) · e7381c45
  Daniel Stancl authored Feb 09, 2021
```
* Add head masking to TF LED

* Add head_mask to Longformer + one doc piece to LED

* Fix integration tests
```
  e7381c45
- Deprecate Wav2Vec2ForMaskedLM and add Wav2Vec2ForCTC (#10089) · b972125c
  Patrick von Platen authored Feb 09, 2021
```
* add wav2vec2CTC and deprecate for maskedlm

* remove from docs
```
  b972125c
08 Feb, 2021 9 commits

Integration test for electra model (#10073) · 263fac71
sandip authored Feb 09, 2021

263fac71
Implementing the test integration of BertGeneration (#9990) · 3b7e612a
demSd authored Feb 08, 2021
```
* claiming this issue

* Integration test for BertGeneration(Encoder and Decoder)

* fix code quality
```
3b7e612a
fix bert2bert test (#10063) · 9e795eac
Patrick von Platen authored Feb 08, 2021

9e795eac

Restore TF embeddings and attention layers to their previous version (#9890) · 31563e05

Julien Plu authored Feb 08, 2021

* Refacto BERT

* Restore all the concerned models

* Remove print

* Update template

* Apply Sylvain's and Morgan's comments

* Fix cast

* Put the cast inside call

* Remove cond in ebds

* Fix funnel

* Restore previous dot product (attention_scores) computation

* Add ConvBERT and BART

* Make all the S2S models ONNX compliant

* Fix test

* Fix check copies

31563e05

Disable temporarily too slow tests (Longformer/LED) (#10062) · 8bb52bd2
Julien Plu authored Feb 08, 2021
```
* Disable temporarily too slow tests

* Fix style

* Fix template
```
8bb52bd2

Cleaning up `ConversationalPipeline` to support more than DialoGPT. (#10002) · b1aa4982

Nicolas Patry authored Feb 08, 2021

* Cleaning up `ConversationalPipeline` to support more than DialoGPT.

Currently ConversationalPipeline was heavily biased towards DialoGPT
,which is the default model for this pipeline.

This PR proposes changes to put back the modifications specific to
DialoGPT into tokenizer-specific behavior wherever possible, by
creating `_build_conversation_input_ids` function that takes
conversation as input, and returns a list of ints corresponding
to the tokens. It feels natural to put here because all models
have probably different strategies to build input_ids from the
full conversation and it's the tokenizer's job to transform strings
into tokens (and vice-versa)

If `_build_conversation_input_ids` is missing, previous behavior is
used so we don't break anything so far (except for blenderbot where it's a fix).

This PR also contains a fix for too long inputs. There used
to be dead code for trying to limit the size of incoming input.
The introduced fixed is that we limit
within `_build_conversation_input_ids` to `tokenizer.model_max_length`.
It corresponds to the intent of the removed dead code and is actually
better because it corresponds to `model_max_length` which is different
from `max_length` (which is a default parameter for `generate`).

- Removed `history` logic from the Conversation as it's not relevant
anymore because tokenization logic has been moved to tokenizer.
And tokenizer cannot save any cache, and conversation cannot know
what is relevant or not.
Also it's not usable from `blenderbot` because the input_ids are
not append only (EOS tokens is always at the end).

- Added `iter_texts` method on `Conversation` because all
the code was literred with some form of this iteration of
past/generated_responses.

* Removing torch mention in types.

* Adding type checking to `_build_conversation_input_ids`.

* Fixing import in strings.

b1aa4982

fix bart tests (#10060) · 9a0399e1
Patrick von Platen authored Feb 08, 2021

9a0399e1
Fix slow dpr test (#10059) · d51302cc
Lysandre Debut authored Feb 08, 2021
```
* Correct cast to device

* Comment back the slow test
```
d51302cc
Integration test for FlauBert (#10022) · 12e44af5
sandip authored Feb 08, 2021

12e44af5

04 Feb, 2021 4 commits

Hotfixing tests (blenderbot decoderonly tests, also need to remove (#10003) · d5888ef0
Nicolas Patry authored Feb 04, 2021
```
`encoder_no_repeat_ngram_size` from their config.
```
d5888ef0

Adding new `encoder_no_repeat_ngram_size` to `generate`. (#9984) · aeb18b92

Nicolas Patry authored Feb 04, 2021

Adding new `encoder_no_repeat_ngram_size` to `generate`.

Blenderbot results seemed off compared to original ParlAI script:
`https://parl.ai/projects/recipes/`

. Notably the model seems
to repeat a lot what was said during the conversation.

The actual problem was that `no_repeat_ngram_size` actually applies
to the `encoder_input_ids` but HF's `no_repeat_ngram_size` applies
to the previously generated ids (within the decoder). The history
conversation of blenderbot is within the `encoder` part so that
explains why HF's implementation had the repetitions.

This fix was focused on blenderbot *not* small and added tests
for those because they are quite different in configuration.

This change includes:

- Adding a new EncoderNoRepeatLogitProcessor.
- Adding 1 new arg to `generate` (`encoder_no_repeat_ngram_size`)
- Adding 1 new config parameter `encoder_no_repeat_ngram_size`.
- Adding 2 tests, one for the pipeline (high level, inputs exhibited
repeat behavior, one low level for EncoderNoRepeatLogitProcessor)
- Factored NoRepeatLogitProcessor so that logic could be reused.

Further work:

- Blenderbot conversational pipeline still does not behave correctly
 as they way input is prepared within the pipeline is still incorrect
(follow up PR)
- Blenderbot allows the bot to have personas, which is done by
prepending "your personna: XXXX" to the input, this could be explored
too in a follow up PR.

@patrickvonplaten
@LysandreJik

* Update src/transformers/generation_logits_process.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Doc quality.

* Fixing test.

* Last fixes.

* Fixing to account for batch_size.

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

aeb18b92

Added Integration testing for DistilBert model from issue #9948' (#9995) · 804cd185
Daniel Hug authored Feb 04, 2021

804cd185

BartForCausalLM analogs to `ProphetNetForCausalLM` (#9128) · 00031785

demSd authored Feb 04, 2021



* initiliaze bart4causalLM

* create BartDecoderWrapper, setters/getters

* delete spaces

* forward and additional methods

* update cache function, loss function, remove ngram* params in data class.

* add bartcausallm, bartdecoder testing

* correct bart for causal lm

* remove at

* add mbart as well

* up

* fix typo

* up

* correct

* add pegasusforcausallm

* add blenderbotforcausallm

* add blenderbotsmallforcausallm

* add marianforcausallm

* add test for MarianForCausalLM

* add Pegasus test

* add BlenderbotSmall test

* add blenderbot test

* fix a fail

* fix an import fail

* a fix

* fix

* Update modeling_pegasus.py

* fix models

* fix inputs_embeds setting getter

* adapt tests

* correct repo utils check

* finish test improvement

* fix tf models as well

* make style

* make fix-copies

* fix copies

* run all tests

* last changes

* fix all tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

00031785

03 Feb, 2021 6 commits
- Alber model integration testing added (#9980) · 2f06f2bc
  sandip authored Feb 03, 2021
  
  2f06f2bc
- Integration test added for TF MPnet (#9979) · 75fd00fb
  sandip authored Feb 03, 2021
  
  75fd00fb
- Integration test for mobilebert (#9978) · ce08043f
  sandip authored Feb 03, 2021
  
  ce08043f
- TF DistilBERT integration tests (#9975) · 1486205d
  sandip authored Feb 03, 2021
```
* TF DistilBERT integration test

* Update test_modeling_tf_distilbert.py
```
  1486205d
- Added integration tests for TensorFlow implementation of the ALBERT model (#9976) · f2d5c04e
  sandip authored Feb 03, 2021
```
* TF Albert integration test

* TF Alber integration test added
```
  f2d5c04e
- Fix Longformer and LED (#9942) · 3f77c26d
  Julien Plu authored Feb 03, 2021
```
* Fix Longformer and LED

* Add a test for graph execution with inputs_embeds

* Apply style
```
  3f77c26d
02 Feb, 2021 4 commits

Add head_mask and decoder_head_mask to PyTorch LED (#9856) · 71bdc076

Daniel Stancl authored Feb 02, 2021

* Add {decoder_,}head_mask to LED

* Fix create_custom_forward signatue in encoder

* Add head_mask to longformer

* Add head_mask to longformer to fix dependencies
of LED on Longformer.

* Not working yet

* Add mising one input in longofrmer_modeling.py

* make fix-copies

71bdc076

Wav2Vec2 (#9659) · d6217fb3

Patrick von Platen authored Feb 02, 2021



* add raw scaffold

* implement feat extract layers

* make style

* remove +

* correctly convert weights

* make feat extractor work

* make feature extraction proj work

* run forward pass

* finish forward pass

* Succesful decoding example

* remove unused files

* more changes

* add wav2vec tokenizer

* add new structure

* fix run forward

* add other layer norm architecture

* finish 2nd structure

* add model tests

* finish tests for tok and model

* clean-up

* make style

* finish docstring for model and config

* make style

* correct docstring

* correct tests

* change checkpoints to fairseq

* fix examples

* finish wav2vec2

* make style

* apply sylvains suggestions

* apply lysandres suggestions

* change print to log.info

* re-add assert statement

* add input_values as required input name

* finish wav2vec2 tokenizer

* Update tests/test_tokenization_wav2vec2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* apply sylvains suggestions
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

d6217fb3

ALBERT Tokenizer integration test (#9943) · 1809de51
Lysandre Debut authored Feb 02, 2021
```
* ALBERT Tokenizer integration test

* Batching

* Style
```
1809de51

[Tokenizer Utils Base] Make pad function more flexible (#9928) · 538b3b46

Patrick von Platen authored Feb 02, 2021

* change tokenizer requirement

* split line

* Correct typo from list to str

* improve style

* make other function pretty as well

* add comment

* correct typo

* add new test

* pass tests for tok without padding token

* Apply suggestions from code review

538b3b46

01 Feb, 2021 1 commit

Add head_mask and decoder_head_mask to FSMT (#9819) · 0c6c0afc

Daniel Stancl authored Feb 01, 2021

* Add {decoder_,}head_mask to fsmt_modeling.py

* Enable test_headmasking and some changes to docs

* Remove test_head_masking flag from fsmt test file

Remove test_head_masking flag from test_modeling_fsmt.py
since test_head_masking is set to be True by default (thus it is redundant to store).

* Merge master and remove test_head_masking = True

* Rebase necessary due to an update of jaxlib

* Remove test_head_masking=True in tests/test_modeling_fsmt.py
as it is redundant.

0c6c0afc

29 Jan, 2021 2 commits

Add XLA test (#9848) · fdcde144
Julien Plu authored Jan 29, 2021

fdcde144

Adding a new `return_full_text` parameter to TextGenerationPipeline. (#9852) · c2d0ffec

Nicolas Patry authored Jan 29, 2021

* Adding a new `return_full_text` parameter to TextGenerationPipeline.

For text-generation, it's sometimes used as prompting text.
In that context, prefixing `generated_text` with the actual input
forces the caller to take an extra step to remove it.

The proposed change adds a new parameter (for backward compatibility).
`return_full_text` that enables the caller to prevent adding the prefix.

* Doc quality.

c2d0ffec