Commits · 8bb52bd24005599996fd21eab0770857cd1a37fe · chenpangpang / transformers

08 Feb, 2021 11 commits

Disable temporarily too slow tests (Longformer/LED) (#10062) · 8bb52bd2
Julien Plu authored Feb 08, 2021
```
* Disable temporarily too slow tests

* Fix style

* Fix template
```
8bb52bd2

Cleaning up `ConversationalPipeline` to support more than DialoGPT. (#10002) · b1aa4982

Nicolas Patry authored Feb 08, 2021

* Cleaning up `ConversationalPipeline` to support more than DialoGPT.

Currently ConversationalPipeline was heavily biased towards DialoGPT
,which is the default model for this pipeline.

This PR proposes changes to put back the modifications specific to
DialoGPT into tokenizer-specific behavior wherever possible, by
creating `_build_conversation_input_ids` function that takes
conversation as input, and returns a list of ints corresponding
to the tokens. It feels natural to put here because all models
have probably different strategies to build input_ids from the
full conversation and it's the tokenizer's job to transform strings
into tokens (and vice-versa)

If `_build_conversation_input_ids` is missing, previous behavior is
used so we don't break anything so far (except for blenderbot where it's a fix).

This PR also contains a fix for too long inputs. There used
to be dead code for trying to limit the size of incoming input.
The introduced fixed is that we limit
within `_build_conversation_input_ids` to `tokenizer.model_max_length`.
It corresponds to the intent of the removed dead code and is actually
better because it corresponds to `model_max_length` which is different
from `max_length` (which is a default parameter for `generate`).

- Removed `history` logic from the Conversation as it's not relevant
anymore because tokenization logic has been moved to tokenizer.
And tokenizer cannot save any cache, and conversation cannot know
what is relevant or not.
Also it's not usable from `blenderbot` because the input_ids are
not append only (EOS tokens is always at the end).

- Added `iter_texts` method on `Conversation` because all
the code was literred with some form of this iteration of
past/generated_responses.

* Removing torch mention in types.

* Adding type checking to `_build_conversation_input_ids`.

* Fixing import in strings.

b1aa4982

Fix typo (#10064) · ae37ceac
Lysandre Debut authored Feb 08, 2021

ae37ceac
fix bart tests (#10060) · 9a0399e1
Patrick von Platen authored Feb 08, 2021

9a0399e1
Truncate max length if needed in all examples (#10034) · b01483fa
Sylvain Gugger authored Feb 08, 2021

b01483fa
A few fixes in the documentation (#10033) · 45aaf5f7
Sylvain Gugger authored Feb 08, 2021

45aaf5f7
Check copies match full class/function names (#10030) · 04fd783c
Sylvain Gugger authored Feb 08, 2021

04fd783c
Fix slow dpr test (#10059) · d51302cc
Lysandre Debut authored Feb 08, 2021
```
* Correct cast to device

* Comment back the slow test
```
d51302cc
Integration test for FlauBert (#10022) · 12e44af5
sandip authored Feb 08, 2021

12e44af5
Can't mix --fp16 and --device cpu (#10041) · 24db8cc3
Stas Bekman authored Feb 07, 2021

24db8cc3
json to jsonlines, and doc, and typo (#10043) · 769948fa
Stas Bekman authored Feb 07, 2021

769948fa

05 Feb, 2021 7 commits

[examples] make run scripts executable (#10037) · 8ea412a8
Stas Bekman authored Feb 05, 2021
```
* make executable

* make executable

* same for the template

* cleanup
```
8ea412a8

[examples/seq2seq] support label smoothing (#9844) · 1cd16512

Suraj Patil authored Feb 05, 2021

* add prepare_decoder_input_ids_from_labels in s2s models

* support lbl smoothing and enc/emb freezing

* fix freezing

* use pad_token_id from config

* remove embed freezing and add warning

* prepare decoder_input_ids inside DataCollatorForSeq2Seq

1cd16512

Bump minimum Jax requirement to 2.8.0 (#10027) · b9720dd6
Patrick von Platen authored Feb 05, 2021
```
* Bump minimum Jax requirement to 2.8.0

* update table
```
b9720dd6

[Templates] Add template "call-for-model" markdown and "call-for-big-bird" markdown (#9921) · 89be094e

Patrick von Platen authored Feb 05, 2021

* add big bird

* change teacher to mentor

* add proposal template

* adapt template

* delete old template

* correct some links

* finish template

* create big bird from template

* add big bird

* improve boxes

* finish boxes

* add pointers for BigBird

* finish big bird

* up

* up

* up

* up

* apply lysandres and sylvains suggestions

* delete bogus file

* correct markdown

* try different style

* try different style

* finalize

89be094e

Clarify QA pipeline output based on character (#10021) · 4bbad604
Lysandre Debut authored Feb 05, 2021
```
* Clarify QA pipeline output based on character

* Style
```
4bbad604
Update doc deployment script path · ad2c4310
Lysandre authored Feb 05, 2021

ad2c4310
Update doc deployment script · 95a5f271
Lysandre authored Feb 05, 2021

95a5f271

04 Feb, 2021 14 commits

Update doc for pre-release (#10014) · 3be965c5

Sylvain Gugger authored Feb 04, 2021

* Update doc for pre-release

* Use stable as default

* Use the right commit :facepalms:

3be965c5

Bump version · ba607db1
Sylvain Gugger authored Feb 04, 2021

ba607db1
Release: 4.3.0.rc1 · 4cd22512
Sylvain Gugger authored Feb 04, 2021

4cd22512
Fix test for sagemaker and TPU integrations · 4739ce17
Sylvain Gugger authored Feb 04, 2021

4739ce17

Authorize last version of tokenizer (#9799) · 21b3922e

Sylvain Gugger authored Feb 04, 2021



* Authorize last version of tokenizer

* Update version table

* Fix conversion of spm tokenizers and fix some hub links

* Bump tokenizers version to 0.10.1rc1

* Add script to check tokenizers conversion with XNLI

* Add some more mask_token lstrip support

* Must modify mask_token in slow tokenizers too

* Keep using the old method for Pegasus

* add missing import
Co-authored-by: Anthony MOI <m.anthony.moi@gmail.com>

21b3922e

Hotfixing tests (blenderbot decoderonly tests, also need to remove (#10003) · d5888ef0
Nicolas Patry authored Feb 04, 2021
```
`encoder_no_repeat_ngram_size` from their config.
```
d5888ef0

[trainer] a few fixes (#9993) · 8c3b1fcb

Stas Bekman authored Feb 04, 2021

* trainer fixes

* don't switch the model  just for deepspeed and mp

* correct the fix

8c3b1fcb

Remove "double" assignment in TF-BART like models (#9997) · 714855bd

Daniel Stancl authored Feb 04, 2021

* Replace `attn_weights = attn_wegihts = tf.reshape(...)`
with `attn_weights = tf.reshape(...)` and thus remove
unintentionally used "double" assignment.

714855bd

Fix doc for TFConverBertModel · b72f16b3
Sylvain Gugger authored Feb 04, 2021

b72f16b3

Adding new `encoder_no_repeat_ngram_size` to `generate`. (#9984) · aeb18b92

Nicolas Patry authored Feb 04, 2021

Adding new `encoder_no_repeat_ngram_size` to `generate`.

Blenderbot results seemed off compared to original ParlAI script:
`https://parl.ai/projects/recipes/`

. Notably the model seems
to repeat a lot what was said during the conversation.

The actual problem was that `no_repeat_ngram_size` actually applies
to the `encoder_input_ids` but HF's `no_repeat_ngram_size` applies
to the previously generated ids (within the decoder). The history
conversation of blenderbot is within the `encoder` part so that
explains why HF's implementation had the repetitions.

This fix was focused on blenderbot *not* small and added tests
for those because they are quite different in configuration.

This change includes:

- Adding a new EncoderNoRepeatLogitProcessor.
- Adding 1 new arg to `generate` (`encoder_no_repeat_ngram_size`)
- Adding 1 new config parameter `encoder_no_repeat_ngram_size`.
- Adding 2 tests, one for the pipeline (high level, inputs exhibited
repeat behavior, one low level for EncoderNoRepeatLogitProcessor)
- Factored NoRepeatLogitProcessor so that logic could be reused.

Further work:

- Blenderbot conversational pipeline still does not behave correctly
 as they way input is prepared within the pipeline is still incorrect
(follow up PR)
- Blenderbot allows the bot to have personas, which is done by
prepending "your personna: XXXX" to the input, this could be explored
too in a follow up PR.

@patrickvonplaten
@LysandreJik

* Update src/transformers/generation_logits_process.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/configuration_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Doc quality.

* Fixing test.

* Last fixes.

* Fixing to account for batch_size.

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/generation_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

aeb18b92

Fix model templates (#9999) · e89c959a
Lysandre Debut authored Feb 04, 2021

e89c959a
Added Integration testing for DistilBert model from issue #9948' (#9995) · 804cd185
Daniel Hug authored Feb 04, 2021

804cd185

BartForCausalLM analogs to `ProphetNetForCausalLM` (#9128) · 00031785

demSd authored Feb 04, 2021



* initiliaze bart4causalLM

* create BartDecoderWrapper, setters/getters

* delete spaces

* forward and additional methods

* update cache function, loss function, remove ngram* params in data class.

* add bartcausallm, bartdecoder testing

* correct bart for causal lm

* remove at

* add mbart as well

* up

* fix typo

* up

* correct

* add pegasusforcausallm

* add blenderbotforcausallm

* add blenderbotsmallforcausallm

* add marianforcausallm

* add test for MarianForCausalLM

* add Pegasus test

* add BlenderbotSmall test

* add blenderbot test

* fix a fail

* fix an import fail

* a fix

* fix

* Update modeling_pegasus.py

* fix models

* fix inputs_embeds setting getter

* adapt tests

* correct repo utils check

* finish test improvement

* fix tf models as well

* make style

* make fix-copies

* fix copies

* run all tests

* last changes

* fix all tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

00031785

Add `from_slow` in fast tokenizers build and fixes some bugs (#9987) · 7898fc03
Sylvain Gugger authored Feb 04, 2021

7898fc03

03 Feb, 2021 8 commits
- distilbert: fix creation of sinusoidal embeddings when using PyTorch 1.8+ (#9917) · 6244727e
  Stefan Schweter authored Feb 03, 2021
  
  6244727e
- Alber model integration testing added (#9980) · 2f06f2bc
  sandip authored Feb 03, 2021
  
  2f06f2bc
- Integration test added for TF MPnet (#9979) · 75fd00fb
  sandip authored Feb 03, 2021
  
  75fd00fb
- Integration test for mobilebert (#9978) · ce08043f
  sandip authored Feb 03, 2021
  
  ce08043f
- TF DistilBERT integration tests (#9975) · 1486205d
  sandip authored Feb 03, 2021
```
* TF DistilBERT integration test

* Update test_modeling_tf_distilbert.py
```
  1486205d
- Added integration tests for TensorFlow implementation of the ALBERT model (#9976) · f2d5c04e
  sandip authored Feb 03, 2021
```
* TF Albert integration test

* TF Alber integration test added
```
  f2d5c04e
- [run_clm.py] fix getting extention · bca0dd5e
  Suraj Patil authored Feb 03, 2021
  
  bca0dd5e
- fix steps_in_epoch variable in trainer when using max_steps (#9969) · 5442a11f
  yylun authored Feb 03, 2021
```
* fix steps_in_epoch variable when using max_steps

* redundant sentence

* Revert "redundant sentence"

This reverts commit ad5c0e9b6e66d65732dee2239cdc9c76dfa0dc5a.

* remove redundant sentence
Co-authored-by: wujindou <wujindou@sogou-inc.com>
```
  5442a11f