- 09 Feb, 2021 5 commits
- Julien Plu authored
- abhishek thakur authored
  * Fix example in Wav2Vec2 documentation
  * Fix style
- Lysandre authored
- Patrick von Platen authored
  * Add Wav2Vec2ForCTC and deprecate Wav2Vec2ForMaskedLM
  * Remove from docs
- Lysandre authored
- 08 Feb, 2021 25 commits
- sandip authored
- Stas Bekman authored
- demSd authored
- Juan Cruz-Benito authored
  Removing run_pl_glue.py from text classification docs; include run_xnli.py and run_tf_text_classification.py (#10066)
  * Removing run_pl_glue.py from seq classification docs
  * Adding run_tf_text_classification.py
  * Using :prefix_link: to refer to local files
  * Applying "make style" to the branch
  * Update docs/source/task_summary.rst
  * Removing last underscores
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Lysandre authored
- Stas Bekman authored
  * deepspeed bug fixes and tests
  * manual wrap?
- Anthony MOI authored
- noise-field authored
  * Unify logging with f-strings
  * Get limits from MLflow rather than hardcoding them
  * Add a check for parameter length overflow; constants are also marked as internal
  * Don't stop the run in on_train_end: this causes bad behaviour when there is a separate validation step, as validation gets recorded as a separate run
  * Fix style
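The parameter-length check described above can be sketched as follows. `MAX_PARAM_VAL_LENGTH` mirrors MLflow's limit on logged parameter values (250 characters in `mlflow.utils.validation`), and `filter_loggable_params` is a hypothetical helper for illustration, not the callback's actual code:

```python
# Assumed to match mlflow.utils.validation.MAX_PARAM_VAL_LENGTH; in real code
# the limit would be read from MLflow rather than hardcoded, as the commit notes.
MAX_PARAM_VAL_LENGTH = 250

def filter_loggable_params(params: dict) -> dict:
    """Drop parameters whose string representation would overflow MLflow's
    value-length limit, instead of letting mlflow.log_param raise mid-training."""
    return {k: v for k, v in params.items() if len(str(v)) <= MAX_PARAM_VAL_LENGTH}

params = {"learning_rate": 5e-5, "notes": "x" * 300}
safe = filter_loggable_params(params)  # "notes" is too long and gets dropped
```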
- Olivier authored
  * Replace -100 token ids with the tokenizer pad_id for compute_metrics
  * Fixed typo for label_ids
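The `-100` replacement above is the usual pattern when decoding labels for metrics: the loss function ignores positions marked `-100`, but the tokenizer cannot decode that value back to text. A minimal sketch (the pad id of 0 and the helper name are illustrative; real code would use `tokenizer.pad_token_id`):

```python
def replace_ignore_index(label_ids, pad_token_id=0):
    """Swap the -100 loss-masking value for the pad token id so the labels
    can be decoded back to strings inside compute_metrics."""
    return [[pad_token_id if t == -100 else t for t in row] for row in label_ids]

labels = [[15, 27, 3, -100, -100],
          [8, 2, -100, -100, -100]]
cleaned = replace_ignore_index(labels)  # -100 positions become the pad id
```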
- Lysandre Debut authored
- demSd authored
  * Claiming this issue
  * Integration test for BertGeneration (Encoder and Decoder)
  * Fix code quality
- Julien Plu authored
  * Fix template
  * Fix template
- Patrick von Platen authored
- Julien Plu authored
  * Refactor BERT
  * Restore all the concerned models
  * Remove print
  * Update template
  * Apply Sylvain's and Morgan's comments
  * Fix cast
  * Put the cast inside call
  * Remove cond in ebds
  * Fix funnel
  * Restore previous dot product (attention_scores) computation
  * Add ConvBERT and BART
  * Make all the S2S models ONNX compliant
  * Fix test
  * Fix check copies
- Julien Plu authored
  * Temporarily disable too-slow tests
  * Fix style
  * Fix template
- Nicolas Patry authored
  * Clean up `ConversationalPipeline` to support more than DialoGPT. `ConversationalPipeline` was heavily biased towards DialoGPT, which is the default model for this pipeline. This PR moves the DialoGPT-specific logic back into tokenizer-specific behavior wherever possible, by creating a `_build_conversation_input_ids` function that takes a conversation as input and returns a list of ints corresponding to the tokens. It feels natural to put it there because models likely have different strategies for building input_ids from the full conversation, and it is the tokenizer's job to transform strings into tokens (and vice versa). If `_build_conversation_input_ids` is missing, the previous behavior is used, so nothing breaks so far (except for blenderbot, where this is a fix). This PR also contains a fix for too-long inputs. There used to be dead code trying to limit the size of incoming input; the fix introduced here is to truncate within `_build_conversation_input_ids` to `tokenizer.model_max_length`. This matches the intent of the removed dead code and is actually better, because `model_max_length` is different from `max_length` (which is a default parameter for `generate`).
  * Removed the `history` logic from `Conversation`: it is no longer relevant because the tokenization logic has moved to the tokenizer, the tokenizer cannot save any cache, and the conversation cannot know what is relevant or not. It is also not usable from `blenderbot`, because the input_ids are not append-only (the EOS token is always at the end).
  * Added an `iter_texts` method on `Conversation`, because the code was littered with some form of this iteration over past/generated responses.
  * Removing torch mention in types.
  * Adding type checking to `_build_conversation_input_ids`.
  * Fixing import in strings.
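A hedged sketch of the tokenizer-side hook described above, with a toy tokenizer standing in for a real one (the class, its word-level "vocabulary", and the `model_max_length` value are invented for illustration; real tokenizers use subword vocabularies and their own EOS conventions):

```python
from typing import List

class ToyTokenizer:
    eos_token_id = 2
    model_max_length = 8  # deliberately tiny to show truncation

    def encode(self, text: str) -> List[int]:
        # Stand-in for real subword tokenization: one id per whitespace word,
        # offset by 3 so ids never collide with the special tokens.
        return [hash(w) % 100 + 3 for w in text.split()]

    def _build_conversation_input_ids(self, conversation: List[str]) -> List[int]:
        """Concatenate every turn with an EOS after each one, then keep only
        the most recent tokens up to model_max_length (dropping old context)."""
        input_ids: List[int] = []
        for turn in conversation:
            input_ids.extend(self.encode(turn))
            input_ids.append(self.eos_token_id)
        return input_ids[-self.model_max_length:]

tok = ToyTokenizer()
ids = tok._build_conversation_input_ids(["Hello there", "Hi, how are you?", "Fine"])
```

Truncating to the tail keeps the most recent turns, which is the side of the conversation a chat model needs most.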
- Lysandre Debut authored
- Patrick von Platen authored
- Sylvain Gugger authored
- Sylvain Gugger authored
- Sylvain Gugger authored
- Lysandre Debut authored
  * Correct cast to device
  * Comment back the slow test
- sandip authored
- Stas Bekman authored
- Stas Bekman authored
- 05 Feb, 2021 7 commits
- Stas Bekman authored
  * Make executable
  * Make executable
  * Same for the template
  * Cleanup
- Suraj Patil authored
  * Add prepare_decoder_input_ids_from_labels in s2s models
  * Support label smoothing and encoder/embedding freezing
  * Fix freezing
  * Use pad_token_id from config
  * Remove embed freezing and add warning
  * Prepare decoder_input_ids inside DataCollatorForSeq2Seq
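The idea behind `prepare_decoder_input_ids_from_labels` can be sketched as follows, assuming the usual seq2seq convention of shifting the labels one position right and prepending a decoder start token. The token ids and the helper's exact shape are illustrative, not the library's implementation; real code reads the pad and start ids from the model config:

```python
PAD_ID, START_ID = 0, 1  # illustrative; real values come from model.config

def prepare_decoder_input_ids_from_labels(labels):
    """Shift each label row one position to the right, prepend the decoder
    start token, and replace the -100 loss-masking value with the pad id."""
    out = []
    for row in labels:
        shifted = [START_ID] + row[:-1]
        out.append([PAD_ID if t == -100 else t for t in shifted])
    return out

labels = [[5, 6, 7, -100]]
decoder_input_ids = prepare_decoder_input_ids_from_labels(labels)
```

Doing this inside the data collator means training scripts no longer need to build `decoder_input_ids` by hand from the labels.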
- Patrick von Platen authored
  * Bump minimum Jax requirement to 2.8.0
  * Update table
- Patrick von Platen authored
  * Add BigBird
  * Change teacher to mentor
  * Add proposal template
  * Adapt template
  * Delete old template
  * Correct some links
  * Finish template
  * Create BigBird from template
  * Add BigBird
  * Improve boxes
  * Finish boxes
  * Add pointers for BigBird
  * Finish BigBird
  * Apply Lysandre's and Sylvain's suggestions
  * Delete bogus file
  * Correct markdown
  * Try different style
  * Try different style
  * Finalize
- Lysandre Debut authored
  * Clarify QA pipeline output based on character
  * Style
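A small sketch of what character-based QA output means in practice: the pipeline's `start` and `end` values index into the original context string, so the answer is a direct slice. The prediction dict below is invented for illustration, not real pipeline output:

```python
context = "Hugging Face is based in New York City."

# Hypothetical prediction: start/end are character offsets into `context`.
prediction = {"score": 0.97, "start": 25, "end": 38}

# Because the span is character-based, recovering the answer is a plain slice.
answer = context[prediction["start"]:prediction["end"]]
```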
- Lysandre authored
- Lysandre authored
- 04 Feb, 2021 3 commits
- Sylvain Gugger authored
  * Update doc for pre-release
  * Use stable as default
  * Use the right commit :facepalms:
- Sylvain Gugger authored
- Sylvain Gugger authored