- 10 Feb, 2021 8 commits
-
-
Suraj Patil authored
* add forced logits processors
* delete adjust_logits method
* add forced_eos_token_id argument in config
* add tests for forced logits processors
* update gen utils tests
* add forced option to tf generate
* remove adjust_logits method from tf models
* update adjust_logits for marian
* delete _force_token_id_to_be_generated method
* style
* import warnings
* pass max_length to _get_logits_processor
* set forced_eos_token_id to None
* set forced attributes in conf utils
* typo
* fix rag generate
* add forced_eos_token_id in rag config
* remove force_bos_token_to_be_generated from BartConfig
* remove _force_token_ids_generation from FSMT
* nit
* fix negative constant
* apply suggestions from code review
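For reference, a minimal sketch of how the new argument can be used at generation time; the checkpoint and input text below are illustrative placeholders, not part of this commit:

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Illustrative checkpoint choice; any seq2seq model with an EOS token works.
tokenizer = AutoTokenizer.from_pretrained("facebook/bart-large-cnn")
model = AutoModelForSeq2SeqLM.from_pretrained("facebook/bart-large-cnn")

inputs = tokenizer("A long article to summarize...", return_tensors="pt")

# forced_eos_token_id forces the EOS token when max_length is reached;
# it can also be set once on the model config instead of per call.
output_ids = model.generate(
    **inputs,
    max_length=20,
    forced_eos_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```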
-
Julien Plu authored
* Fix test
* Remove commented test
* Fix name
* Apply style
* Fix check copies
* Remove prints
* Restore boolean
* Fix reshape
-
Lysandre Debut authored
-
Stas Bekman authored
-
Stas Bekman authored
I assume the CI machine should have at least 4 cores, so let's build docs faster
-
Stas Bekman authored
* how to specify a specific gpu
* new paper
* expand on buffer sizes
* style
* where to find config examples
* specific example
* small updates
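For the buffer-size point above, a hedged sketch of the relevant ZeRO-2 settings; the numbers are illustrative placeholders, not recommendations from the docs:

```python
# DeepSpeed config fragment as a Python dict (normally written as JSON).
ds_config = {
    "zero_optimization": {
        "stage": 2,
        # Bigger buckets mean faster communication but more GPU memory held
        # in the buffers; shrink these if you hit out-of-memory errors.
        "allgather_bucket_size": 2e8,
        "reduce_bucket_size": 2e8,
    },
}

# Pinning the run to a specific GPU happens outside the config, e.g. by
# exporting CUDA_VISIBLE_DEVICES=1 before launching the training script.
```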
-
Anthony MOI authored
-
Shiva Zamani authored
-
- 09 Feb, 2021 16 commits
-
-
Boris Dayma authored
* doc: update W&B related doc
* doc(wandb): mention report_to
* doc(wandb): commit suggestion
* doc(wandb): fix typo
* doc(wandb): remove WANDB_DISABLED

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
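A minimal sketch of the `report_to` usage the updated docs describe; `output_dir` and the run name are placeholders:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    report_to=["wandb"],       # opt in to W&B logging explicitly
    run_name="my-experiment",  # becomes the W&B run name
)
```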
-
abhishek thakur authored
-
Sylvain Gugger authored
-
Suraj Patil authored
* fix rag generate and tests
* put back adjust_logits_during_generation
* tests are okay

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Julien Plu authored
* Replace tf.newaxis -> tf.expand_dims
* Fix tests
* Use reshape when a tensor needs a double expand
* Fix GPT2
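The two forms are equivalent; a short sketch of the substitution, and of the reshape used for a double expand:

```python
import tensorflow as tf

x = tf.constant([[1.0, 2.0], [3.0, 4.0]])  # shape (2, 2)

# Indexing with tf.newaxis and calling tf.expand_dims produce the same
# result; the explicit op tends to be friendlier to graph export.
a = x[:, tf.newaxis, :]        # shape (2, 1, 2)
b = tf.expand_dims(x, axis=1)  # shape (2, 1, 2)

# For a "double expand", one reshape replaces two chained expand_dims calls.
c = tf.reshape(x, (2, 1, 1, 2))  # shape (2, 1, 1, 2)
```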
-
Daniel Stancl authored
* Add head masking to TF LED
* Add head_mask to Longformer + one doc piece to LED
* Fix integration tests
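A hedged sketch of how a head mask is passed; the checkpoint is illustrative, and the all-ones mask simply keeps every head active:

```python
import tensorflow as tf
from transformers import LongformerTokenizer, TFLongformerModel

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = TFLongformerModel.from_pretrained("allenai/longformer-base-4096")

inputs = tokenizer("Head masking now works in the TF models.", return_tensors="tf")

# One row per layer, one entry per head; 0.0 silences a head, 1.0 keeps it.
head_mask = tf.ones((model.config.num_hidden_layers, model.config.num_attention_heads))

outputs = model(**inputs, head_mask=head_mask)
```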
-
Sylvain Gugger authored
-
Lysandre Debut authored
* Enable propagation by default
* Document enable/disable default handler
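A minimal sketch of the handler and propagation controls being documented; the combination below avoids double-printing once records propagate to the root logger:

```python
from transformers.utils import logging

logging.set_verbosity_info()

# With propagation on (the new default per this change), library records
# reach the root logger; dropping the library's own handler then avoids
# every message being printed twice.
logging.enable_propagation()
logging.disable_default_handler()

logger = logging.get_logger("transformers")
logger.info("Handled by whatever handlers the root logger has configured.")
```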
-
Suraj Patil authored
* add do_predict, pass eval_beams during eval
* update help
* apply suggestions from code review
-
Julien Plu authored
-
abhishek thakur authored
* Fix example in Wav2Vec2 documentation
* fix style
-
Lysandre authored
-
Patrick von Platen authored
* add Wav2Vec2ForCTC and deprecate Wav2Vec2ForMaskedLM
* remove from docs
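A minimal CTC decoding sketch with the new class; the checkpoint and the silent dummy waveform are placeholders:

```python
import torch
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base-960h")
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base-960h")

speech = [0.0] * 16000  # stand-in for one second of 16kHz mono audio
inputs = processor(speech, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits  # (batch, time, vocab)

# Greedy CTC decoding: take the best token per frame, then collapse.
predicted_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(predicted_ids)
```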
-
Lysandre authored
-
- 08 Feb, 2021 16 commits
-
-
sandip authored
-
Stas Bekman authored
-
demSd authored
-
Juan Cruz-Benito authored
Removing run_pl_glue.py from text classification docs, include run_xnli.py & run_tf_text_classification.py (#10066)
* Removing run_pl_glue.py from seq classification docs
* Adding run_tf_text_classification.py
* Using :prefix_link: to refer to local files
* Applying "make style" to the branch
* Update docs/source/task_summary.rst
* Removing last underscores

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Lysandre authored
-
Stas Bekman authored
* deepspeed bug fixes and tests
* manual wrap?
-
Anthony MOI authored
-
noise-field authored
* Unify logging with f-strings
* Get limits from MLflow rather than hardcode
* Add a check for parameter length overflow. Also, constants are marked as internal.
* Don't stop run in on_train_end. Doing so causes bad behaviour when there is a separate validation step: validation gets recorded as a separate run.
* Fix style
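A hedged sketch of the overflow check: MLflow rejects parameter values beyond its documented limit, so oversized entries are skipped with a warning rather than crashing the run. The helper name is made up for illustration:

```python
import logging

from mlflow.utils.validation import MAX_PARAM_VAL_LENGTH

logger = logging.getLogger(__name__)

def drop_oversized_params(params: dict) -> dict:
    """Drop parameters whose stringified value exceeds MLflow's limit."""
    kept = {}
    for name, value in params.items():
        if len(str(value)) > MAX_PARAM_VAL_LENGTH:
            logger.warning(
                "Skipping %r: value is longer than %d characters",
                name, MAX_PARAM_VAL_LENGTH,
            )
            continue
        kept[name] = value
    return kept
```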
-
Olivier authored
* replace -100 token ids with the tokenizer pad_id for compute_metrics
* fixed typo for label_ids
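The fix in a nutshell: -100 marks label positions the loss ignores, but tokenizers cannot decode it. A minimal sketch, assuming a `tokenizer` in scope and token-id predictions:

```python
import numpy as np

def compute_metrics(eval_pred):
    predictions, label_ids = eval_pred
    # -100 is only meaningful to the loss; swap it for the real pad token
    # id so the labels can be decoded back to text.
    label_ids = np.where(label_ids != -100, label_ids, tokenizer.pad_token_id)
    decoded_preds = tokenizer.batch_decode(predictions, skip_special_tokens=True)
    decoded_labels = tokenizer.batch_decode(label_ids, skip_special_tokens=True)
    # ... score decoded_preds against decoded_labels and return the metrics
    return {}
```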
-
Lysandre Debut authored
-
demSd authored
* claiming this issue
* Integration test for BertGeneration (Encoder and Decoder)
* fix code quality
-
Julien Plu authored
* Fix template
* Fix template
-
Patrick von Platen authored
-
Julien Plu authored
* Refactor BERT
* Restore all the concerned models
* Remove print
* Update template
* Apply Sylvain's and Morgan's comments
* Fix cast
* Put the cast inside call
* Remove cond in embeddings
* Fix funnel
* Restore previous dot product (attention_scores) computation
* Add ConvBERT and BART
* Make all the S2S models ONNX compliant
* Fix test
* Fix check copies
-
Julien Plu authored
* Temporarily disable tests that are too slow
* Fix style
* Fix template
-
Nicolas Patry authored
* Cleaning up `ConversationalPipeline` to support more than DialoGPT.

  Currently, ConversationalPipeline is heavily biased towards DialoGPT, which is the default model for this pipeline. This PR proposes to move the DialoGPT-specific modifications back into tokenizer-specific behavior wherever possible, by creating a `_build_conversation_input_ids` function that takes a conversation as input and returns a list of ints corresponding to the tokens. It feels natural to put it there because all models probably have different strategies for building input_ids from the full conversation, and it's the tokenizer's job to transform strings into tokens (and vice versa). If `_build_conversation_input_ids` is missing, the previous behavior is used, so nothing breaks so far (except for blenderbot, where it's a fix).

  This PR also contains a fix for too-long inputs. There used to be dead code that tried to limit the size of the incoming input. The introduced fix is that we limit, within `_build_conversation_input_ids`, to `tokenizer.model_max_length`. This matches the intent of the removed dead code and is actually better, because `model_max_length` is different from `max_length` (which is a default parameter for `generate`).

  - Removed the `history` logic from `Conversation`, as it's no longer relevant: the tokenization logic has moved to the tokenizer, the tokenizer cannot save any cache, and the conversation cannot know what is relevant or not. It's also not usable from `blenderbot`, because the input_ids are not append-only (the EOS token is always at the end).
  - Added an `iter_texts` method on `Conversation`, because the code was littered with some form of this iteration over past/generated_responses.
* Removing torch mention in types.
* Adding type checking to `_build_conversation_input_ids`.
* Fixing import in strings.
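A hedged sketch of the hook described above, written as a free function for clarity; the real thing is a `_build_conversation_input_ids` method on the tokenizer, and this body loosely mirrors the DialoGPT-style behavior:

```python
def build_conversation_input_ids(tokenizer, conversation) -> list:
    input_ids = []
    # Conversation.iter_texts() yields (is_user, text) over the full history.
    for is_user, text in conversation.iter_texts():
        input_ids.extend(tokenizer.encode(text, add_special_tokens=False))
        input_ids.append(tokenizer.eos_token_id)  # EOS closes each turn
    # Trim from the left so only the most recent turns fit the model.
    if len(input_ids) > tokenizer.model_max_length:
        input_ids = input_ids[-tokenizer.model_max_length:]
    return input_ids
```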
-