Commits · 8b240a06617455eae59e1116af6a1a016664e963 · chenpangpang / transformers

12 Oct, 2021 6 commits

Add TFEncoderDecoderModel + Add cross-attention to some TF models (#13222) · 8b240a06

Yih-Dar authored Oct 13, 2021



* Add cross attentions to TFGPT2Model

* Add TFEncoderDecoderModel

* Add TFBaseModelOutputWithPoolingAndCrossAttentions

* Add cross attentions to TFBertModel

* Fix past or past_key_values argument issue

* Fix generation

* Fix save and load

* Add some checks and comments

* Clean the code that deals with past keys/values

* Add kwargs to processing_inputs

* Add serving_output to TFEncoderDecoderModel

* Some cleaning + fix use_cache value issue

* Fix tests + add bert2bert/bert2gpt2 tests

* Fix more tests

* Ignore crossattention.bias when loading GPT2 weights into TFGPT2

* Fix return_dict_in_generate in tf generation

* Fix is_token_logit_eos_token bug in tf generation

* Finalize the tests after fixing some bugs

* Fix another is_token_logit_eos_token bug in tf generation

* Add/Update docs

* Add TFBertEncoderDecoderModelTest

* Clean test script

* Add TFEncoderDecoderModel to the library

* Add cross attentions to TFRobertaModel

* Add TFRobertaEncoderDecoderModelTest

* make style

* Change the way of position_ids computation

* bug fix

* Fix copies in tf_albert

* Remove some copied from and apply some fix-copies

* Remove some copied

* Add cross attentions to some other TF models

* Remove encoder_hidden_states from TFLayoutLMModel.call for now

* Make style

* Fix TFRemBertForCausalLM

* Revert the change to longformer + Remove copies

* Revert the change to albert and convbert + Remove copies

* make quality

* make style

* Add TFRembertEncoderDecoderModelTest

* make quality and fix-copies

* test TFRobertaForCausalLM

* Fixes for failed tests

* Fixes for failed tests

* fix more tests

* Fixes for failed tests

* Fix Auto mapping order

* Fix TFRemBertEncoder return value

* fix tf_rembert

* Check copies are OK

* Fix missing TFBaseModelOutputWithPastAndCrossAttentions is not defined

* Add TFEncoderDecoderModelSaveLoadTests

* fix tf weight loading

* check the change of use_cache

* Revert the change

* Add missing test_for_causal_lm for TFRobertaModelTest

* Try cleaning past

* fix _reorder_cache

* Revert some files to original versions

* Keep as many copies as possible

* Apply suggested changes - Use raise ValueError instead of assert

* Move import to top

* Fix wrong require_torch

* Replace more assert by raise ValueError

* Add test_pt_tf_model_equivalence (the test won't pass for now)

* add test for loading/saving

* finish

* finish

* Remove test_pt_tf_model_equivalence

* Update tf modeling template

* Remove pooling, added in the prev. commit, from MainLayer

* Update tf modeling test template

* Move inputs["use_cache"] = False to modeling_tf_utils.py

* Fix torch.Tensor in the comment

* fix use_cache

* Fix missing use_cache in ElectraConfig

* Add a note to from_pretrained

* Fix style

* Change test_encoder_decoder_save_load_from_encoder_decoder_from_pt

* Fix TFMLP (in TFGPT2) activation issue

* Fix None past_key_values value in serving_output

* Don't call get_encoderdecoder_model in TFEncoderDecoderModelTest.test_configuration_tie until we have a TF checkpoint on Hub

* Apply review suggestions - style for cross_attns in serving_output

* Apply review suggestions - change assert + docstrings

* break the error message to respect the char limit

* deprecate the argument past

* fix docstring style

* Update the encoder-decoder rst file

* fix Unknown interpreted text role "method"

* fix typo
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

8b240a06

Fixing the lecture values by making sure defaults are not changed (#13976) · 26b6ef79
Nicolas Patry authored Oct 12, 2021
```
384 // 4 < 128 would break `doc_stride`.
```
26b6ef79
[Wav2Vec2] Make sure tensors are always bool for mask_indices (#13977) · 58bf8825
Patrick von Platen authored Oct 12, 2021
```
* correct long to bool

* up

* correct code
```
58bf8825
Specify im-seg mask greyscole mode (#13974) · 11c043d2
Mishig Davaadorj authored Oct 12, 2021

11c043d2
Fix missing tpu variable in benchmark_args_tf.py (#13968) · 85d69a7d
Hardian Lawi authored Oct 12, 2021

85d69a7d
Remove pip 21.3 from installation candidates for model templates · 990de2c1
Lysandre Debut authored Oct 11, 2021

990de2c1

11 Oct, 2021 10 commits

[Speech Examples] Add pytorch speech pretraining (#13877) · d45fc7da

Patrick von Platen authored Oct 12, 2021

* adapt wav2vec2

* add example

* add files

* adapt

* remove bogus file

* Apply suggestions from code review

* adapt files more

* upload changes

* del old files

* up

* up

* up

* up

* up

* correct gradient checkpoitning

* add readme

* finish

* finish

* up

* more fixes

* up

* up

* add demo run to readme

* up

d45fc7da

Replace assert by ValueError of... · 3499728d

Lahfa Samy authored Oct 11, 2021


Replace assert by ValueError of src/transformers/models/electra/modeling_{electra,tf_electra}.py and all other models that had copies (#13955)

* Replace all assert by ValueError in src/transformers/models/electra

* Reformat with black to pass check_code_quality test

* Change some assert to ValueError of modeling_bert & modeling_tf_albert

* Change some assert in multiples models

* Change multiples models assertion to ValueError in order to validate
  check_code_style test and models template test.

* Black reformat

* Change some more asserts in multiples models

* Change assert to ValueError in modeling_layoutlm.py to fix copy error in code_style_check

* Add proper message to ValueError in modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/bert/modeling_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add ValueError message to models/convbert/modeling_tf_convbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add error message for ValueError to modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/tapas/modeling_tapas.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/electra/modeling_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add ValueError message in src/transformers/models/bert/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in src/transformers/models/rembert/modeling_rembert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in src/transformers/models/albert/modeling_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3499728d

Raise exceptions instead of asserts (#13938) · 64743d0a
Lukas Weiner authored Oct 11, 2021

64743d0a
Make username optional in hub_model_id (#13940) · 32634bce
Sylvain Gugger authored Oct 11, 2021

32634bce
Raise exceptions instead of asserts in xnli.py (#13945) · 708ffff6
Midhun R Nair authored Oct 11, 2021

708ffff6
Replace assert with unittest assertions (#13957) · e1bb2ebd
Luis F. Talavera R authored Oct 11, 2021

e1bb2ebd
change to apply `pad_to_multiple_of` to labels (#13949) · 6e4c8f68
Jungwoo Park authored Oct 11, 2021

6e4c8f68

[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer... · dca67968

Patrick von Platen authored Oct 11, 2021

[Gradient checkpoining] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961)

* up

* correct test

dca67968

Honor existing attention mask in tokenzier.pad (#13926) · 4a18337b

Sylvain Gugger authored Oct 11, 2021

* Honor existing attention mask in tokenzier.pad

* Fix initialization of attention mask

* Roll the implem on all subclasses

* Fix tests

4a18337b

Raise ValueError instead of asserts in src/transformers/benchmark/benchmark.py (#13951) · 3c0c699f
Lahfa Samy authored Oct 11, 2021
```
* Raise ValueError exception instead of assert

* Remove f unnecessary f-strings

* Remove unused f-strings
```
3c0c699f

09 Oct, 2021 1 commit
- fix issue 13904 -attribute does not exist- by change self_.mapping to self._model_mapping (#13942) · 91758e39
  oraby8 authored Oct 09, 2021
  
  91758e39
08 Oct, 2021 11 commits

Update bug-report.md (#13934) · 239bd61b

Lysandre Debut authored Oct 08, 2021



* Update bug-report.md

* Update .github/ISSUE_TEMPLATE/bug-report.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update .github/ISSUE_TEMPLATE/bug-report.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update .github/ISSUE_TEMPLATE/bug-report.md
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update .github/ISSUE_TEMPLATE/bug-report.md
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

239bd61b

Fix typo in README.md (#13883) · 46dfe99e
Chungman Lee authored Oct 09, 2021

46dfe99e
Merge remote-tracking branch 'origin/master' · 3e218523
Sylvain Gugger authored Oct 08, 2021

3e218523
Move to TF only · 9e15b511
Sylvain Gugger authored Oct 08, 2021

9e15b511
Style · cb911e5b
Sylvain Gugger authored Oct 08, 2021

cb911e5b
[Generation] Fix max_new_tokens (#13919) · c8b07612
Patrick von Platen authored Oct 08, 2021
```
* up

* Update src/transformers/generation_stopping_criteria.py

* finish
```
c8b07612
Register `keras_callbacks` as a submodule · 5a1b5e4b
Sylvain Gugger authored Oct 08, 2021

5a1b5e4b
Fixed typo: herBERT -> HerBERT (#13936) · 23ee06ed
Adam Kaczmarek authored Oct 08, 2021

23ee06ed

Adds `PreTrainedModel.framework` attribute (#13817) · de344815

Stella Biderman authored Oct 08, 2021



* Added `framework` attribute

* Update modeling_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* string -> str

* Update modeling_tf_utils.py

* string -> str

* fixup

* make flake happy
Co-authored-by: patil-suraj <surajp815@gmail.com>

de344815

Adding support for tokens being suffixes or part of each other. (#13918) · d70919e6
Nicolas Patry authored Oct 08, 2021
```
* Adding support for tokens being suffixes or part of each other.

* Better test name.
```
d70919e6

Image Segmentation pipeline (#13828) · 026866df

Mishig Davaadorj authored Oct 08, 2021



* Implement img seg pipeline

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update output shape with individual masks

* Rm dev change

* Remove loops in test
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

026866df

07 Oct, 2021 8 commits
- [trainer] memory metrics: add memory at the start report (#13915) · be71ac3b
  Stas Bekman authored Oct 07, 2021
```
* [trainer] memory metrics: add memory at start

* fix for no-gpu
```
  be71ac3b
- Fix incorrect output shapes for TF/PT LED (#13882) · 61cf2ea9
  Matt authored Oct 07, 2021
```
* Fix issues with LED model

* Style pass

* Bugfixes

* correct attentions as well
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  61cf2ea9
- Add missing character (#13922) · 5f34163b
  Mishig Davaadorj authored Oct 07, 2021
  
  5f34163b
- [Wav2Vec2] Fix mask_feature_prob (#13921) · 0f5488f7
  Patrick von Platen authored Oct 07, 2021
```
* up

* overwrite hubert
```
  0f5488f7
- Add missing whitespace to multiline strings (#13916) · 57420b10
  Alex Hedges authored Oct 07, 2021
  
  57420b10
- #12789 Replace assert statements with exceptions (#13909) · 319beb64
  Dhananjay Shettigar authored Oct 07, 2021
```
* #12789 Replace assert statements with exceptions

* fix-copies: made copy changes to utils_qa.py in examples/pytorch/question-answering and examples/tensorflow/question-answering

* minor refactor for clarity
```
  319beb64
- Add an example of exporting BartModel + BeamSearch to ONNX module. (#13765) · 279ce5b7
  Jay Zhang authored Oct 07, 2021
```
* Add all example files.

* Reformat files by black.

* Style.

* Remove unused imports.
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
```
  279ce5b7
- Raise exceptions instead of asserts (#13907) · 0d309ce3
  Максим Заякин authored Oct 07, 2021
  
  0d309ce3
06 Oct, 2021 4 commits
- Deploy docs for v4.11.3 · 5be59a36
  Lysandre authored Oct 06, 2021
  
  5be59a36
- Fix nan-loss condition (#13911) · 5d390e9e
  Anton Lozhkov authored Oct 06, 2021
  
  5d390e9e
- Fix hp search for non sigopt backends (#13897) · 8f2c07d3
  Sylvain Gugger authored Oct 06, 2021
  
  8f2c07d3
- Fix trainer logging_nan_inf_filter in torch_xla mode (#13896) · 77770ec7
  Yanming Wang authored Oct 06, 2021
```
* Fix logging_nan_inf_filter in torch_xla mode

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix format
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  77770ec7