Commits · 3499728dc47b87b8ca74ec7f43c62049e8279611 · chenpangpang / transformers

11 Oct, 2021 8 commits

Replace assert by ValueError of... · 3499728d

Lahfa Samy authored Oct 11, 2021


Replace assert by ValueError of src/transformers/models/electra/modeling_{electra,tf_electra}.py and all other models that had copies (#13955)

* Replace all assert by ValueError in src/transformers/models/electra

* Reformat with black to pass check_code_quality test

* Change some assert to ValueError of modeling_bert & modeling_tf_albert

* Change some asserts in multiple models

* Change assertions in multiple models to ValueError so that the
  check_code_style test and the model templates test pass.

* Black reformat

* Change some more asserts in multiple models

* Change assert to ValueError in modeling_layoutlm.py to fix copy error in code_style_check

* Add proper message to ValueError in modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/bert/modeling_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add ValueError message to models/convbert/modeling_tf_convbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add error message for ValueError to modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/tapas/modeling_tapas.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in models/electra/modeling_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add ValueError message in src/transformers/models/bert/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in src/transformers/models/rembert/modeling_rembert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Simplify logic in src/transformers/models/albert/modeling_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3499728d
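
The pattern behind this series of "replace assert by ValueError" commits can be sketched as follows. This is a minimal illustration, not code taken from the commit: the function name `split_heads_size` and its parameters are hypothetical.

```python
def split_heads_size(hidden_size: int, num_attention_heads: int) -> int:
    """Return the per-head size after validating the configuration.

    Hypothetical helper, used only to illustrate the assert -> ValueError pattern.
    """
    # Before: assert hidden_size % num_attention_heads == 0
    # An assert is stripped when Python runs with -O and carries no message
    # unless one is written by hand, so the check becomes an explicit exception.
    if hidden_size % num_attention_heads != 0:
        raise ValueError(
            f"hidden_size ({hidden_size}) must be a multiple of "
            f"num_attention_heads ({num_attention_heads})."
        )
    return hidden_size // num_attention_heads
```

Callers can then catch `ValueError` specifically, and the check remains active in optimized runs.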

Raise exceptions instead of asserts (#13938) · 64743d0a
Lukas Weiner authored Oct 11, 2021

64743d0a
Make username optional in hub_model_id (#13940) · 32634bce
Sylvain Gugger authored Oct 11, 2021

32634bce
Raise exceptions instead of asserts in xnli.py (#13945) · 708ffff6
Midhun R Nair authored Oct 11, 2021

708ffff6
change to apply `pad_to_multiple_of` to labels (#13949) · 6e4c8f68
Jungwoo Park authored Oct 11, 2021

6e4c8f68
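
As a rough sketch of what applying `pad_to_multiple_of` to labels could look like, the following pads every label row to a common length rounded up to the requested multiple. The function `pad_labels` and the default `label_pad_token_id=-100` are assumptions made for illustration, not the code from the commit.

```python
def pad_labels(label_rows, pad_to_multiple_of=None, label_pad_token_id=-100):
    """Pad each row of labels to a shared length (hypothetical helper)."""
    max_len = max(len(row) for row in label_rows)
    if pad_to_multiple_of is not None:
        # Round the target length up to the next multiple.
        max_len = ((max_len + pad_to_multiple_of - 1) // pad_to_multiple_of) * pad_to_multiple_of
    # Append the padding id on the right until every row reaches max_len.
    return [list(row) + [label_pad_token_id] * (max_len - len(row)) for row in label_rows]
```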

[Gradient checkpointing] Correct disabling `find_unused_parameters` in Trainer... · dca67968

Patrick von Platen authored Oct 11, 2021

[Gradient checkpointing] Correct disabling `find_unused_parameters` in Trainer when gradient checkpointing is enabled (#13961)

* up

* correct test

dca67968

Honor existing attention mask in tokenizer.pad (#13926) · 4a18337b

Sylvain Gugger authored Oct 11, 2021

* Honor existing attention mask in tokenizer.pad

* Fix initialization of attention mask

* Roll the implem on all subclasses

* Fix tests

4a18337b
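
The idea of honoring an existing attention mask while padding can be illustrated with a small standalone sketch. It is not the library implementation: `pad_example` and its arguments are hypothetical, and right-side padding is assumed.

```python
def pad_example(example, max_length, pad_token_id=0):
    """Right-pad one encoded example to max_length (hypothetical helper)."""
    input_ids = list(example["input_ids"])
    n_pad = max_length - len(input_ids)
    if n_pad < 0:
        raise ValueError(f"Example has {len(input_ids)} tokens, more than max_length={max_length}.")
    # Reuse the caller's mask if one exists; only fall back to all ones otherwise.
    mask = list(example.get("attention_mask", [1] * len(input_ids)))
    return {
        "input_ids": input_ids + [pad_token_id] * n_pad,
        "attention_mask": mask + [0] * n_pad,
    }
```

With this approach a mask such as `[1, 0, 1]` keeps its zero in the middle after padding instead of being overwritten by a freshly built mask.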

Raise ValueError instead of asserts in src/transformers/benchmark/benchmark.py (#13951) · 3c0c699f
Lahfa Samy authored Oct 11, 2021
```
* Raise ValueError exception instead of assert

* Remove unnecessary f-strings

* Remove unused f-strings
```
3c0c699f

09 Oct, 2021 1 commit
- fix issue 13904 -attribute does not exist- by changing self_.mapping to self._model_mapping (#13942) · 91758e39
  oraby8 authored Oct 09, 2021
  
  91758e39
08 Oct, 2021 7 commits

Move to TF only · 9e15b511
Sylvain Gugger authored Oct 08, 2021

9e15b511
Style · cb911e5b
Sylvain Gugger authored Oct 08, 2021

cb911e5b
[Generation] Fix max_new_tokens (#13919) · c8b07612
Patrick von Platen authored Oct 08, 2021
```
* up

* Update src/transformers/generation_stopping_criteria.py

* finish
```
c8b07612
Register `keras_callbacks` as a submodule · 5a1b5e4b
Sylvain Gugger authored Oct 08, 2021

5a1b5e4b

Adds `PreTrainedModel.framework` attribute (#13817) · de344815

Stella Biderman authored Oct 08, 2021



* Added `framework` attribute

* Update modeling_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* Update modeling_tf_utils.py

* Update modeling_utils.py

* Update modeling_utils.py

* Update modeling_tf_utils.py

* Update modeling_flax_utils.py

* string -> str

* Update modeling_tf_utils.py

* string -> str

* fixup

* make flake happy
Co-authored-by: patil-suraj <surajp815@gmail.com>

de344815

Adding support for tokens being suffixes or part of each other. (#13918) · d70919e6
Nicolas Patry authored Oct 08, 2021
```
* Adding support for tokens being suffixes or part of each other.

* Better test name.
```
d70919e6

Image Segmentation pipeline (#13828) · 026866df

Mishig Davaadorj authored Oct 08, 2021



* Implement img seg pipeline

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/pipelines/image_segmentation.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update output shape with individual masks

* Rm dev change

* Remove loops in test
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

026866df

07 Oct, 2021 5 commits
- [trainer] memory metrics: add memory at the start report (#13915) · be71ac3b
  Stas Bekman authored Oct 07, 2021
```
* [trainer] memory metrics: add memory at start

* fix for no-gpu
```
  be71ac3b
- Fix incorrect output shapes for TF/PT LED (#13882) · 61cf2ea9
  Matt authored Oct 07, 2021
```
* Fix issues with LED model

* Style pass

* Bugfixes

* correct attentions as well
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  61cf2ea9
- Add missing character (#13922) · 5f34163b
  Mishig Davaadorj authored Oct 07, 2021
  
  5f34163b
- [Wav2Vec2] Fix mask_feature_prob (#13921) · 0f5488f7
  Patrick von Platen authored Oct 07, 2021
```
* up

* overwrite hubert
```
  0f5488f7
- Add missing whitespace to multiline strings (#13916) · 57420b10
  Alex Hedges authored Oct 07, 2021
  
  57420b10
06 Oct, 2021 7 commits
- Fix nan-loss condition (#13911) · 5d390e9e
  Anton Lozhkov authored Oct 06, 2021
  
  5d390e9e
- Fix hp search for non sigopt backends (#13897) · 8f2c07d3
  Sylvain Gugger authored Oct 06, 2021
  
  8f2c07d3
- Fix trainer logging_nan_inf_filter in torch_xla mode (#13896) · 77770ec7
  Yanming Wang authored Oct 06, 2021
```
* Fix logging_nan_inf_filter in torch_xla mode

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix format
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  77770ec7
- T5ForConditionalGeneration: enabling using past_key_values and labels in training (#13805) · aea7c5b0
  yssjtu authored Oct 06, 2021
```
* enabling using past_key_values together with labels when training in T5ForConditionalGeneration

* test

* Enable past_key_values in T5ForConditionalGeneration while training.

* delete comments
```
  aea7c5b0
- Fixing Backward compatibility for zero-shot (#13855) · 013bdc6d
  Nicolas Patry authored Oct 06, 2021
```
Fixes #13846
```
  013bdc6d
- Replace assert statements with exceptions (#13871) · 9f58becc
  David del Río Medina authored Oct 06, 2021
  
  9f58becc
- Fixing GPU for token-classification in a better way. (#13856) · e7b16f33
  Nicolas Patry authored Oct 06, 2021
```
Co-authored-by: Pierre Snell <pierre.snell@botpress.com>
```
  e7b16f33
05 Oct, 2021 9 commits
- fix: replace asserts by error (#13894) · 7af7d7ce
  Siarhei Melnik authored Oct 06, 2021
  
  7af7d7ce
- fix(integrations): consider test metrics (#13888) · f099249c
  Boris Dayma authored Oct 05, 2021
  
  f099249c
- Fixing question-answering with long contexts (#13873) · 0ddadbf0
  Nicolas Patry authored Oct 05, 2021
```
* Tmp.

* Fixing BC for question answering with long context.

* Capping model_max_length to avoid tf overflow.

* Bad workaround bugged roberta.

* Fixing name.
```
  0ddadbf0
- Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler (#13820) · 1b74af76
  Zhaofeng Wu authored Oct 05, 2021
```
* Allow dataset to be an optional argument for (Distributed)LengthGroupedSampler

* Fix
```
  1b74af76
- Initial support for symbolic tracing with torch.fx allowing dynamic axes (#13579) · d4e4efce
  Michael Benayoun authored Oct 05, 2021
```
* Symbolic trace dynamic axes support for BERT like models (albert, bert, distilbert, mobilebert, electra, megatron-bert)
* Sanity checks before tracing that make sure the model to trace is supported
* Adapted to PyTorch 1.9
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
  d4e4efce
- Improve error message when loading models from Hub (#13836) · 46efc580
  Alex Hedges authored Oct 05, 2021
```
* Improve error message when loading models from Hub

* Adjust error message wording
```
  46efc580
- Fixing empty prompts for text-generation when BOS exists. (#13859) · 3a9c0f23
  Nicolas Patry authored Oct 05, 2021
```
* Fixing empty prompts for text-generation when BOS exists.

* Fixing odd case with Pegasus.

* Fixing Bert is Assertion Error.
```
  3a9c0f23
- Fixing 1-length special tokens cut. (#13862) · 7079a99e
  Nicolas Patry authored Oct 05, 2021
  
  7079a99e
- Update Tatoeba conversion (#13757) · 7051b892
  Sam Hardwick authored Oct 05, 2021
```
* Update Tatoeba conversion
```
  7051b892
04 Oct, 2021 3 commits

Update no_* argument (HfArgumentParser) (#13865) · 12b4d66a

Bram Vanroy authored Oct 04, 2021

* update no_* argument

Changes the order so that the no_* argument is created after the original argument AND sets the default for this no_* argument to False

* import copy

* update test

* make style

* Use kwargs to set default=False

* make style

12b4d66a
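
The ordering described in this commit message can be sketched with plain `argparse`. This is an illustration under stated assumptions, not the `HfArgumentParser` code: the flag name `use_cache` and the helpers `str_to_bool` and `build_parser` are hypothetical.

```python
import argparse


def str_to_bool(value):
    """Parse common textual booleans (hypothetical helper)."""
    if isinstance(value, bool):
        return value
    lowered = value.lower()
    if lowered in ("yes", "true", "t", "y", "1"):
        return True
    if lowered in ("no", "false", "f", "n", "0"):
        return False
    raise argparse.ArgumentTypeError(f"Expected a boolean value, got {value!r}.")


def build_parser():
    parser = argparse.ArgumentParser()
    # The original argument is created first, with its real default (True).
    parser.add_argument("--use_cache", type=str_to_bool, nargs="?", const=True, default=True)
    # The complementary no_* flag is created afterwards, writes to the same
    # destination, and is given default=False as the commit message describes.
    parser.add_argument("--no_use_cache", action="store_false", dest="use_cache", default=False)
    return parser
```

In plain `argparse`, when two actions share a destination, the default of the action registered first is the one applied when neither flag is passed, so `build_parser().parse_args([]).use_cache` stays `True` while `--no_use_cache` switches it to `False`.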

Add Mistral GPT-2 Stability Tweaks (#13573) · 3a8de58c

Sidd Karamcheti authored Oct 04, 2021



* Add layer-wise scaling

* Add reorder & upcasting argument

* Add OpenAI GPT-2 weight initialization scheme

* start `layer_idx` count at zero for consistency

* disentangle attn and reordered and upscaled attn function

* rename `scale_attn_by_layer` to `scale_attn_by_layer_id`

* make autocast from amp compatible with pytorch<1.6

* fix docstring

* style fixes

* Add fixes from PR feedback, style tweaks

* Fix doc whitespace

* Reformat

* First pass scale_attn_by_layer_idx and reorder_and_upcast_attn tests

* Rename scale_attn_by_layer_idx, add tip

* Remove extra newline

* add test for weight initialization

* update code format

* add assert check weights are fp32

* remove assert

* Fix incorrect merge

* Fix shape mismatch in baddbmm

* Add generation test for Mistral flags
Co-authored-by: leandro <leandro.vonwerra@spoud.io>
Co-authored-by: Keshav Santhanam <keshav2@stanford.edu>
Co-authored-by: J38 <jebolton@stanford.edu>

3a8de58c

Delete convert_multiberts_checkpoint_to_pytorch.py (#13852) · de948350
Gunjan Chhablani authored Oct 04, 2021

de948350