- 30 Apr, 2021 6 commits
- Manuel Romero authored
  Add link to code
- Patrick von Platen authored
- Philip May authored
  * add test for pickle
  * simplify test
  * fix test code style
  * add missing pickle import
  * fix test
- Frederik Bode authored
  Co-authored-by: Frederik Bode <frederik@paperbox.ai>
- Lysandre Debut authored
- Lysandre authored
- 29 Apr, 2021 4 commits
- Sylvain Gugger authored
  * Split checkpoint from model_name_or_path in examples
  * Address review comments
- Michael Benayoun authored
  Co-authored-by: Michael Benayoun <michael@huggingface.co>
- Sylvain Gugger authored
  * Reformat to make code clearer
- Patrick von Platen authored
  * add attentions & hidden states
  * add model outputs + docs
  * finish docs
  * finish tests
  * finish impl
  * del @
  * finish
  * correct test
  * apply Sylvain's suggestions
  * Update src/transformers/models/bert/modeling_flax_bert.py
  * simplify more
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- 28 Apr, 2021 3 commits
- Hamel Husain authored
- Hamel Husain authored
  * Update tokenization_utils_base.py
  * add assertion
  * check batch len
  * Update src/transformers/tokenization_utils_base.py
  * add error message
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Sylvain Gugger authored
  * Update min versions in README and add Flax
  * Adapt index
- 27 Apr, 2021 3 commits
- Suraj Patil authored
  * fix docs for decoder_input_ids
  * revert the changes for bart and mbart
- Hamel Husain authored
  * finish quicktour
  * fix import
  * fix print
  * explain config default better
  * Update docs/source/quicktour.rst
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Hamel Husain authored
  * update docs to reflect model output object
  * run `make style`
- 26 Apr, 2021 20 commits
- Ashwin Geet D'Sa authored
  * removed max_len
  * removed max_length from BeamSearchScorer
  * correct max length
  * finish
  * del vim
  * finish & add test
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
- Stas Bekman authored
  * adding Z-inf
  * revamp config process
  * up version requirement
  * wip
  * massive rewrite
  * cleanup
  * Apply suggestions from code review
  * consistent json commas
  * act on suggestions
  * leave this feature for 0.3.16
  * style
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Jaimeen Ahn authored
  The error came from an inconsistency between the variable holding the number of GPUs in the parser ('gpus') and its actual name in the train.py script ('n_gpu'); the correction makes the example work.
- Bhadresh Savani authored
  * added changes for uniformity
  * modified files
  * corrected typo
  * fixed qa scripts
  * fix typos
  * fixed predict typo in qa no trainer
  * fixed test file
  * reverted trainer changes
  * reverted trainer changes in custom examples
  * updated readme
  * added changes in deepspeed test
  * added changes for predict and eval
- Sylvain Gugger authored
- Sylvain Gugger authored
- Stas Bekman authored
- Stas Bekman authored
  * fix invalid class name
  * proper ref
- Kostas Stathoulopoulos authored
  * Improve documentation for is_split_into_words argument
  * Change description wording
- Sylvain Gugger authored
  * Pass along seed to DistributedSampler
  * Add seed to DistributedLengthGroupedSampler
- LSinev authored
- Amine Abdaoui authored
- Sylvain Gugger authored
  * Add FP16 support for SageMaker MP
  * Add print debugs
  * Squeeze
  * Remove debug statements
  * Add defensive check
  * Typo
- Daniel Stancl authored
  TF BART models - Add `cross_attentions` to model output and fix cross-attention head masking (#10699)
  * Add cross_attn_head_mask to BART
  * Fix cross_attentions in TFBart-like models
  * This commit enables returning of `cross_attentions` for TFBart-like models
  * It also fixes attention head masking in the cross-attention module
  * Update TF model templates
  * Fix missing , in TF model templates
  * Fix typo: congig -> config
- Sylvain Gugger authored
- Sylvain Gugger authored
- Sylvain Gugger authored
- Patrick von Platen authored
- Vasudev Gupta authored
- abiolaTresor authored
- 25 Apr, 2021 2 commits
- cronoik authored
  * removes the creation of separate config objects and uses the existing ones instead + overwrites resize_token_embeddings from the parent class because it is not working for the EncoderDecoderModel
  * rollback to current version of the huggingface master branch
  * reworked version that ties the encoder and decoder config of the parent encoderdecoder instance
  * overwrite of resize_token_embeddings throws an error now
  * review comment suggestion
  * implemented warning in case encoderdecoder is created with differing configs of encoderdecoderconfig and decoderconfig or encoderconfig
  * added test to avoid diverging configs of wrapper class and wrapped classes
  * Update src/transformers/models/encoder_decoder/modeling_encoder_decoder.py
  * make style
  Co-authored-by: Suraj Patil <surajp815@gmail.com>
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
- Daniel Stancl authored
  * Add head_mask & decoder_head_mask + some corrections
  * Fix head masking for N-grams
  * Enable test_headmasking for encoder and decoder
  * Fix one typo in modeling_prophetnet.py
  * Enable test_headmasking for ProphetNetStandaloneDecoderModelTest and ProphetNetStandaloneEncoderModelTest in test_modeling_prophetnet.py
  * make style
  * Fix cross_head_mask
  * Fix attention head mask naming
  * `cross_head_mask` -> `cross_attn_head_mask`
  * `cross_layer_head_mask` -> `cross_attn_layer_head_mask`
  * Still need to merge #10605 to master to pass the tests
- 24 Apr, 2021 2 commits
- Sylvain Gugger authored
- cronoik authored
  Documentation linked to the parent class PreTrainedTokenizerFast, but it should link to the slow tokenizer (#11410)