Commits · 2a7e8e1608aae8b5719a33c341401756e6e8897c · chenpangpang / transformers

15 Dec, 2020 15 commits

[Examples] Add automatic dataset splitting in language-modeling examples (#9133) · 2a7e8e16

Teven authored Dec 15, 2020

* replaced jnp.split + removing textual model inputs + ensuring warmup_steps > 0

* Add automatic dataset splitting in language-modeling examples
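
The automatic splitting named in this commit can be pictured as holding out part of the training data when a dataset ships without a validation set. The sketch below is a plain-Python illustration of that idea, not the code from #9133; the function name `split_train_validation` and the `validation_percentage` parameter are assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def split_train_validation(
    examples: Sequence[str], validation_percentage: int = 5
) -> Tuple[List[str], List[str]]:
    """Hold out the first `validation_percentage` percent of examples (hypothetical helper)."""
    if not 0 < validation_percentage < 100:
        raise ValueError("validation_percentage must be between 1 and 99")
    # Integer arithmetic keeps the cut point deterministic.
    n_validation = len(examples) * validation_percentage // 100
    validation = list(examples[:n_validation])
    train = list(examples[n_validation:])
    return train, validation


lines = [f"sentence {i}" for i in range(100)]
train, validation = split_train_validation(lines, validation_percentage=5)
print(len(train), len(validation))
```

Slicing from the front rather than sampling at random keeps the split reproducible across runs, at the cost of ignoring any ordering bias in the data.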

Fix add order (#9129) · e7717497
Julien Plu authored Dec 15, 2020

Fix Bart Shift (#9135) · 18ecd36f
Patrick von Platen authored Dec 15, 2020
```
* correct mistake in order

* fix tensor copy

* clone tensor correctly
```
correct mistake in order (#9134) · d018622d
Patrick von Platen authored Dec 15, 2020

fix bart loss masking (#9131) · 80bdb9c3
Patrick von Platen authored Dec 15, 2020

Fix typo in trainer_tf.py (#9132) · 3caba8d3
Manbish authored Dec 15, 2020


[TF Bart] Refactor TFBart (#9029) · abc573f5

Patrick von Platen authored Dec 15, 2020

* reorder file

* delete unnecessary function

* make style

* save intermediate

* fix attention masks

* correct tf bart past key values

* solve merge conflict bug

* correct tensor dims

* save intermediate tf

* change attn layer

* fix typo re-order past

* inputs_embeds

* make fix copies

* finish tests

* fix graph mode

* apply Lysandre's suggestions

Added TF OpenAi GPT1 Sequence Classification (#9105) · 389aba34

sandip authored Dec 15, 2020



* TF OpenAI GPT Sequence Classification

* Update src/transformers/models/openai/modeling_tf_openai.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

Fix tf2.4 (#9120) · ef2d4cd4

Julien Plu authored Dec 15, 2020



* Fix tests for TF 2.4

* Remove <2.4 limitation

* Add version condition

* Update tests/test_optimization_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_optimization_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_optimization_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Fix T5 model parallel tes (#9107) · 6ccea048
Lysandre Debut authored Dec 15, 2020
```
k
```
Fix stack overflow (#9114) · 59da3f27
Lysandre Debut authored Dec 15, 2020

native amp leak fix landed in 1.7.1 (#9115) · 14c79c3e
Stas Bekman authored Dec 15, 2020
```
update README with good news that the leak fix has been applied to pytorch-1.7.1.
```

Clarify use of TrainingArguments.disable_tqdm in Jupyter Notebooks (#9076) · ed1845ef

lewtun authored Dec 15, 2020



* Clarify impact of disable_tqdm on Jupyter Notebooks

* Add weblink to argparse

* Replace "dev set" with more common "validation set" in do_eval

* Tweak prediction_loss_only

* Tweak description of Adam hyperparameters

* Add weblink to TensorBoard

* Capitalise apex

* Tweak local_rank description

* Add weblink for wandb

* Replace nlp with datasets

* Tweak grammar in model_parallel

* Capitalise apex

* Update TensorFlow training args to match PyTorch ones

* Fix style

* Fix underscore in weblink
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix underscore in weblink
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix underscore in weblink
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix underscore in weblink
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add obj to datasets.Dataset
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fix a bug in eval_batch_retrieval (#9089) · 44c340f4
Yoshitomo Matsubara authored Dec 15, 2020


[finetune_trainer] enhancements and fixes (#9042) · c19d0462

Stas Bekman authored Dec 14, 2020



* trainer and finetune_trainer enhancements and fixes

* add fallback default

* move the fixing of incorrect keys back into finetune trainer

* s/eval/val/ to match the split

* trainer can now use a different prefix than eval_ for metrics

* document new arg

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* use 'eval' as the default for metric_key_prefix

* complete adjust var names + disambiguate

* fix logger

* add clarifying comment

* add clarifying comment

* style

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/trainer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* complete removal of optional for metric_key_prefix

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
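
The bullets above describe letting the trainer report metrics under a prefix other than `eval_`, with `'eval'` as the default for `metric_key_prefix`. The following is a minimal sketch of what such prefixing could look like, not the code merged in #9042; the helper name `prefix_metrics` is hypothetical.

```python
from typing import Dict


def prefix_metrics(
    metrics: Dict[str, float], metric_key_prefix: str = "eval"
) -> Dict[str, float]:
    """Return a copy of `metrics` whose keys all start with `<prefix>_` (hypothetical helper)."""
    prefixed = {}
    for key, value in metrics.items():
        if key.startswith(f"{metric_key_prefix}_"):
            # Already prefixed: keep the key unchanged.
            prefixed[key] = value
        else:
            prefixed[f"{metric_key_prefix}_{key}"] = value
    return prefixed


print(prefix_metrics({"loss": 0.5}))
print(prefix_metrics({"loss": 0.5, "val_bleu": 20.0}, metric_key_prefix="val"))
```

A configurable prefix lets metrics from different splits (for example `val` and `test`) share one log without their keys colliding.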

14 Dec, 2020 10 commits

Also pin TF CPU · 251eb70c
Sylvain Gugger authored Dec 14, 2020

Pin TF to < 2.4 · e4ef57a9
Sylvain Gugger authored Dec 14, 2020


Fix T5 and BART for TF (#9063) · df3f4d2a

Julien Plu authored Dec 14, 2020

* Fix T5 for graph compilation+execution

* Fix BART

* Fix import

* Fix naming

* fix attribute name

* Oops

* fix import

* fix tests

* fix tests

* Update test

* Add missing import

* Address Patrick's comments

* Style

* Address Patrick's comment

Add parallelization support for T5EncoderModel (#9082) · a9c8bff7

Ahmed Elnaggar authored Dec 14, 2020



* add model parallelism to T5EncoderModel

* remove decoder from T5EncoderModel parallelize

* update T5EncoderModel docs

* Extend T5ModelTest for T5EncoderModel

* fix T5Stack using range for get_device_map

* fix style
Co-authored-by: Ahmed Elnaggar <elnaggar@rostlab.informatik.tu-muenchen.de>
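
Model parallelism of this kind relies on a device map that assigns each layer index to a device. The sketch below shows one plausible way to build such a map by splitting the layers into contiguous, near-equal chunks. It is an illustration only: `build_device_map` is a hypothetical name, and the chunking rule is an assumption rather than a description of the library's `get_device_map`.

```python
from math import ceil
from typing import Dict, List, Sequence


def build_device_map(n_layers: int, devices: Sequence[int]) -> Dict[int, List[int]]:
    """Assign contiguous blocks of layer indices to devices (hypothetical helper)."""
    if not devices:
        raise ValueError("at least one device is required")
    layers = list(range(n_layers))
    # Each device receives at most `per_device` consecutive layers.
    per_device = ceil(n_layers / len(devices))
    chunks = [layers[i : i + per_device] for i in range(0, n_layers, per_device)]
    return dict(zip(devices, chunks))


print(build_device_map(6, [0, 1]))
print(build_device_map(5, [0, 1]))
```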

Testing Experimental CI Features (#9070) · b00eb4fb
Stas Bekman authored Dec 14, 2020

Fixed a broken link in documentation (#9101) · 74daf1f9
Simon Brandeis authored Dec 14, 2020

correct var name in TrainingArguments docstring (#9096) · d6af344c
Navjot authored Dec 14, 2020

[RAG, Bart] Align RAG, Bart cache with T5 and other models of transformers (#9098) · fa1ddced
Patrick von Platen authored Dec 14, 2020
```
* fix rag

* fix slow test

* fix past in bart
```
Patch *ForCausalLM model (#9092) · 6587cf9f
Lysandre Debut authored Dec 14, 2020


Fix embeddings resizing in TF models (#8657) · 51d9c569

Julien Plu authored Dec 14, 2020

* Resize the biases at the same time as the embeddings

* Trigger CI

* Biases are not reset anymore

* Remove get_output_embeddings + better LM model detection in generation utils

* Apply style

* First test on BERT

* Update docstring + new name

* Apply the new resizing logic to all the models

* fix tests

* Apply style

* Update the template

* Fix naming

* Fix naming

* Apply style

* Apply style

* Remove unused import

* Revert get_output_embeddings

* Trigger CI

* Update num parameters

* Restore get_output_embeddings in TFPretrainedModel and add comments

* Style

* Add decoder resizing

* Style

* Fix tests

* Separate bias and decoder resize

* Fix tests

* Fix tests

* Apply style

* Add bias resizing in MPNet

* Trigger CI

* Apply style
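
The bullets above mention resizing the biases together with the embeddings and no longer resetting them. A minimal sketch of that idea in plain Python follows, assuming new entries are zero-initialized (an assumption for this example, not a statement about the merged code). The function name `resize_embeddings_and_bias` is hypothetical.

```python
from typing import List, Tuple


def resize_embeddings_and_bias(
    embeddings: List[List[float]], bias: List[float], new_vocab_size: int
) -> Tuple[List[List[float]], List[float]]:
    """Resize both tensors to `new_vocab_size`, keeping existing values (hypothetical helper)."""
    if len(embeddings) != len(bias):
        raise ValueError("embeddings and bias must cover the same vocabulary")
    if new_vocab_size < 0:
        raise ValueError("new_vocab_size must be non-negative")
    hidden_size = len(embeddings[0]) if embeddings else 0
    # Keep the overlapping rows and bias entries instead of re-initializing them.
    kept_rows = [list(row) for row in embeddings[:new_vocab_size]]
    kept_bias = list(bias[:new_vocab_size])
    n_new = new_vocab_size - len(kept_rows)
    # Assumption: entries for new tokens start at zero.
    new_rows = kept_rows + [[0.0] * hidden_size for _ in range(n_new)]
    new_bias = kept_bias + [0.0] * n_new
    return new_rows, new_bias


rows, b = resize_embeddings_and_bias([[0.1, 0.2], [0.3, 0.4]], [1.0, 2.0], 3)
print(rows, b)
```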

11 Dec, 2020 15 commits

[model_cards] Migrate cards from this repo to model repos on huggingface.co (#9013) · 3552d0e0

Julien Chaumond authored Dec 12, 2020



* rm all model cards

* Update the .rst

@sgugger it is still not super crystal clear/streamlined so let me know if any ideas to make it simpler

* Add a rootlevel README.md with simple instructions/context

* Update docs/source/model_sharing.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* make style

* rm all model cards
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

Fix min_null_pred in the run_qa script (#9067) · 29e45979
Sylvain Gugger authored Dec 11, 2020

Make ProphetNetModel really compatible with EncoderDecoder (#9033) · 9cc9f412
Patrick von Platen authored Dec 11, 2020
```
* improve

* finish

* upload model

* fix lm head

* fix test
```
Bump notebook in /examples/research_projects/movement-pruning/lxmert (#9062) · 24f6cdea

dependabot[bot] authored Dec 11, 2020

Bumps [notebook](https://github.com/jupyter/jupyterhub) from 6.1.4 to 6.1.5.
- [Release notes](https://github.com/jupyter/jupyterhub/releases)
- [Changelog](https://github.com/jupyterhub/jupyterhub/blob/master/CHECKLIST-Release.md)
- [Commits](https://github.com/jupyter/jupyterhub/commits)
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

Remove docs only check (#9065) · 91fa7072
Lysandre Debut authored Dec 11, 2020

Fix PreTrainedTokenizer.pad when first inputs are empty (#9018) · 70527ba6
Sylvain Gugger authored Dec 11, 2020
```
* Fix PreTrainedTokenizer.pad when first inputs are empty

* Handle empty inputs case
```
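
A padding routine that inspects the first input to decide what kind of data it holds can fail when that input is empty. One way to guard against this, sketched below under the assumption that the batch is a list of token-id lists, is to scan for the first non-empty entry and treat an all-empty batch as its own case. The helper `first_non_empty_element` is hypothetical and is not taken from #9018.

```python
from typing import List, Optional


def first_non_empty_element(batch: List[List[int]]) -> Optional[int]:
    """Return the first element of the first non-empty sequence, or None (hypothetical helper)."""
    for sequence in batch:
        if len(sequence) > 0:
            return sequence[0]
    # Every sequence is empty: the caller has to handle this case explicitly.
    return None


print(first_non_empty_element([[], [], [101, 102]]))
print(first_non_empty_element([[], []]))
```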

Reorganize examples (#9010) · 783d7d26

Sylvain Gugger authored Dec 11, 2020



* Reorganize example folder

* Continue reorganization

* Change requirements for tests

* Final cleanup

* Finish regroup with tests all passing

* Copyright

* Requirements and readme

* Make a full link for the documentation

* Address review comments

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Add symlink

* Reorg again

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Adapt title

* Update to new structure

* Remove test

* Update READMEs
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

update tatoeba workflow (#9051) · 86896de0
Suraj Patil authored Dec 11, 2020


Create README.md (#8096) · 7c8f5f64

Ganesh Kharad authored Dec 11, 2020



* Create README.md

* Fix model card
Co-authored-by: Julien Chaumond <julien@huggingface.co>

Create README.md (#8281) · 5527f787

RamonMamon authored Dec 11, 2020



* Create README.md

* Update model_cards/kiri-ai/distiluse-base-multilingual-cased-et/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Create README.md (#8751) · c615df74

joangines authored Dec 11, 2020



* Create README.md

* Update model_cards/Cinnamon/electra-small-japanese-generator/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

QARiB Arabic and dialects models (#8796) · 76df5593

Ahmed Abdelali authored Dec 11, 2020



* Add QARiB models

* fix README.md

* Fix README.md

* Fix README.md

* Fix README.md

* Fix QARiB files

* add models card for QARiB models 860k, 1790k, and 1970k

* try to fix PR

* re-add files

* links aren't allowed here :)
Co-authored-by: Ahmed Abdelali <aabdelali@hbku.edu.qa>
Co-authored-by: Julien Chaumond <julien@huggingface.co>

Update README.md (#8820) · b161f1ae
moniquebm authored Dec 11, 2020


Initial README for `t5-base-indonesian-summarization-cased` model (#9028) · 649d389d

Panggi Libersa Jasri Akadol authored Dec 11, 2020

* Create README.md

Initial README for `t5-base-indonesian-summarization-cased` model

* Update README for t5-base-indonesian-summarization-cased

Typo in README, change from `small` to `base`

Create README.md (#9030) · 5e794b66
Panggi Libersa Jasri Akadol authored Dec 11, 2020
```
Initial README for `t5-small-indonesian-summarization-cased` model
```