Commits · 49bee0aea44ef29c08d48f818f356275ef223da8 · chenpangpang / transformers

08 Jun, 2021 7 commits

Add torch to requirements.txt in language-modeling (#12040) · 49bee0ae

cdleong authored Jun 08, 2021



* Add torch to requirements.txt in language-modeling

* Update examples/pytorch/language-modeling/requirements.txt
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

49bee0ae

Replace legacy tensor.Tensor with torch.tensor/torch.empty (#12027) · f5eec0d8
Mario Šaško authored Jun 08, 2021
```
* Replace legacy torch.Tensor constructor with torch.{tensor, empty}

* Remove torch.Tensor in examples
```
f5eec0d8

updated the original RAG implementation to be compatible with latest Pytorch-Lightning (#11806) · e33085d6

Shamane Siri authored Jun 09, 2021

* updated the original RAG implementation to be compatible with the latest PL version

* updated the requirements.txt file

* execute make style

* code quality test

* code quality

* conflix resolved in requirement.txt

* code quality

* changed the MyDDP class name to CustomDDP

e33085d6

Fix tapas issue (#12063) · 70f88eec

NielsRogge authored Jun 08, 2021

* Fix scatter function to be compatible with torch-scatter 2.7.0

* Allow test again

70f88eec

Fix integration tests (#12066) · e56e3140
NielsRogge authored Jun 08, 2021

e56e3140
skip failing test (#12059) · 4abc6dd6
Stas Bekman authored Jun 07, 2021

4abc6dd6
adds metric prefix. (#12057) · e363e1d9
Russell Klopfer authored Jun 07, 2021
```
* adds metric prefix.

* update tests to include prefix
```
e363e1d9

07 Jun, 2021 7 commits

Add optional grouped parsers description to HfArgumentParser (#12042) · 8994c1e4
Peter Izsak authored Jun 07, 2021
```
* Adding optional argument group to HfArgumentParser

* Minor

* remove whitespace

* Minor styling
```
8994c1e4

Extend pipelines for automodel tupels (#12025) · 2056f26e

Nicolas Patry authored Jun 07, 2021



* fix_torch_device_generate_test

* remove @

* finish

* refactor

* add test

* fix test

* Attempt at simplification.

* Small fix.

* Fixing non existing AutoModel for TF.

* Naming.

* Remove extra condition.
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

2056f26e

Fixes bug that appears when using QA bert and distilation. (#12026) · f8bd8c6c

François Lagunas authored Jun 07, 2021

* Fixing bug that appears when using distilation (and potentially other uses).
During backward pass Pytorch complains with:
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
This happens because the QA model code modifies the start_positions and end_positions input tensors, using clamp_ function: as a consequence the teacher and the student both modifies the inputs, and backward pass fails.

* Fixing all models QA clamp_ bug.

f8bd8c6c

[JAX] Bump jax lib (#12053) · 59f75d53
Patrick von Platen authored Jun 07, 2021
```
* fix_torch_device_generate_test

* remove @

* bump up jax lib
```
59f75d53
fix docs of past_key_values (#12049) · 185122ef
Suraj Patil authored Jun 07, 2021

185122ef
fix deberta 2 tokenizer integration test (#12017) · 3857f2b4
Philip May authored Jun 07, 2021

3857f2b4
Fixed Typo in modeling_bart.py (#12035) · 20b6f3b8
Shiva Pundir authored Jun 07, 2021
```
* Fixed Typo in modeling_bart.py - Issue #11895

* Fixed Typo in modeling_bart.py
```
20b6f3b8

04 Jun, 2021 2 commits

[TrainerArguments] format and sort __repr__, add __str__ (#12018) · 1f335aef
Stas Bekman authored Jun 04, 2021
```
* format and sort __repr__, add __str__

* typo

* use __str__ directly

* alias __repr__ = __str__
```
1f335aef

[Deepspeed] Assert on mismatches between ds and hf args (#12021) · 2c73b930

Stas Bekman authored Jun 04, 2021



* wip

* add mismatch validation + test

* renames

* Update docs/source/main_classes/deepspeed.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* renames
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

2c73b930

03 Jun, 2021 2 commits

[Flax] Refactor MLM (#12013) · 242ec31a

Patrick von Platen authored Jun 03, 2021



* fix_torch_device_generate_test

* remove @

* finish refactor
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

242ec31a

Fix weight decay masking in `run_flax_glue.py` (#11964) · 4674061b

Nicholas Vadivelu authored Jun 03, 2021



* Fix weight decay masking in `run_flax_glue.py`

Issues with the previous implementation:
- The `dict` from `traverse_util.flatten_dict` has keys which are tuples of strings, not one long string with the path separated by periods.
- `optax.masked` applies the transformation wherever the mask is True, so the masks are flipped.
- Flax's LayerNorm calls the scale parameter `scale` not `weight`

* Fix formatting with black

* adapt results
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

4674061b

02 Jun, 2021 8 commits

[deepspeed] add nvme test skip rule (#11997) · 61c50634
Stas Bekman authored Jun 02, 2021
```
* add nvme skip rule

* fix
```
61c50634
[deepspeed] Move code and doc into standalone files (#11984) · 640318be
Stas Bekman authored Jun 02, 2021
```
* move code and docs

* style

* moved

* restore
```
640318be

Update return introduction (#11976) · d6d747cb

Kou Yong Kang authored Jun 03, 2021

Make it clear that the `forward` method now returns a dict instead of tuple.

Fix style

d6d747cb

[docs] fix xref to `PreTrainedModel.generate` (#11049) · d406a272
Stas Bekman authored Jun 02, 2021
```
* fix xref to generate

* do the same for search methods

* style

* style
```
d406a272
Fix examples (#11990) · 123b597f
Gunjan Chhablani authored Jun 02, 2021

123b597f

VisualBERT (#10534) · 88ca6a23

Gunjan Chhablani authored Jun 02, 2021



* Init VisualBERT

* Add cookie-cutter, Config, and Embeddings

* Add preliminary Model

* Add Bert analogous classes

* Add basic code for NLVR, VQA, Flickr

* Update Init

* Fix VisualBert Downstream Models

* Rename classifier to cls

* Comment position_ids buffer

* Remove sentence image predictor output

* Update output dicts

* Remove unnecessary files

* Fix Auto Modeling

* Fix transformers init

* Add conversion script

* Add conversion script

* Fix docs

* Update visualbert modelling

* Update configuration

* Style fixes

* Add model and integration tests

* Add all tests

* Update model mapping

* Add simple detector from original repository

* Update docs and configs

* Fix style

* Fix style

* Update docs

* Fix style

* Fix import issues in style

* Fix style

* Add changes from review

* Fix style

* Fix style

* Update docs

* Fix style

* Fix style

* Update docs/source/model_doc/visual_bert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/test_modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add changes from review

* Remove convert run script

* Add changes from review

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/visual_bert/modeling_visual_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add changes from review

* Add changes from review

* Add visual embedding example in docs

* Fix "copied from" comments

* Add changes from review

* Fix error, style, checkpoints

* Update docs

* Fix integration tests

* Fix style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

88ca6a23

[RAG] Fix rag from pretrained question encoder generator behavior (#11962) · 43f46aa7
Patrick von Platen authored Jun 02, 2021
```
* fix_torch_device_generate_test

* remove @

* fix rag from pretrained loading

* add test

* uplaod

* finish
```
43f46aa7

Bump urllib3 from 1.25.8 to 1.26.5 in /examples/research_projects/lxmert (#11983) · 6db3a87d

dependabot[bot] authored Jun 02, 2021

Bumps [urllib3](https://github.com/urllib3/urllib3) from 1.25.8 to 1.26.5.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/1.25.8...1.26.5

)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

6db3a87d

01 Jun, 2021 14 commits

[Trainer] add train loss and flops metrics reports (#11980) · 4ba203d9

Stas Bekman authored Jun 01, 2021

* add train loss and flops metrics reports

* consistency

* add train_loss to skip keys

* restore on_train_end call timing

4ba203d9

[DeepSpeed] decouple `DeepSpeedConfigHF` from `Trainer` (#11966) · 7ec596ec

Stas Bekman authored Jun 01, 2021



* decouple DeepSpeedConfigHF from Trainer

* add LoggingLevel ctx manager; add new test

* cleanup

* add docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* implemented suggested renames

* formatter workaround
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7ec596ec

Typo in usage example, changed to device instead of torch_device (#11979) · 1c3ab3e5
Alberto Villa authored Jun 01, 2021

1c3ab3e5

ByT5 model (#11971) · 47a98fc4

Patrick von Platen authored Jun 01, 2021



* allow tf to use uneven num of layers

* add tokenizer

* finish docs

* finish docs

* Apply suggestions from code review

* include in index

* finish

* Update docs/source/model_doc/byt5.rst
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* apply sylvais suggestions

* make style
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

47a98fc4

typo correction (#11973) · 1eb58b45
Jeoung-Minju authored Jun 02, 2021
```
* typo correction

* type corrections
```
1eb58b45
[deepspeed] docs (#11940) · 79712e7e
Stas Bekman authored Jun 01, 2021
```
* deepspeed docs

* cleanup

* cleanup
```
79712e7e
Run the integration tests on schedule tests instead of master tests · 985d7088
Lysandre authored Jun 01, 2021

985d7088

Neptune.ai integration (#11937) · 9996558b

Volodymyr Byno authored Jun 01, 2021

An option that turns on neptune.ai logging
--report_to 'neptune'

Additional ENV variables:
	NEPTUNE_PROJECT
	NEPTUNE_API_TOKEN
	NEPTUNE_RUN_NAME (optional)
	NEPTUNE_STOP_TIMEOUT (optional)

9996558b

Authorize args when instantiating an AutoModel (#11956) · ae6ce28f
Lysandre Debut authored Jun 01, 2021

ae6ce28f

Add regression tests for slow sentencepiece tokenizers. (#11737) · fcad8018

Philip May authored Jun 01, 2021

* add test_vocab_size for sentencepiece tok.

* add test_get_vocab for sentencepiece tok.

* add test_convert_token_and_id for sentencepiece tok.

* add test_tokenize_and_convert_tokens_to_string for all tok.

* improve test_tokenize_and_convert_tokens_to_string for sp. tok.

* add common tokenizer integration tests
- for albert
- for barthez

* add tokenizer integration tests to bert gen.

* add most tokenizer integration tests

* fix camembert tokenizer integration test

* add tokenizer integration test to marian

* add tokenizer integration test to reformer

* add typing and doc to tokenizer_integration_test_util

* fix tokenizer integration test of reformer

* improve test_sentencepiece_tokenize_and_convert_tokens_to_string

* empty commit to trigger CI

* fix tokenizer integration test of reformer

* remove code not needed anymore

* empty commit to trigger CI

* empty commit to trigger CI

fcad8018

reinitialize wandb config for each hyperparameter search run (#11945) · c3d958b2
Josh Tanner authored Jun 01, 2021

c3d958b2

bugfixes training_args.py (#11922) · 99dbbdb9

Riccardo Bassani authored Jun 01, 2021

modified according to:
https://pytorch.org/xla/release/1.8.1/_modules/torch_xla/core/xla_model.html

99dbbdb9

modify qa-trainer (#11872) · 7e73601f
Fan Zhang authored Jun 01, 2021
```
* modify qa-trainer

* fix flax model
```
7e73601f

RAG-2nd2end-revamp (#11893) · 9ec0f01b

Shamane Siri authored Jun 01, 2021



* initial

* code quality test

* code quality

* added test functions in test_modeling_rag.py and test_retrieval_rag.py to test end2end retreiver

* minor change in test_modeling_rag

* fixed tests

* Update examples/research_projects/rag-end2end-retriever/README.md

typo corrected as suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update examples/research_projects/rag-end2end-retriever/finetune_rag.py

type change suggested by lhoestq
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* Update src/transformers/models/rag/retrieval_rag.py

Adding this change as mentioned by lhoestq.
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

* completed the minor changes suggested by the reviewers
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>

9ec0f01b