Commits · 26a2e365953e22e5d2762830e59226141f99c304 · chenpangpang / transformers

23 Jun, 2021 3 commits

Add output in a dictionary for TF `generate` method (#12139) · 26a2e365

Daniel Stancl authored Jun 23, 2021

* Add output args to greedy search

* Fix critical typo + make style quality

* Handle generate_beam_search

* Add dict_specific tests and fix the placement of encoder outputs

* Add  specific outputs

* Update doc

* Fix typo

* Adjust handling encoder_outputs + Fix generating for T5

* Fix generate for RAG

* Fix handling ouptut_attentions when target_mapping is not None

Take care of situations when target_mapping is provided
as there are 2-tuple of attentions

Change from:
if inputs["output_attentions"]:
    attentions = tuple(tf.transpose(t, perm(2, 3, 0, 1)) for t in attentions)

to:
if inputs["output_attentions"]:
    if inputs["target_mapping"] is not None:
        # when target_mapping is provided, there are 2-tuple of attentions
         attentions = tuple(
             tuple(tf.transpose(attn_stream, perm=(2, 3, 0, 1)) for attn_stream in t) for t in attentions
        )
    else:
        attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions)

* Rename kwargs to model_kwargs

* make style quality

* Move imports in test_modeling_tf_common.py

Move ModelOutput-related imports in test_modeling_tf_common.py
into the `is_tf_available():` statement.

* Rewrite nested if-statements

* Fix added tests

26a2e365

Optimizing away the `fill-mask` pipeline. (#12113) · d4be4984

Nicolas Patry authored Jun 23, 2021



* Optimizing away the `fill-mask` pipeline.

- Don't send anything to the tokenizer unless needed. Vocab check is
much faster
- Keep BC by sending data to the tokenizer when needed. User handling warning messages will see performance benefits again
- Make `targets` and `top_k` work together better `top_k` cannot be
higher than `len(targets)` but can be smaller still.
- Actually simplify the `target_ids` in case of duplicate (it can happen
because we're parsing raw strings)
- Removed useless code to fail on empty strings. It works only if empty
string is in first position, moved to ignoring them instead.
- Changed the related tests as only the tests would fail correctly
(having incorrect value in first position)

* Make tests compatible for 2 different vocabs... (at the price of a
warning).

Co-authored-by: @EtaoinWu

* ValueError working globally

* Update src/transformers/pipelines/fill_mask.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* `tokenizer.vocab` -> `tokenizer.get_vocab()` for more compatiblity +
fallback.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

d4be4984

Add CodeCarbon Integration (#12304) · 037e466b

Kevin Canwen Xu authored Jun 23, 2021

* Add optional dependency

* Add CodeCarbon integration

* Add CodeCarbon integration

* Add CodeCarbon integration

* typo

037e466b

22 Jun, 2021 10 commits

[docs] performance (#12258) · bfd5da8e

Stas Bekman authored Jun 22, 2021



* initial performance document

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* rewrites based on suggestions

* 8x multiple is for AMP only

* add contribute section
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

bfd5da8e

FlaxBartPretrainedModel -> FlaxBartPreTrainedModel (#12313) · 1562c04e
Sylvain Gugger authored Jun 22, 2021

1562c04e
[trainer] 2 bug fixes and a rename (#12309) · ebe54135
Stas Bekman authored Jun 22, 2021
```
* bug fixes and a rename

* add extended DDP test
```
ebe54135

[Flax] Main doc for event orga (#12305) · 64029abe

Patrick von Platen authored Jun 22, 2021

* fix_torch_device_generate_test

* remove @

* push

* finish

* some typos

* add more info on communication

* add suggestions

64029abe

Fix and improve documentation for LEDForConditionalGeneration (#12303) · 032d56a4

Kilian Kluge authored Jun 22, 2021

* Replace conditional generation example (fixes #12268)

* Replace model in summarization example with finetuned checkpoint, adapt example text

* Fix typo in new summarization example

* Fix docstring formatting, add missing import statement to example

032d56a4

add FlaxAutoModelForImageClassification in main init (#12298) · 1498eb98
Suraj Patil authored Jun 22, 2021

1498eb98
trainer_tf: adjust wandb installation command (#12291) · 2affeb29
Stefan Schweter authored Jun 22, 2021

2affeb29

Fix for the issue of device-id getting hardcoded for token_type_ids during Tracing [WIP] (#11252) · af6e01c5

Hamid Shojanazeri authored Jun 22, 2021



* registering a buffer for token_type_ids, to pass the error of device-id getting hardcoded when tracing

* sytle format

* adding persistent flag to the resgitered buffers that prevent from adding them to the state_dict and addresses the Backward compatibility issue

* adding the try catch to the fix as persistent flag is only available from PT >1.6

* adding version check

* added the condition to only use the token_type_ids buffer when its autogenerated not passed by user

* adding comments and making the conidtion where token_type_ids are None to use the registered buffer

* taking out position-embeddding from the if block

* adding comments

* handling the case if buffer for position_ids was not registered

* reverted the changes on position_ids, fix the issue with size of token_type_ids buffer, moved the modification for generated token_type_ids to Bertmodel, instead of Embeddings

* reverting the token_type_ids in case of None to the previous version

* reverting changes on position_ids adding back the if block

* changes added by running make fix-copies

* changes added by running make fix-copies and added the import version as it was getting used

* changes added by running make fix-copies

* changes added by running make fix-copies

* fixing the import format

* fixing the import format

* modified to use temp tensor for trimed and expanded token_type_ids buffer

* changes made by fix-copies after temp tensor modifications

* changes made by fix-copies after temp tensor modifications

* changes made by fix-copies after temp tensor modifications

* clean up

* clean up

* clean up

* clean up

* Nit

* Nit

* Nit

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* changes based on latest in master

* Adapt templates

* Add version import
Co-authored-by: Ubuntu <ubuntu@ip-172-31-32-81.us-west-2.compute.internal>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

af6e01c5

[tests] multiple improvements (#12294) · 0d97ba8a
Stas Bekman authored Jun 21, 2021
```
* [tests] multiple improvements

* cleanup

* style

* todo to investigate

* fix
```
0d97ba8a

[trainer + examples] set log level from CLI (#12276) · dad414d5

Stas Bekman authored Jun 21, 2021



* set log level from CLI

* add log_level_replica + test + extended docs

* cleanup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* rename datasets objects to allow datasets module

* improve the doc

* style

* doc improve
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

dad414d5

21 Jun, 2021 10 commits
- reset report_to to none, avoid deprecation warning (#12293) · a4ed074d
  Stas Bekman authored Jun 21, 2021
  
  a4ed074d
- [Flax] Add jax flax to env command (#12251) · 7ef309ca
  Patrick von Platen authored Jun 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add commands for flax/jax
```
  7ef309ca
- Tensorflow QA example (#12252) · e3cb7a0b
  Matt authored Jun 21, 2021
```
* New Tensorflow QA example!

* Style pass

* Updating README.md for the new example

* flake8 fixes

* Update examples/tensorflow/question-answering/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e3cb7a0b
- [Flax] Fix flax test save pretrained (#12256) · 4e9a6796
  Patrick von Platen authored Jun 21, 2021
```
* fix_torch_device_generate_test

* remove @

* fix flax save pretrained test
```
  4e9a6796
- [DeepSpeed] don't ignore --adafactor (#12257) · b75b5605
  Stas Bekman authored Jun 21, 2021
  
  b75b5605
- [Flax] [WIP] allow loading head model with base model weights (#12255) · eb881674
  Suraj Patil authored Jun 21, 2021
```
* boom boom

* remove flax clip example

* allow loading head model with base model weights

* add test

* fix imports

* disable save, load test for clip

* add test_save_load_to_base
```
  eb881674
- [FlaxClip] fix test from/save pretrained test (#12284) · 8d5b7f36
  Suraj Patil authored Jun 21, 2021
```
* boom boom

* remove flax clip example

* fix from_save_pretrained
```
  8d5b7f36
- Fix for making student ProphetNet for Seq2Seq Distillation (#12130) · b53bc55b
  Vishal Burman authored Jun 21, 2021
```
* make_student.py: fix to make student ProphetNet

* reformat
```
  b53bc55b
- Better CI feedback (#12279) · b76850a8
  Lysandre Debut authored Jun 21, 2021
```
* Better run ID

* Only part of CI

* Revert "Only part of CI"

This reverts commit 29f7f248d21e0f5792e0670ba8705b31ad8967b7.
```
  b76850a8
- Fix the scheduled CI · 30a5521c
  Lysandre authored Jun 21, 2021
  
  30a5521c
18 Jun, 2021 4 commits

[t5 doc] make the example work out of the box (#12239) · 2e5dbdf2

Stas Bekman authored Jun 18, 2021



* [run_clm.py] restore caching

* style

* [t5 doc] make the example work out of the box

This PR expands the training example to include the correct model type for the example to work, e.g. with `T5Model` this example will break.

* Update docs/source/model_doc/t5.rst
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* expand the other example
Co-authored-by: Suraj Patil <surajp815@gmail.com>

2e5dbdf2

Depreciate pythonic Mish and support PyTorch 1.9 version of Mish (#12240) · f3558bbc
Xa9aX ツ authored Jun 18, 2021
```
* Moved Mish to Torch 1.9 version

* Run black formatting
```
f3558bbc
[FlaxBart] few small fixes (#12247) · 47a97683
Suraj Patil authored Jun 18, 2021
```
* boom boom

* remove flax clip example

* few small fixes
```
47a97683
[Flax] FlaxAutoModelForSeq2SeqLM (#12228) · f74655cd
Suraj Patil authored Jun 18, 2021
```
* add FlaxAutoModelForSeq2SeqLM
```
f74655cd

17 Jun, 2021 9 commits
- update desc for map in all examples (#12226) · e43e1126
  Bhavitvya Malik authored Jun 18, 2021
```
* update desc for map in all examples

* added plm

* suggestions
```
  e43e1126
- AutoTokenizer: infer the class from the tokenizer config if possible (#12208) · adb70eda
  Sylvain Gugger authored Jun 17, 2021
```
* AutoTokenizer: infer the class from the tokenizer config if possible

* Add tests

* Update src/transformers/models/auto/tokenization_auto.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  adb70eda
- Docs for v4.8.0 · 0daadc19
  Lysandre authored Jun 17, 2021
  
  0daadc19
- Release: v4.7.0 · 7a6c9fab
  Lysandre authored Jun 17, 2021
  
  7a6c9fab
- fix pt-1.9.0 `add_` deprecation (#12217) · d6ea91c9
  Stas Bekman authored Jun 17, 2021
```
* fix pt-1.9.0 add_ deprecation

* add () for clarity

* Trigger CI

* require_version(torch
```
  d6ea91c9
- Support for torch 1.9.0 (#12224) · 3a960c48
  Lysandre Debut authored Jun 17, 2021
```
* Support for torch 1.9.0

* Torch scatter for 1.9.0

* Github Actions run on 1.9.0
```
  3a960c48
- Add link to the course (#12229) · afdd9e36
  Sylvain Gugger authored Jun 17, 2021
  
  afdd9e36
- Improve detr (#12147) · 29b0aef8
  NielsRogge authored Jun 17, 2021
```
* Remove unused variables

* Improve docs

* Fix docs of segmentation masks
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
```
  29b0aef8
- Pipeline update & tests (#12207) · b56848c8
  Lysandre Debut authored Jun 17, 2021
  
  b56848c8
16 Jun, 2021 4 commits

[Docs] fixed broken link (#12205) · 700cee34

Bhadresh Savani authored Jun 16, 2021



* fixed broken link

* Update docs/source/benchmarks.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/benchmarks.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

700cee34

Use yaml to create metadata (#12185) · 255a17a0
Sylvain Gugger authored Jun 16, 2021
```
* Use yaml to create metadata

* Fix typo

* Remove pin
```
255a17a0
Enabling AutoTokenizer for HubertConfig. (#12198) · 15ef0dc5
Nicolas Patry authored Jun 16, 2021

15ef0dc5
updated DLC images and sample notebooks (#12191) · afa414d0
Philipp Schmid authored Jun 16, 2021

afa414d0