Commits · d4ce31e839d5e5eedcae62d355467df6873c8841 · chenpangpang / transformers

25 Jun, 2021 2 commits
- fixed typo (#12356) · d4ce31e8
  michal pitr authored Jun 25, 2021
  
  d4ce31e8
- Update README.md · aa550c4a
  Patrick von Platen authored Jun 25, 2021
  
  aa550c4a
24 Jun, 2021 5 commits
- Add flax/jax quickstart (#12342) · f2c4ce7e
  Marc van Zee authored Jun 24, 2021
  
  f2c4ce7e
- Document patch release v4.8.1 · 5b1b5635
  Sylvain Gugger authored Jun 24, 2021
  
  5b1b5635
- Fix torchscript tests (#12336) · 8ef62ec9
  Lysandre Debut authored Jun 24, 2021
```
* Fix torchscript tests

* Better test

* Remove bogus print
```
  8ef62ec9
- [examples/Flax] move the examples table up (#12341) · aef3823e
  Suraj Patil authored Jun 24, 2021
  
  aef3823e
- try-this (#12338) · 7875b638
  Richard Liaw authored Jun 24, 2021
```
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
```
  7875b638
23 Jun, 2021 21 commits

Fix default to logging_dir lost in merge conflict · cf3c9198
Sylvain Gugger authored Jun 23, 2021

cf3c9198

[Deepspeed] new docs (#12077) · 07ae6103

Stas Bekman authored Jun 23, 2021



* document sub_group_size

* style

* install + issues reporting

* style

* style

* Update docs/source/main_classes/deepspeed.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* indent 4

* restore

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

07ae6103

Update training_args.py (#12328) · 3694484d

Sam Havens authored Jun 23, 2021

mention in `save_strategy` param description that `load_best_model_at_end` can override

3694484d

v4.9.0.dev0 · 2150dfed
Sylvain Gugger authored Jun 23, 2021

2150dfed
Release: v4.8.0 · 9252a512
Sylvain Gugger authored Jun 23, 2021

9252a512
[Flax T5] Fix weight initialization and fix docs (#12327) · 468cda20
Patrick von Platen authored Jun 23, 2021
```
* finish t5 flax fixes

* improve naming
```
468cda20
Pin good version of huggingface_hub · 12a4457c
Sylvain Gugger authored Jun 23, 2021

12a4457c
changed modeling_fx_utils.py to utils/fx.py for clarity (#12326) · 986ac03e
Michael Benayoun authored Jun 23, 2021
```
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
986ac03e
Temporarily revert the `fill-mask` improvements. · 941b4442
Lysandre authored Jun 23, 2021

941b4442
Conda build (#12323) · 4bdff2cd
Lysandre Debut authored Jun 23, 2021

4bdff2cd
Add all XxxPreTrainedModel to the main init (#12314) · 9eda6b52
Sylvain Gugger authored Jun 23, 2021
```
* Add all XxxPreTrainedModel to the main init

* Add to template

* Add to template bis

* Add FlaxT5
```
9eda6b52

Clean push to hub API (#12187) · 53c60bab

Sylvain Gugger authored Jun 23, 2021



* Clean push to hub API

* Create working dir if it does not exist

* Different tweak

* New API + all models + test Flax

* Adds the Trainer clean up

* Update src/transformers/file_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* (nit) output types

* No need to set clone_from when folder exists

* Update src/transformers/trainer.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Add generated_from_trainer tag

* Update to new version

* Fixes
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

53c60bab

[TFWav2Vec2] Fix docs (#12283) · 625f512d

chenht2010 authored Jun 23, 2021



* fix error

* make style check happy
Co-authored-by: chenhaitao <chenhaitao@qiyi.com>

625f512d

[Flax/JAX] Add how to propose projects markdown (#12311) · 44739c81
Patrick von Platen authored Jun 23, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* make style
```
44739c81
Add mention of the huggingface_hub methods for offline mode (#12320) · ef3dceff
Lysandre Debut authored Jun 23, 2021

ef3dceff

Flax T5 (#12150) · e98233dd

Vasudev Gupta authored Jun 23, 2021



* copy pytorch-t5

* init

* boom boom

* forward pass same

* make generation work

* add more tests

* make test work

* finish normal tests

* make fix-copies

* finish quality

* correct slow example

* correct slow test

* version table

* upload models

* Update tests/test_modeling_flax_t5.py

* correct incorrectly deleted line
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

e98233dd

Rewrite ProphetNet to adapt converting ONNX friendly (#11981) · 7d4cfa3b
David Fan authored Jun 23, 2021
```
* Rewrite

* [ONNX] rewrite
```
7d4cfa3b

Flax summarization script (#12230) · c0fe3c9a

Suraj Patil authored Jun 23, 2021

* add summarization script

* fix arguments, preprocessing, metrics

* add generation and metrics

* auto model, prediction loop

* prettify

* label smoothing

* address Sylvain and Patrick's suggestions

* dynamically import shift_tokens_right

* fix shift_tokens_right_fn call

c0fe3c9a

Add output in a dictionary for TF `generate` method (#12139) · 26a2e365

Daniel Stancl authored Jun 23, 2021

* Add output args to greedy search

* Fix critical typo + make style quality

* Handle generate_beam_search

* Add dict_specific tests and fix the placement of encoder outputs

* Add  specific outputs

* Update doc

* Fix typo

* Adjust handling encoder_outputs + Fix generating for T5

* Fix generate for RAG

* Fix handling output_attentions when target_mapping is not None

Take care of situations when target_mapping is provided,
as each attention entry is then a 2-tuple

Change from:
if inputs["output_attentions"]:
    attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions)

to:
if inputs["output_attentions"]:
    if inputs["target_mapping"] is not None:
        # when target_mapping is provided, each attention entry is a 2-tuple
        attentions = tuple(
            tuple(tf.transpose(attn_stream, perm=(2, 3, 0, 1)) for attn_stream in t) for t in attentions
        )
    else:
        attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions)

* Rename kwargs to model_kwargs

* make style quality

* Move imports in test_modeling_tf_common.py

Move ModelOutput-related imports in test_modeling_tf_common.py
into the `is_tf_available():` statement.

* Rewrite nested if-statements

* Fix added tests

26a2e365

Optimizing away the `fill-mask` pipeline. (#12113) · d4be4984

Nicolas Patry authored Jun 23, 2021



* Optimizing away the `fill-mask` pipeline.

- Don't send anything to the tokenizer unless needed. The vocab check is
much faster.
- Keep BC by sending data to the tokenizer when needed. Users who act on the warning messages will see the performance benefits again.
- Make `targets` and `top_k` work together better: `top_k` cannot be
higher than `len(targets)` but can still be smaller.
- Actually simplify the `target_ids` in case of duplicates (it can happen
because we're parsing raw strings).
- Removed useless code to fail on empty strings. It worked only if the empty
string was in first position; moved to ignoring them instead.
- Changed the related tests as only the tests would fail correctly
(having an incorrect value in first position).

* Make tests compatible for 2 different vocabs... (at the price of a
warning).

Co-authored-by: @EtaoinWu

* ValueError working globally

* Update src/transformers/pipelines/fill_mask.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* `tokenizer.vocab` -> `tokenizer.get_vocab()` for more compatibility +
fallback.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

d4be4984
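
The target handling listed in the `fill-mask` commit above (#12113) can be pictured with a small sketch. This is not the pipeline's code: `resolve_target_ids` and `effective_top_k` are hypothetical helper names, the vocabulary is a plain dict, and out-of-vocabulary targets are simply skipped here, whereas the commit message says they are still sent to the tokenizer for backward compatibility.

```python
from typing import Dict, List


def resolve_target_ids(targets: List[str], vocab: Dict[str, int]) -> List[int]:
    """Map raw target strings to vocabulary ids (hypothetical helper).

    Empty strings are ignored, duplicates are collapsed in first-seen order,
    and out-of-vocabulary targets are skipped (a simplification).
    """
    seen = set()
    target_ids = []
    for target in targets:
        if not target:
            continue  # ignore empty strings instead of failing on them
        token_id = vocab.get(target)  # plain dict lookup, no tokenizer call
        if token_id is None or token_id in seen:
            continue
        seen.add(token_id)
        target_ids.append(token_id)
    if not target_ids:
        raise ValueError("At least one target must be in the vocabulary")
    return target_ids


def effective_top_k(top_k: int, target_ids: List[int]) -> int:
    # top_k may be smaller than the number of targets, but never larger.
    return min(top_k, len(target_ids))
```

With a toy vocabulary such as `{"cat": 1, "dog": 2}`, passing `["cat", "", "cat", "dog"]` yields two distinct ids, and a requested `top_k` of 5 is capped at 2.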

Add CodeCarbon Integration (#12304) · 037e466b

Kevin Canwen Xu authored Jun 23, 2021

* Add optional dependency

* Add CodeCarbon integration

* Add CodeCarbon integration

* Add CodeCarbon integration

* typo

037e466b

22 Jun, 2021 10 commits

[docs] performance (#12258) · bfd5da8e

Stas Bekman authored Jun 22, 2021



* initial performance document

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* rewrites based on suggestions

* 8x multiple is for AMP only

* add contribute section
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

bfd5da8e

FlaxBartPretrainedModel -> FlaxBartPreTrainedModel (#12313) · 1562c04e
Sylvain Gugger authored Jun 22, 2021

1562c04e
[trainer] 2 bug fixes and a rename (#12309) · ebe54135
Stas Bekman authored Jun 22, 2021
```
* bug fixes and a rename

* add extended DDP test
```
ebe54135

[Flax] Main doc for event orga (#12305) · 64029abe

Patrick von Platen authored Jun 22, 2021

* fix_torch_device_generate_test

* remove @

* push

* finish

* some typos

* add more info on communication

* add suggestions

64029abe

Fix and improve documentation for LEDForConditionalGeneration (#12303) · 032d56a4

Kilian Kluge authored Jun 22, 2021

* Replace conditional generation example (fixes #12268)

* Replace model in summarization example with finetuned checkpoint, adapt example text

* Fix typo in new summarization example

* Fix docstring formatting, add missing import statement to example

032d56a4

add FlaxAutoModelForImageClassification in main init (#12298) · 1498eb98
Suraj Patil authored Jun 22, 2021

1498eb98
trainer_tf: adjust wandb installation command (#12291) · 2affeb29
Stefan Schweter authored Jun 22, 2021

2affeb29

Fix for the issue of device-id getting hardcoded for token_type_ids during Tracing [WIP] (#11252) · af6e01c5

Hamid Shojanazeri authored Jun 22, 2021



* registering a buffer for token_type_ids, to pass the error of device-id getting hardcoded when tracing

* style format

* adding persistent flag to the registered buffers, which prevents adding them to the state_dict and addresses the backward compatibility issue

* adding the try catch to the fix as persistent flag is only available from PT >1.6

* adding version check

* added the condition to only use the token_type_ids buffer when it's autogenerated, not passed by the user

* adding comments and making the condition where token_type_ids are None to use the registered buffer

* taking out position-embedding from the if block

* adding comments

* handling the case if buffer for position_ids was not registered

* reverted the changes on position_ids, fix the issue with size of token_type_ids buffer, moved the modification for generated token_type_ids to Bertmodel, instead of Embeddings

* reverting the token_type_ids in case of None to the previous version

* reverting changes on position_ids adding back the if block

* changes added by running make fix-copies

* changes added by running make fix-copies and added the import version as it was getting used

* changes added by running make fix-copies

* changes added by running make fix-copies

* fixing the import format

* fixing the import format

* modified to use temp tensor for trimmed and expanded token_type_ids buffer

* changes made by fix-copies after temp tensor modifications

* changes made by fix-copies after temp tensor modifications

* changes made by fix-copies after temp tensor modifications

* clean up

* clean up

* clean up

* clean up

* Nit

* Nit

* Nit

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* modified according to support device conversion on traced models

* changes based on latest in master

* Adapt templates

* Add version import
Co-authored-by: Ubuntu <ubuntu@ip-172-31-32-81.us-west-2.compute.internal>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

af6e01c5

[tests] multiple improvements (#12294) · 0d97ba8a
Stas Bekman authored Jun 21, 2021
```
* [tests] multiple improvements

* cleanup

* style

* todo to investigate

* fix
```
0d97ba8a

[trainer + examples] set log level from CLI (#12276) · dad414d5

Stas Bekman authored Jun 21, 2021



* set log level from CLI

* add log_level_replica + test + extended docs

* cleanup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* rename datasets objects to allow datasets module

* improve the doc

* style

* doc improve
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

dad414d5
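
The idea behind the log-level commit above (#12276), one level for the main process and a separate `log_level_replica` for the others, can be sketched with the standard library alone. This is only an illustration under stated assumptions: the defaults, the `--local_rank` convention for spotting the main process, and the helper names `build_parser` and `select_log_level` are all hypothetical and are not taken from the Trainer.

```python
import argparse
import logging


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical CLI with a main-process level and a replica level."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--log_level", default="warning")          # assumed default
    parser.add_argument("--log_level_replica", default="error")    # assumed default
    parser.add_argument("--local_rank", type=int, default=-1)      # -1: not distributed
    return parser


def select_log_level(args: argparse.Namespace) -> int:
    """Return the numeric logging level this process should use."""
    # Simplification: rank -1 (single process) or 0 counts as the main process.
    is_main = args.local_rank in (-1, 0)
    name = args.log_level if is_main else args.log_level_replica
    return getattr(logging, name.upper())


if __name__ == "__main__":
    cli_args = build_parser().parse_args()
    logging.basicConfig(level=select_log_level(cli_args))
    logging.getLogger(__name__).info("visible only at INFO level or lower")
```

Keeping replicas at a stricter level than the main process avoids printing the same message once per worker.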

21 Jun, 2021 2 commits
- reset report_to to none, avoid deprecation warning (#12293) · a4ed074d
  Stas Bekman authored Jun 21, 2021
  
  a4ed074d
- [Flax] Add jax flax to env command (#12251) · 7ef309ca
  Patrick von Platen authored Jun 21, 2021
```
* fix_torch_device_generate_test

* remove @

* add commands for flax/jax
```
  7ef309ca