Commits · ff5cdc086be1e0c3e2bbad8e3469b34cffb55a85 · chenpangpang / transformers

26 Jun, 2021 2 commits
- replace print with logger (#12368) · ff5cdc08
  Bhadresh Savani authored Jun 26, 2021
  
  ff5cdc08
- updated example template (#12365) · 9a754594
  Bhadresh Savani authored Jun 26, 2021
  
  9a754594
25 Jun, 2021 10 commits

[Examples] Replicates the new --log_level feature to all trainer-based pytorch (#12359) · 539ee456
Bhadresh Savani authored Jun 25, 2021
```
* added log_level

* fix comment

* fixed log_level

* Trigger CI

* Unified logging

* simplified args for log_level
```
539ee456
[trainer] add main_process_first context manager (#12351) · 64e60980
Stas Bekman authored Jun 25, 2021
```
* main_process_first context manager

* handle multi-node, add context description

* sync desc
```
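The idea behind a "main process first" context manager can be sketched as follows. This is a minimal illustration and not the Trainer's implementation: the `is_main` flag and the `barrier` callable are hypothetical parameters introduced here, and in a real distributed run the barrier would be something like `torch.distributed.barrier`.

```python
from contextlib import contextmanager


@contextmanager
def main_process_first(is_main, barrier):
    """Run the body on the main process first; replicas wait, then run it.

    `is_main` and `barrier` are assumptions made for this sketch.
    """
    if not is_main:
        # Replicas block here until the main process reaches its own barrier.
        barrier()
    try:
        yield
    finally:
        if is_main:
            # The main process has finished the body; release the replicas.
            barrier()


# Single-process demonstration with a barrier that only records its call.
events = []
with main_process_first(is_main=True, barrier=lambda: events.append("barrier")):
    events.append("body")
```

This pattern is typically useful when the main process should build a cache (for example a preprocessed dataset) that the other processes then only load.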
64e60980

fixed multiplechoice tokenization (#12362) · f8664258

cronoik authored Jun 25, 2021

* fixed multiplechoice tokenization

The model would have seen two sequences:
1. [CLS]prompt[SEP]prompt[SEP]
2. [CLS]choice0[SEP]choice1[SEP]
That is not correct, as we want a contextualized embedding of each prompt and choice pair.

* removed outer brackets for proper sequence generation
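
The pairing problem described in this commit message can be illustrated without a real tokenizer. This is a minimal sketch: `encode_pair` is a hypothetical stand-in that only mimics the `[CLS]a[SEP]b[SEP]` layout and is not the tokenizer's API.

```python
def encode_pair(first, second):
    """Hypothetical stand-in for a tokenizer's sentence-pair encoding."""
    return f"[CLS]{first}[SEP]{second}[SEP]"


def encode_multiple_choice(prompt, choices):
    # Build one (prompt, choice) pair per candidate so that each sequence
    # contains both the prompt and exactly one choice.
    return [encode_pair(prompt, choice) for choice in choices]


encoded = encode_multiple_choice("prompt", ["choice0", "choice1"])
```

Each element of `encoded` pairs the prompt with a single choice, instead of pairing the prompt with itself and the choices with each other.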

f8664258

remove extra white space from log format (#12360) · 4a872cae
Stas Bekman authored Jun 25, 2021

4a872cae
Style · a3daabfe
Sylvain Gugger authored Jun 25, 2021

a3daabfe
Replace NotebookProgressReporter by ProgressReporter in Ray Tune run (#12357) · 238521b0
Kai Fricke authored Jun 25, 2021
```
* Replace NotebookProgressReporter by ProgressReporter in Ray Tune run

* Move to local import
```
238521b0

Add FlaxBigBird QuestionAnswering script (#12233) · 332a2458

Vasudev Gupta authored Jun 25, 2021

* port bigbird script

* adapt script a bit

* change location

* adapt more

* save progress

* init commit

* style

* dataset script tested

* readme add

332a2458

Fix exception in prediction loop occurring for certain batch sizes (#12350) · 55bb4c06

jglaser authored Jun 25, 2021



* fix distributed_concat for scalar outputs

* Update README.md

* fixed typo (#12356)

* simplify fix with terser syntax
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Trigger CI
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: michal pitr <21157924+MichalPitr@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

55bb4c06

fixed typo (#12356) · d4ce31e8
michal pitr authored Jun 25, 2021

d4ce31e8
Update README.md · aa550c4a
Patrick von Platen authored Jun 25, 2021

aa550c4a

24 Jun, 2021 5 commits
- Add flax/jax quickstart (#12342) · f2c4ce7e
  Marc van Zee authored Jun 24, 2021
  
  f2c4ce7e
- Document patch release v4.8.1 · 5b1b5635
  Sylvain Gugger authored Jun 24, 2021
  
  5b1b5635
- Fix torchscript tests (#12336) · 8ef62ec9
  Lysandre Debut authored Jun 24, 2021
```
* Fix torchscript tests

* Better test

* Remove bogus print
```
  8ef62ec9
- [examples/Flax] move the examples table up (#12341) · aef3823e
  Suraj Patil authored Jun 24, 2021
  
  aef3823e
- try-this (#12338) · 7875b638
  Richard Liaw authored Jun 24, 2021
```
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
```
  7875b638
23 Jun, 2021 21 commits

Fix default to logging_dir lost in merge conflict · cf3c9198
Sylvain Gugger authored Jun 23, 2021

cf3c9198

[Deepspeed] new docs (#12077) · 07ae6103

Stas Bekman authored Jun 23, 2021



* document sub_group_size

* style

* install + issues reporting

* style

* style

* Update docs/source/main_classes/deepspeed.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* indent 4

* restore

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

07ae6103

Update training_args.py (#12328) · 3694484d

Sam Havens authored Jun 23, 2021

mention in the `save_strategy` param description that `load_best_model_at_end` can override it

3694484d

v4.9.0.dev0 · 2150dfed
Sylvain Gugger authored Jun 23, 2021

2150dfed
Release: v4.8.0 · 9252a512
Sylvain Gugger authored Jun 23, 2021

9252a512
[Flax T5] Fix weight initialization and fix docs (#12327) · 468cda20
Patrick von Platen authored Jun 23, 2021
```
* finish t5 flax fixes

* improve naming
```
468cda20
Pin good version of huggingface_hub · 12a4457c
Sylvain Gugger authored Jun 23, 2021

12a4457c
changed modeling_fx_utils.py to utils/fx.py for clarity (#12326) · 986ac03e
Michael Benayoun authored Jun 23, 2021
```
Co-authored-by: Michael Benayoun <michael@huggingface.co>
```
986ac03e
Temporarily revert the `fill-mask` improvements. · 941b4442
Lysandre authored Jun 23, 2021

941b4442
Conda build (#12323) · 4bdff2cd
Lysandre Debut authored Jun 23, 2021

4bdff2cd
Add all XxxPreTrainedModel to the main init (#12314) · 9eda6b52
Sylvain Gugger authored Jun 23, 2021
```
* Add all XxxPreTrainedModel to the main init

* Add to template

* Add to template bis

* Add FlaxT5
```
9eda6b52

Clean push to hub API (#12187) · 53c60bab

Sylvain Gugger authored Jun 23, 2021



* Clean push to hub API

* Create working dir if it does not exist

* Different tweak

* New API + all models + test Flax

* Adds the Trainer clean up

* Update src/transformers/file_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* (nit) output types

* No need to set clone_from when folder exists

* Update src/transformers/trainer.py
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Add generated_from_trainer tag

* Update to new version

* Fixes
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

53c60bab

[TFWav2Vec2] Fix docs (#12283) · 625f512d

chenht2010 authored Jun 23, 2021



* fix error

* make style check happy
Co-authored-by: chenhaitao <chenhaitao@qiyi.com>

625f512d

[Flax/JAX] Add how to propose projects markdown (#12311) · 44739c81
Patrick von Platen authored Jun 23, 2021
```
* fix_torch_device_generate_test

* remove @

* finish

* make style
```
44739c81
Add mention of the huggingface_hub methods for offline mode (#12320) · ef3dceff
Lysandre Debut authored Jun 23, 2021

ef3dceff

Flax T5 (#12150) · e98233dd

Vasudev Gupta authored Jun 23, 2021



* copy pytorch-t5

* init

* boom boom

* forward pass same

* make generation work

* add more tests

* make test work

* finish normal tests

* make fix-copies

* finish quality

* correct slow example

* correct slow test

* version table

* upload models

* Update tests/test_modeling_flax_t5.py

* correct incorrectly deleted line
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

e98233dd

Rewrite ProphetNet to adapt converting ONNX friendly (#11981) · 7d4cfa3b
David Fan authored Jun 23, 2021
```
* Rewrite

* [ONNX] rewrite
```
7d4cfa3b

Flax summarization script (#12230) · c0fe3c9a

Suraj Patil authored Jun 23, 2021

* add summarization script

* fix arguments, preprocessing, metrics

* add generation and metrics

* auto model, prediction loop

* prettify

* label smoothing

* address Sylvain and Patrick's suggestions

* dynamically import shift_tokens_right

* fix shift_tokens_right_fn call

c0fe3c9a

Add output in a dictionary for TF `generate` method (#12139) · 26a2e365

Daniel Stancl authored Jun 23, 2021

* Add output args to greedy search

* Fix critical typo + make style quality

* Handle generate_beam_search

* Add dict_specific tests and fix the placement of encoder outputs

* Add specific outputs

* Update doc

* Fix typo

* Adjust handling encoder_outputs + Fix generating for T5

* Fix generate for RAG

* Fix handling output_attentions when target_mapping is not None

Take care of situations when target_mapping is provided,
as the attentions are then 2-tuples

Change from:
if inputs["output_attentions"]:
    attentions = tuple(tf.transpose(t, perm(2, 3, 0, 1)) for t in attentions)

to:
if inputs["output_attentions"]:
    if inputs["target_mapping"] is not None:
        # when target_mapping is provided, the attentions are 2-tuples
        attentions = tuple(
            tuple(tf.transpose(attn_stream, perm=(2, 3, 0, 1)) for attn_stream in t) for t in attentions
        )
    else:
        attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions)

* Rename kwargs to model_kwargs

* make style quality

* Move imports in test_modeling_tf_common.py

Move ModelOutput-related imports in test_modeling_tf_common.py
into the `is_tf_available():` statement.

* Rewrite nested if-statements

* Fix added tests

26a2e365

Optimizing away the `fill-mask` pipeline. (#12113) · d4be4984

Nicolas Patry authored Jun 23, 2021



* Optimizing away the `fill-mask` pipeline.

- Don't send anything to the tokenizer unless needed. The vocab check is
much faster.
- Keep BC by sending data to the tokenizer when needed. Users who handle the warning messages will see the performance benefits again.
- Make `targets` and `top_k` work together better: `top_k` cannot be
higher than `len(targets)` but can still be smaller.
- Actually simplify the `target_ids` in case of duplicates (it can happen
because we're parsing raw strings).
- Removed useless code to fail on empty strings. It worked only if the empty
string was in first position; moved to ignoring them instead.
- Changed the related tests, as only the tests would fail correctly
(having an incorrect value in first position).
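
The target handling listed above can be sketched in plain Python. This is an illustration under stated assumptions and not the pipeline's code: `resolve_targets` is a hypothetical helper, `vocab` is an ordinary dict from token string to id, and raising `ValueError` when no target resolves is an assumption of this sketch.

```python
def resolve_targets(targets, vocab, top_k):
    """Map target strings to unique vocab ids and clamp top_k (hypothetical helper)."""
    target_ids = []
    for target in targets:
        if not target:
            # Empty strings are ignored instead of causing a failure.
            continue
        token_id = vocab.get(target)
        if token_id is None:
            # Out-of-vocab targets are skipped in this sketch; the commit
            # message says the real pipeline falls back to the tokenizer.
            continue
        if token_id not in target_ids:
            # Raw strings can repeat, so keep each id only once.
            target_ids.append(token_id)
    if not target_ids:
        raise ValueError("At least one target must be in the vocabulary")
    # top_k cannot exceed the number of resolved targets, but may be smaller.
    return target_ids, min(top_k, len(target_ids))


toy_vocab = {"cat": 1, "dog": 2}
ids, k = resolve_targets(["cat", "", "dog", "cat"], toy_vocab, top_k=5)
```

Looking targets up in the vocabulary first is what avoids the tokenizer call in the common case.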

* Make tests compatible for 2 different vocabs... (at the price of a
warning).

Co-authored-by: @EtaoinWu

* ValueError working globally

* Update src/transformers/pipelines/fill_mask.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* `tokenizer.vocab` -> `tokenizer.get_vocab()` for more compatibility +
fallback.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

d4be4984

Add CodeCarbon Integration (#12304) · 037e466b

Kevin Canwen Xu authored Jun 23, 2021

* Add optional dependency

* Add CodeCarbon integration

* Add CodeCarbon integration

* Add CodeCarbon integration

* typo

037e466b

22 Jun, 2021 2 commits

[docs] performance (#12258) · bfd5da8e

Stas Bekman authored Jun 22, 2021



* initial performance document

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* rewrites based on suggestions

* 8x multiple is for AMP only

* add contribute section
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

bfd5da8e

FlaxBartPretrainedModel -> FlaxBartPreTrainedModel (#12313) · 1562c04e
Sylvain Gugger authored Jun 22, 2021

1562c04e