- 28 Jun, 2021 2 commits
-
-
Taha ValizadehAslani authored
Previously, the code could not be used for validation only, because the line `extension = data_args.train_file.split(".")[-1]` assumed that the extension must be extracted from the training dataset. This line ran regardless of the user's training or validation options, which led to an error when the user only wanted to run evaluation without training (because the training file does not exist). I modified it to extract the extension from the training file when the user wants to train, and from the validation file when the user wants to run evaluation only. This way the code can be used for training and validation separately. -
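A minimal sketch of the behaviour described above (the helper and argument names are illustrative, not the actual example-script code): the extension is taken from the training file only when training is requested, otherwise from the validation file.

```python
def resolve_extension(train_file, validation_file, do_train):
    """Pick the dataset file whose extension decides the loader.

    Illustrative helper: look at the training file only when training is
    requested; for an evaluation-only run, fall back to the validation file.
    """
    source = train_file if do_train and train_file is not None else validation_file
    return source.split(".")[-1]


# An evaluation-only run with no training file no longer crashes:
assert resolve_extension(None, "dev.json", do_train=False) == "json"
```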
Kilian Kluge authored
[Documentation] Warn that DataCollatorForWholeWordMask is limited to BertTokenizer-like tokenizers (#12371) * Notify users that DataCollatorForWholeWordMask is limited to BertTokenizer-like tokenizers * Fix code formatting
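A short, hedged usage sketch (model name and sample text are illustrative): the collator groups the WordPiece sub-tokens that BERT-style tokenizers mark with the "##" prefix, which is why it is documented as limited to BertTokenizer-like tokenizers.

```python
from transformers import BertTokenizerFast, DataCollatorForWholeWordMask

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
collator = DataCollatorForWholeWordMask(tokenizer=tokenizer, mlm_probability=0.15)

# Sub-word pieces of the same word (e.g. "mask", "##ing") are masked together.
features = [tokenizer("Whole word masking masks sub-word pieces together.")]
batch = collator(features)
print(batch["input_ids"].shape, batch["labels"].shape)
```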
-
- 26 Jun, 2021 2 commits
-
-
Bhadresh Savani authored
-
Bhadresh Savani authored
-
- 25 Jun, 2021 10 commits
-
-
Bhadresh Savani authored
* added log_level * fix comment * fixed log_level * Trigger CI * Unified logging * simplified args for log_level
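A hedged example of the new option (the output directory and level value are illustrative): `log_level` sets the logging verbosity used on the main process, while the default leaves the current level untouched.

```python
from transformers import TrainingArguments

# Illustrative values only: "info" raises verbosity on the main process;
# the default setting keeps whatever logging level is already configured.
args = TrainingArguments(output_dir="out", log_level="info")
```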
-
Stas Bekman authored
* main_process_first context manager * handle multi-node, add context description * sync desc
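A sketch of how the context manager is typically used; the preprocessing helper below is hypothetical and stands in for work (e.g. a datasets `map` call) whose cached result the other ranks can reuse.

```python
from transformers import TrainingArguments


def preprocess_dataset():
    # Hypothetical stand-in for a datasets.map(...) call whose cache
    # the non-main ranks can reuse once the main process has built it.
    return ["tokenized example"]


training_args = TrainingArguments(output_dir="out")

# The main process enters the block first; the other ranks wait, then run it
# and hit the cache. `desc` feeds the synchronization log message.
with training_args.main_process_first(desc="dataset map pre-processing"):
    processed = preprocess_dataset()
```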
-
cronoik authored
* fixed multiple-choice tokenization Previously the model would have seen two sequences: 1. [CLS]prompt[SEP]prompt[SEP] 2. [CLS]choice0[SEP]choice1[SEP] which is not correct, as we want a contextualized embedding of prompt and choice * removed outer brackets for proper sequence generation
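An illustrative sketch of the corrected pairing (model name and texts are made up): the prompt is paired with each choice so every encoded sequence is [CLS] prompt [SEP] choice [SEP], giving a contextualized embedding of prompt and choice together.

```python
from transformers import BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")

prompt = "The capital of France is"
choices = ["Paris.", "Berlin."]

# Pair the prompt with each choice: [CLS] prompt [SEP] choice [SEP]
encoding = tokenizer([prompt] * len(choices), choices, padding=True, return_tensors="pt")
print(tokenizer.batch_decode(encoding["input_ids"]))
```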
-
Stas Bekman authored
-
Sylvain Gugger authored
-
Kai Fricke authored
* Replace NotebookProgressReporter by ProgressReporter in Ray Tune run * Move to local import
-
Vasudev Gupta authored
* port bigbird script * adapt script a bit * change location * adapt more * save progress * init commit * style * dataset script tested * readme add
-
jglaser authored
* fix distributed_concat for scalar outputs * Update README.md * fixed typo (#12356) * simplify fix with terser syntax Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Trigger CI Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
michal pitr <21157924+MichalPitr@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
michal pitr authored
-
Patrick von Platen authored
-
- 24 Jun, 2021 5 commits
-
-
Marc van Zee authored
-
Sylvain Gugger authored
-
Lysandre Debut authored
* Fix torchscript tests * Better test * Remove bogus print
-
Suraj Patil authored
-
Richard Liaw authored
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
-
- 23 Jun, 2021 21 commits
-
-
Sylvain Gugger authored
-
Stas Bekman authored
* document sub_group_size * style * install + issues reporting * style * style * Update docs/source/main_classes/deepspeed.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * indent 4 * restore * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sam Havens authored
mention in the `save_strategy` param description that `load_best_model_at_end` can override it
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Patrick von Platen authored
* finish t5 flax fixes * improve naming
-
Sylvain Gugger authored
-
Michael Benayoun authored
Co-authored-by: Michael Benayoun <michael@huggingface.co>
-
Lysandre authored
-
Lysandre Debut authored
-
Sylvain Gugger authored
* Add all XxxPreTrainedModel to the main init * Add to template * Add to template bis * Add FlaxT5
-
Sylvain Gugger authored
* Clean push to hub API * Create working dir if it does not exist * Different tweak * New API + all models + test Flax * Adds the Trainer clean up * Update src/transformers/file_utils.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Address review comments * (nit) output types * No need to set clone_from when folder exists * Update src/transformers/trainer.py Co-authored-by:
Julien Chaumond <julien@huggingface.co> * Add generated_from_trainer tag * Update to new version * Fixes Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Julien Chaumond <julien@huggingface.co> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
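A hedged sketch of the cleaned-up API (the repository name is hypothetical, and the call performs a real upload requiring Hugging Face authentication, so treat it as illustrative only): models and tokenizers expose `push_to_hub` directly.

```python
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained("bert-base-uncased")
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Hypothetical repo name; requires a Hugging Face login/token and
# uploads the saved files to the Hub.
model.push_to_hub("my-bert-copy")
tokenizer.push_to_hub("my-bert-copy")
```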
-
chenht2010 authored
* fix error * make style check happy Co-authored-by: chenhaitao <chenhaitao@qiyi.com>
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * finish * make style
-
Lysandre Debut authored
-
Vasudev Gupta authored
* copy pytorch-t5 * init * boom boom * forward pass same * make generation work * add more tests * make test work * finish normal tests * make fix-copies * finish quality * correct slow example * correct slow test * version table * upload models * Update tests/test_modeling_flax_t5.py * correct incorrectly deleted line Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Patrick von Platen <patrick@huggingface.co>
-
David Fan authored
* Rewrite * [ONNX] rewrite
-
Suraj Patil authored
* add summarization script * fix arguments, preprocessing, metrics * add generation and metrics * auto model, prediction loop * prettify * label smoothing * address Sylvain's and Patrick's suggestions * dynamically import shift_tokens_right * fix shift_tokens_right_fn call
-
Daniel Stancl authored
* Add output args to greedy search * Fix critical typo + make style quality * Handle generate_beam_search * Add dict_specific tests and fix the placement of encoder outputs * Add specific outputs * Update doc * Fix typo * Adjust handling encoder_outputs + Fix generating for T5 * Fix generate for RAG * Fix handling output_attentions when target_mapping is not None Take care of situations when target_mapping is provided, as there is then a 2-tuple of attentions Change from: if inputs["output_attentions"]: attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions) to: if inputs["output_attentions"]: if inputs["target_mapping"] is not None: # when target_mapping is provided, there is a 2-tuple of attentions attentions = tuple( tuple(tf.transpose(attn_stream, perm=(2, 3, 0, 1)) for attn_stream in t) for t in attentions ) else: attentions = tuple(tf.transpose(t, perm=(2, 3, 0, 1)) for t in attentions) * Rename kwargs to model_kwargs * make style quality * Move imports in test_modeling_tf_common.py Move ModelOutput-related imports in test_modeling_tf_common.py into the `is_tf_available():` statement. * Rewrite nested if-statements * Fix added tests -
Nicolas Patry authored
* Optimizing away the `fill-mask` pipeline. - Don't send anything to the tokenizer unless needed. Vocab check is much faster - Keep BC by sending data to the tokenizer when needed. Users handling warning messages will see performance benefits again - Make `targets` and `top_k` work together better: `top_k` cannot be higher than `len(targets)` but can be smaller still. - Actually simplify the `target_ids` in case of duplicates (it can happen because we're parsing raw strings) - Removed useless code to fail on empty strings. It works only if the empty string is in first position, moved to ignoring them instead. - Changed the related tests as only the tests would fail correctly (having an incorrect value in first position) * Make tests compatible with 2 different vocabs... (at the price of a warning). Co-authored-by: @EtaoinWu * ValueError working globally * Update src/transformers/pipelines/fill_mask.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * `tokenizer.vocab` -> `tokenizer.get_vocab()` for more compatibility + fallback. Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
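A hedged usage example (model name and sentence are illustrative): `targets` restricts scoring to the given candidate tokens and, as described above, `top_k` is effectively capped at `len(targets)`.

```python
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

# Only the listed candidates are scored; asking for more results than
# there are targets simply returns len(targets) predictions.
print(fill_mask("Paris is the capital of [MASK].", targets=["france", "germany"], top_k=5))
```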
-
Kevin Canwen Xu authored
* Add optional dependency * Add CodeCarbon integration * Add CodeCarbon integration * Add CodeCarbon integration * typo
-