- 07 Jan, 2021 8 commits
-
-
Patrick von Platen authored
* fix tf flaky * remove test files
-
Sylvain Gugger authored
* Main init work
* Add version
* Change from absolute to relative imports
* Fix imports
* One more typo
* More typos
* Styling
* Make quality script pass
* Add necessary replace in template
* Fix typos
* Spaces are ignored in replace for some reason
* Forgot one model.
* Fixes for import
* Add documentation
* Styling

Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
-
Patrick von Platen authored
* fix common inputs pt flaky led * fix other tests correspondingly
-
Patrick von Platen authored
-
Julien Plu authored
* Add a serving method
* Add albert
* Add serving for BERT and BART
* Add more models
* Finish the serving addition
* Temp fix
* Restore DPR
* Fix funnel attribute
* Fix attributes GPT2
* Fix OpenAIGPT attribute
* Fix T5 attributes
* Fix Bart attributes
* Fix TransfoXL attributes
* Add versioning
* better test
* Update template
* Fix Flaubert
* Fix T5
* Apply style
* Remove unused imports
* Deactivate extra parameters
* Remove too long test + saved_model default to False
* Ignore the saved model test for some models
* Fix some inputs
* Fix mpnet serving
* Trigger CI
* Address all comments
-
guillaume-be authored
* Vectorized `ngram_attention_bias` calculation
* updated formatting with black
* Further optimization
* one (last) optimization
-
Stas Bekman authored
-
Sylvain Gugger authored
-
- 06 Jan, 2021 15 commits
-
-
Patrick von Platen authored
* fix generation models
* fix led
* fix docs
* add is_decoder
* fix last docstrings
* make style
* fix t5 cross attentions
* correct t5
-
Sylvain Gugger authored
* Don't import libs to check they are available
* Don't import integrations at init
* Add importlib_metadata to deps
* Remove old vars references
* Avoid syntax error
* Adapt testing utils
* Try to appease torchhub
* Add dependency
* Remove more private variables
* Fix typo
* Another typo
* Refine the tf availability test
-
Simon Brandeis authored
* Define new output dataclasses for greedy generation
* Add output_[...] flags in greedy generation methods
  Added output_attentions, output_hidden_states, output_scores flags in generate and greedy_search methods in GenerationMixin.
* [WIP] Implement logic and tests for output flags in generation
* Update GreedySearchOutput classes & docstring
* Implement greedy search output accumulation logic
  Update greedy_search unittests
  Fix generate method return value docstring
  Properly init flags with the default config
* Update configuration to add output_scores flag
* Fix test_generation_utils
  Sort imports and fix isinstance tests for GreedySearchOutputs
* Fix typo in generation_utils
* Add return_dict_in_generate for backwards compatibility
* Add return_dict_in_generate flag in config
* Fix typo in configuration
* Fix handling of attentions and hidden_states flags
* Make style & quality
* first attempt attentions
* some corrections
* improve tests
* special models require special tests
* disable xlm test for now
* clean tests
* fix for tf
* isort
* Add output dataclasses for other generation methods
* Add logic to return dict in sample generation
* Complete test for sample generation
  - Pass output_attentions and output_hidden_states flags to encoder in encoder-decoder models
  - Fix import statements order in test_generation_utils file
* Add logic to return dict in sample generation
  - Refactor tests to avoid using self.assertTrue, which provides scarce information when the test fails
  - Add tests for the three beam_search methods: vanilla, sample and grouped
* Style doc
* Fix copy-paste error in generation tests
* Rename logits to scores and refactor
* Refactor group_beam_search for consistency
* make style
* add sequences_scores
* fix all tests
* add docs
* fix beam search finalize test
* correct docstring
* clean some files
* Made suggested changes to the documentation
* Style doc
* Style doc using the Python util
* Update src/transformers/generation_utils.py
* fix empty lines
* fix all tests

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
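A minimal sketch of the flag-controlled output pattern this commit describes: a dataclass carries `sequences` plus optional extras, and a `return_dict_in_generate` flag preserves the old plain-sequence return. The class and flag names mirror the commit message, but the toy decoder below is invented purely for illustration and is not the library's actual implementation.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class GreedySearchOutput:
    # field names follow the commit message; contents are illustrative
    sequences: List[int]
    scores: Optional[Tuple[float, ...]] = None

def greedy_search(step_scores, return_dict_in_generate=False, output_scores=False):
    # toy decoder: at each step pick the token id with the highest score
    sequences = [max(range(len(s)), key=s.__getitem__) for s in step_scores]
    if return_dict_in_generate:
        return GreedySearchOutput(
            sequences=sequences,
            scores=tuple(max(s) for s in step_scores) if output_scores else None,
        )
    # backwards compatible: plain token ids when the flag is off
    return sequences

out = greedy_search(
    [[0.1, 0.9], [0.7, 0.3]], return_dict_in_generate=True, output_scores=True
)
```

With the flag off, callers that expect a bare sequence keep working unchanged, which is the backwards-compatibility point made in the commit.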
-
Kevin Canwen Xu authored
* Store transformers version info when saving the model
* Store transformers version info when saving the model
* fix format
* fix format
* fix format
* Update src/transformers/configuration_utils.py
* Update configuration_utils.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
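The version-stamping idea can be sketched as follows: the config's serialization path injects the library version so saved files record what produced them. The `Config` class and version constant here are toys standing in for `configuration_utils.py`, whose real logic differs.

```python
import json

TRANSFORMERS_VERSION = "4.2.0"  # placeholder; the real value comes from the package

class Config:
    def __init__(self, **kwargs):
        self.__dict__.update(kwargs)

    def to_dict(self):
        output = dict(self.__dict__)
        # record which library version produced this file, as in the commit above
        output["transformers_version"] = TRANSFORMERS_VERSION
        return output

    def to_json_string(self):
        return json.dumps(self.to_dict(), indent=2, sort_keys=True)

cfg = Config(hidden_size=768)
```

Stamping the version at save time makes it possible to diagnose loading issues later without any extra metadata files.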
-
Qbiwan authored
* first commit
* change phobert to phoBERT as per author in overview
* v3 and v4 both run on the same code, hence there is no need to differentiate them

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Qbiwan authored
* first commit * changed XLMTokenizer to HerbertTokenizer in code example
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Add missing lines before a new list. * Update doc styler and restyle some files. * Fix docstrings of LED and Longformer
-
NielsRogge authored
-
Stas Bekman authored
* model wrapped + model_unwrap
* cleanup
* Apply suggestions from code review
* style
* deprecation warning
* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Allow example to use a revision and work with private models * Copy to other examples and template * Styling
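The "use a revision and work with private models" change above amounts to two extra script arguments. This `argparse` sketch shows the shape; the flag names and help strings are assumptions based on the commit description, not a copy of the example scripts.

```python
import argparse

parser = argparse.ArgumentParser()
# flag names are illustrative, inferred from the commit description
parser.add_argument(
    "--model_revision",
    type=str,
    default="main",
    help="Specific model version to use (branch name, tag or commit id).",
)
parser.add_argument(
    "--use_auth_token",
    action="store_true",
    help="Send the auth token so private models on the Hub can be loaded.",
)

args = parser.parse_args(["--model_revision", "v1.0", "--use_auth_token"])
```

Both values would then be forwarded to the `from_pretrained` calls in the example script.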
-
Manuel Romero authored
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Manuel Romero authored
-
Nicolas Patry authored
* Splitting pipelines into its own module.
* Moving everything into base.py
* Moving FeatureExtractionPipeline into its own file.
* TextGenerationPipeline.
* TextClassificationPipeline
* ZeroShot + get_framework import.
* FillMaskPipeline
* NerPipeline + TokenClassificationPipeline
* QuestionAnsweringPipeline
* TableQuestionAnsweringPipeline
* ConversationalPipeline
* Text2TextGenerationPipeline, TranslationPipeline, SummarizationPipeline
* Typo import fix.
* Relative imports.
-
Stas Bekman authored
* outline sharded ddp doc
* fix link
* add example
* Apply suggestions from code review
* narrow the command and remove non-essentials

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 05 Jan, 2021 13 commits
-
-
Patrick von Platen authored
* first try
* remove old template
* finish bart
* finish mbart
* delete unnecessary line
* init pegasus
* save intermediate
* correct pegasus
* finish pegasus
* remove cookie cutter leftover
* add marian
* finish blenderbot
* replace in file
* correctly split blenderbot
* delete "old" folder
* correct "add statement"
* adapt config for tf comp
* correct configs for tf
* remove ipdb
* fix more stuff
* fix mbart
* push pegasus fix
* fix mbart
* more fixes
* fix research projects code
* finish docs for bart, mbart, and marian
* delete unnecessary file
* correct attn typo
* correct configs
* remove pegasus for seq class
* correct peg docs
* correct peg docs
* finish configs
* further improve docs
* add copied from statements to mbart
* fix copied from in mbart
* add copy statements to marian
* add copied from to marian
* add pegasus copied from
* finish pegasus
* finish copied from
* Apply suggestions from code review
* make style
* backward comp blenderbot
* apply lysandres and sylvains suggestions
* apply suggestions
* push last fixes
* fix docs
* fix tok tests
* fix imports code style
* fix doc
-
Clement authored
-
Stas Bekman authored
-
Stas Bekman authored
* [t5 doc] typos
  a few runaway backticks @sgugger
* style
* [trainer] put fp16 args together
  this PR proposes a purely cosmetic change that puts all the fp16 args together - so they are easier to manage/read @sgugger
* style
-
Yusuke Mori authored
-
Patrick von Platen authored
* create model
* add integration
* save current state
* make integration tests pass
* add one more test
* add explanation to tests
* remove from bart
* add padding
* remove unnecessary test
* make all tests pass
* re-add cookie cutter tests
* finish PyTorch
* fix attention test
* Update tests/test_modeling_common.py
* revert change
* remove unused file
* add string to doc
* save intermediate
* make tf integration tests pass
* finish tf
* fix doc
* fix docs again
* add led to doctree
* add to auto tokenizer
* added tips for led
* make style
* apply jplus statements
* correct tf longformer
* apply lysandres suggestions
* apply sylvains suggestions
* Apply suggestions from code review
-
Sugeeth authored
* Use extlinks to point hyperlink with the version of code * Point to version on release and master until then * Apply style * Correct links * Add missing backtick * Simple missing backtick after all. Co-authored-by:
Raghavendra Sugeeth P S <raghav-5305@raghav-5305.csez.zohocorpin.com> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
Julien Plu authored
* Fix Funnel * Apply Patrick's comment * Remove comment * Fix dummy value * Apply style
-
Stas Bekman authored
* --model_parallel hasn't been implemented for most models * make the help clear as well * implement is_parallelizable; use it * oops * remove property
-
Julien Plu authored
-
Stas Bekman authored
This PR proposes to: * auto-flush `transformers` logging When using logging for tracing signals from different parts of the code and which could be mixed with print debug this aids to get all the logging events synchronized. I don't think this change will introduce any performance impacts. If it helps someone here is the code I used to sync `transformers` logging with various other debug prints. I was porting bart to MP and I needed to trace that the device switching happens correctly and I added a bunch of logger.info calls inside `modeling_bart.py` and also had some other helpers `print` debug messages which weren't logger based: ``` # auto flush std streams from sys import stdout, stderr def stdout_write_flush(args, w=stderr.write): w(args); stderr.flush() def stderr_write_flush(args, w=stderr.write): w(args); stderr.flush() stdout.write = stdout_write_flush stderr.write = stderr_write_flush from transformers import BartTokenizer, BartForConditionalGeneration, BartConfig import logging import transformers.utils.logging import transformers.models.bart.modeling_bart # I wanted a shorter simpler format handlers = transformers.utils.logging._get_library_root_logger().handlers for handler in handlers: formatter = logging.Formatter("[%(funcName)s] %(message)s") handler.setFormatter(formatter) transformers.models.bart.modeling_bart.logger.setLevel(transformers.logging.INFO) ``` @LysandreJik, @sgugger, @patrickvonplaten -
Julien Plu authored
* Fix longformer * Apply style * Remove serving content * Forgot a condition * Apply style * Address Patrick's comments * Fix dtype
-
Boris Dayma authored
* feat(wandb): log artifacts
* fix: typo
* feat(wandb): ensure name is allowed
* feat(wandb): log artifact
* feat(wandb): saving logic
* style: improve formatting
* fix: unrelated typo
* feat: use a fake trainer
* fix: simplify
* feat(wandb): log model files as artifact
* style: fix style
* docs(wandb): correct description
* feat: unpack model + allow env Truthy values
* feat: TrainerCallback can access tokenizer
* style: fix style
* feat(wandb): log more interesting metadata
* feat: unpack tokenizer
* feat(wandb): metadata with load_best_model_at_end
* feat(wandb): more robust metadata
* style(wandb): fix formatting
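The "allow env Truthy values" item suggests a pattern like the following, where an environment variable such as `WANDB_LOG_MODEL` is interpreted case-insensitively against a small set of truthy spellings. The function name, variable name, and accepted values are hypothetical, not wandb's or transformers' actual API.

```python
import os

ENV_VARS_TRUE_VALUES = {"1", "true", "yes", "on"}  # accepted spellings (assumed)

def env_truthy(name: str, default: bool = False) -> bool:
    # case-insensitive check so TRUE, True and true all count as enabled
    value = os.environ.get(name)
    if value is None:
        return default
    return value.strip().lower() in ENV_VARS_TRUE_VALUES

os.environ["WANDB_LOG_MODEL"] = "TRUE"
```

This avoids the common trap where `bool(os.environ.get("FLAG", ""))` treats the string `"0"` or `"false"` as enabled.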
-
- 04 Jan, 2021 4 commits
-
-
Stas Bekman authored
-
Qbiwan authored
* bertweet docs coverage * style doc max len 119 * maxlen style rst * run main() from style_doc * changed according to comments
-
Stas Bekman authored
-
Patrick von Platen authored
-