Commits · 49c61a4ae7f269c5f590d62334c33832a29e0c7d · chenpangpang / transformers

10 Mar, 2021 7 commits

Extend trainer logging for sm (#10633) · 49c61a4a

Philipp Schmid authored Mar 10, 2021

* renamed logging to hf_logging

* changed logging from hf_logging to logging and logging to native_logging

* removed everything trying to fix import Trainer error

* adding imports again

* added custom add_handler function to logging.py

* make style

* added remove_handler

* added another conditional to assert

49c61a4a

Fix GPU tests with speech · 1aa9c13f
Sylvain Gugger authored Mar 10, 2021

1aa9c13f

Copy tokenizer files in each of their repo (#10624) · 2295d783

Sylvain Gugger authored Mar 10, 2021

* Move tokenizer files in each repo

* Fix mBART50 tests

* Fix mBART tests

* Fix Marian tests

* Update templates

2295d783

Speech2TextTransformer (#10175) · d26b37e7

Suraj Patil authored Mar 10, 2021



* s2t

* fix config

* conversion script

* fix import

* add tokenizer

* fix tok init

* fix tokenizer

* first version working

* fix embeds

* fix lm head

* remove extra heads

* fix convert script

* handle encoder attn mask

* style

* better enc attn mask

* override _prepare_attention_mask_for_generation

* handle attn_mask in encoder and decoder

* input_ids => input_features

* enable use_cache

* remove old code

* expand embeddings if needed

* remove logits bias

* masked_lm_loss => loss

* hack tokenizer to support feature processing

* fix model_input_names

* style

* fix error message

* doc

* remove inputs_embeds

* remove input_embeds

* remove unnecessary docstring

* quality

* SpeechToText => Speech2Text

* style

* remove shared_embeds

* subsample => conv

* remove Speech2TextTransformerDecoderWrapper

* update output_lengths formula

* fix table

* remove max_position_embeddings

* update conversion scripts

* add possibility to do upper case for now

* add FeatureExtractor and Processor

* add tests for extractor

* require_torch_audio => require_torchaudio

* add processor test

* update import

* remove classification head

* attention mask is now 1D

* update docstrings

* attention mask should be of type long

* handle attention mask from generate

* always return attention_mask

* fix test

* style

* doc

* Speech2TextTransformer => Speech2Text

* Speech2TextTransformerConfig => Speech2TextConfig

* remove dummy_inputs

* nit

* style

* multilingual tok

* fix tokenizer

* add tgt_lang setter

* save lang_codes

* fix tokenizer

* add forced_bos_token_id to tokenizer

* apply review suggestions

* add torchaudio to extra deps

* add speech deps to CI

* fix dep

* add libsndfile to ci

* libsndfile1

* add speech to extras all

* libsndfile1 -> libsndfile1

* libsndfile

* libsndfile1-dev

* apt update

* add sudo to install

* update deps table

* install libsndfile1-dev on CI

* tuple to list

* init conv layer

* add model tests

* quality

* add integration tests

* skip_special_tokens

* add speech_to_text_transformer in toctree

* fix tokenizer

* fix fp16 tests

* add tokenizer tests

* fix copyright

* input_values => input_features

* doc

* add model in readme

* doc

* change checkpoint names

* fix copyright

* fix code example

* add max_model_input_sizes in tokenizer

* fix integration tests

* add do_lower_case to tokenizer

* remove clamp trick

* fix "Add modeling imports here"

* fix copyrights

* fix tests

* SpeechToTextTransformer => SpeechToText

* fix naming

* fix table formatting

* fix typo

* style

* fix typos

* remove speech dep from extras[testing]

* fix copies

* rename doc file

* put imports under is_torch_available

* run feat extract tests when torch is available

* dummy objects for processor and extractor

* fix imports in tests

* fix import in modeling test

* fix imports

* fix torch import

* fix imports again

* fix positional embeddings

* fix typo in import

* adapt new extractor refactor

* style

* fix torchscript test

* doc

* doc

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix docs, copied from, style

* fix docstring

* handle imports

* remove speech from all extra deps

* remove s2t from seq2seq lm mapping

* better names

* skip training tests

* add install instructions

* List => Tuple

* doc

* fix conversion script

* fix urls

* add instruction for libsndfile

* fix fp16 test
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

d26b37e7

Add new GLUE example with no Trainer. (#10555) · efb5c0a4
Sylvain Gugger authored Mar 10, 2021
```
* Add new GLUE example with no Trainer.

* Style

* Address review comments
```
efb5c0a4
remove final_logits_bias (#10606) · 44f64132
Suraj Patil authored Mar 10, 2021

44f64132

Fixes an issue in `text-classification` where MNLI eval/test datasets are not... · 6f52fce6

Allen Wang authored Mar 09, 2021

Fixes an issue in `text-classification` where MNLI eval/test datasets are not being preprocessed. (#10621)

* Fix MNLI tests

* Linter fix

6f52fce6

09 Mar, 2021 9 commits

Fix tests of TrainerCallback (#10615) · 72d9e039

Sylvain Gugger authored Mar 09, 2021



* Fix tests of TrainerCallback

* Update tests/test_trainer_callback.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

72d9e039

Fairscale FSDP fix model save (#10596) · 0d909f6b
Sylvain Gugger authored Mar 09, 2021
```
* Hotfix fairscale FSDP

* Evaluation works

* Save on process zero
```
0d909f6b
added max_sample args and metrics changes (#10602) · ac17f711
Bhadresh Savani authored Mar 09, 2021

ac17f711

Trigger add sm information (#10610) · c19c811a

Philipp Schmid authored Mar 09, 2021

* added sm to ua

* update id

* removed id

* removed comments

* added env variable

* changed variable name

* make quality happy

* added sguggers feedback

* make styling happy and remove brackets

c19c811a

layerdrop 0 (#10604) · 20c10258
Suraj Patil authored Mar 09, 2021

20c10258
Update cache version for github actions · 95ab0677
Lysandre authored Mar 09, 2021

95ab0677

[FeatureExtractorSavingUtils] Refactor PretrainedFeatureExtractor (#10594) · 9a06b6b1

Patrick von Platen authored Mar 09, 2021



* save first version

* finish refactor

* finish refactor

* correct naming

* correct naming

* shorter names

* Update src/transformers/feature_extraction_common_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* change name

* finish
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

9a06b6b1

[docs] How to solve "Title level inconsistent" sphinx error (#10600) · b6a28e9a
Stas Bekman authored Mar 08, 2021
```
* How to solve: Title level inconsistent

* list chars
```
b6a28e9a

Speedup tf tests (#10601) · 546cbe7e

Lysandre Debut authored Mar 08, 2021

* Pipeline tests should be slow

* Temporarily mark some tests as slow

* Temporarily mark Barthez tests as slow

546cbe7e

08 Mar, 2021 20 commits

Add TFRag (#9002) · 696e8a43

Ratthachat (Jung) authored Mar 09, 2021

* Create modeling_tf_dpr.py

* Add TFDPR

* Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot

The last commit accidentally deleted these 4 lines, so I restored them

* Add TFDPR

* Add TFDPR

* clean up some comments, add TF input-style doc string

* Add TFDPR

* Make return_dict=False as default

* Fix return_dict bug (in .from_pretrained)

* Add get_input_embeddings()

* Create test_modeling_tf_dpr.py

The current version already passes all 27 tests!
Please see the test run at:
https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing



* fix quality

* delete init weights

* run fix copies

* fix repo consis

* del config_class, load_tf_weights

They should be 'pytorch only'

* add config_class back

After removing it, the tests failed, so only "use_tf_weights = None" is removed, on Lysandre's suggestion

* newline after .. note::

* import tf, np (Necessary for ModelIntegrationTest)

* slow_test from_pretrained with from_pt=True

At the moment we don't have TF weights (since we don't have an official TF model)
Previously, I did not run the slow tests, so I missed this bug

* Add simple TFDPRModelIntegrationTest

Note that this is just a test that TF and PyTorch give approximately the same output.
However, I could not test with the official DPR repo's output yet

* upload correct tf model

* remove position_ids as missing keys

* create modeling_tf_rag

* add tests for tf

* add tf tests

* revert wrong pt commit

* further refactor

* further refactor

* refactor

* Update modeling_tf_rag.py

- input_processing
- fix prepare_input_for_generation (mostly fix generate bug)
- bring back from_pretrained hack in order to test generate

* delete colab pieces of code

* Showcase greedy "generate"

Temporarily change from beam_search test to greedy_search test to show that TF and PT do get equivalent output.

* cosmetic update

* correct typos

* update

* push some progress

* make easy check

* fix rag save from pretrained

* Update src/transformers/modeling_tf_utils.py

* remove commented out lines

* delete unnecessary lines

* add simple test case for nq_checkpoint

Add nq_checkpoint test to show that current version without hack still fails

* temporarily put ugly hack back again

* Add TFRagSequenceForGeneration!!

* __init__.py , import TFRagSequenceForGeneration

* Add TFRagSequence tests!

* rag init.py - add TFRagSequenceForGeneration

* fix from_pretrained

* fix prepare_inputs_for_generation

* Beam search for RagToken!

* minor clean up

* add tf.cast in TFRagModel

* More tf.cast

* Add all remaining tests (still have issues)

* delete all T5 related

* make style

* fix load weight prefix

* fix bart

* fix return_dict for tf_rag

make all tests pass .. Hooray

* fix some tests

* fix code quality

* fix quality check

* finish tests tf rag

* add tf rag to docs

* remove TFT5 from docstring
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove TFT5 from docstring
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Delete outdated comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* improve doc strings

* add generative model classes

* fix adjust token logic

* refactor generate for TFRag

* using shape_list, not _get_shape
Co-authored-by: Julien Plu <plu.julien@gmail.com>

* axis=[1]->axis=1

* delete NEED_HELP comment

* improve readability
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve readability
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improve readability
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Indicating model is in a developing state in docstrings

As suggested by Julien

* small last changes

* apply sylvains suggestions

* finish tf rag
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patrickvonplaten <patrick@huggingface.co>
Co-authored-by: Julien Plu <plu.julien@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

696e8a43

Check layer types for Optimizer construction (#10598) · 3ced9b3e
Sylvain Gugger authored Mar 08, 2021
```
* Check layer types for Optimizer construction

* Duplicate class
```
3ced9b3e
Revert "Tests" · 821d518e
Sylvain Gugger authored Mar 08, 2021
```
This reverts commit b35e7b68.
```
821d518e
Revert "Style" · 4196bfed
Sylvain Gugger authored Mar 08, 2021
```
This reverts commit a8ec52ef.
```
4196bfed
Style · a8ec52ef
Sylvain Gugger authored Mar 08, 2021

a8ec52ef
Tests · b35e7b68
Sylvain Gugger authored Mar 08, 2021

b35e7b68
[examples tests on multigpu] resolving require_torch_non_multi_gpu_but_fix_me (#10561) · f284089e
Stas Bekman authored Mar 08, 2021
```
* batch 1

* this is tpu

* deebert attempt

* the rest
```
f284089e

Added max_sample_ arguments (#10551) · dfd16af8

Bhadresh Savani authored Mar 09, 2021

* reverted changes of logging and saving metrics

* added max_sample arguments

* fixed code

* white space diff

* reformatting code

* reformatted code

dfd16af8

[examples tests] various fixes (#10584) · 917f1045
Stas Bekman authored Mar 08, 2021
```
* fix sharded ddp enum

* test fixes

* stronger validation + apex breaks other tests
```
917f1045

offline mode for firewalled envs (part 2) (#10569) · 6f84531e

Stas Bekman authored Mar 08, 2021

* more readable test

* add all the missing places

* one more nltk

* better exception check

* revert

6f84531e

Fix version control with anchors (#10595) · 54693694
Sylvain Gugger authored Mar 08, 2021
```
* Fix version control with anchors

* Simplify
```
54693694
fix double wrapping + test (#10583) · f8829660
Stas Bekman authored Mar 08, 2021

f8829660

tokenization_marian.py: use current_spm for decoding (#10357) · b8805084

Mehrad Moradshahi authored Mar 08, 2021



* Fix Marian decoding

The tokenizer's decode and batch_decode now accept a new argument (use_source_tokenizer) which indicates whether the source spm should be used to decode ids. This is useful for Marian models specifically when decoding source input ids.

* Adapt docstrings
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

b8805084
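
The idea behind use_source_tokenizer in the commit above can be illustrated with a toy sketch. This is not the Marian implementation: `ToyDualVocabTokenizer` and its two dictionaries are hypothetical stand-ins for the source and target SentencePiece models, and defaulting to the target side is an assumption of this sketch.

```python
from typing import Dict, List


class ToyDualVocabTokenizer:
    """Hypothetical tokenizer with separate source-side and target-side lookups."""

    def __init__(self, source_vocab: Dict[int, str], target_vocab: Dict[int, str]) -> None:
        self.source_vocab = source_vocab
        self.target_vocab = target_vocab

    def decode(self, ids: List[int], use_source_tokenizer: bool = False) -> str:
        # Choose the lookup that matches the side the ids came from.
        vocab = self.source_vocab if use_source_tokenizer else self.target_vocab
        return " ".join(vocab[i] for i in ids)

    def batch_decode(self, batch: List[List[int]], use_source_tokenizer: bool = False) -> List[str]:
        # Forward the flag so every sequence is decoded with the same side.
        return [self.decode(ids, use_source_tokenizer=use_source_tokenizer) for ids in batch]


tokenizer = ToyDualVocabTokenizer(
    source_vocab={0: "hello", 1: "world"},
    target_vocab={0: "hallo", 1: "welt"},
)

# Ids that came from the source input are decoded with the source side.
source_text = tokenizer.decode([0, 1], use_source_tokenizer=True)
# Without the flag, the same ids are decoded with the target side.
target_text = tokenizer.decode([0, 1])
```

Here the same ids map to different text depending on the flag, which is why decoding source input ids with the target side would give the wrong string.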

Correct YAML · 8fd7eb34
Lysandre authored Mar 08, 2021

8fd7eb34
Enable torch 1.8.0 on GPU CI (#10593) · 89b8d4f5
Lysandre Debut authored Mar 08, 2021
```
* Enable torch 1.8.0 in GPU CI

* Disable torch-scatter
```
89b8d4f5

[M2M100] fix positional embeddings (#10590) · 2a737bff

Suraj Patil authored Mar 08, 2021

* fix tests

* emb should be a parameter

* fix positional embeddings

* fix make_weights

* don't save pos embeds

* add comment to describe the clamping

2a737bff

fix BART Summarization example in doc (#10582) · d59464db
Oren Amsalem authored Mar 08, 2021

d59464db
Fix typo in docstring for pipeline (#10591) · 3b583d02
Eunhyuk Shin authored Mar 08, 2021

3b583d02
fix nltk lookup (#10585) · e6ce636e
Stas Bekman authored Mar 07, 2021

e6ce636e
fix tf doc bug (#10570) · 9dd054fb
Yu authored Mar 08, 2021

9dd054fb

06 Mar, 2021 3 commits

Add m2m100 (#10236) · f6e74a63

Suraj Patil authored Mar 06, 2021

* m2m_100

* no layernorm_embedding

* sinusoidal positional embeddings

* update pos embeddings

* add default config values

* tokenizer

* add conversion script

* fix config

* fix pos embed

* remove _float_tensor

* update tokenizer

* update lang codes

* handle lang codes

* fix pos embeds

* fix spm key

* put embedding weights on device

* remove qa and seq classification heads

* fix convert script

* lang codes on one line

* fix embeds

* fix tokenizer

* fix tokenizer

* add fast tokenizer

* style

* M2M100MT => M2M100

* fix copyright, style

* tokenizer converter

* vocab file

* remove fast tokenizer

* fix embeds

* fix tokenizer

* fix tests

* add tokenizer tests

* add integration test

* quality

* fix model name

* fix test

* doc

* doc

* fix doc

* add copied from statements

* fix tokenizer tests

* apply review suggestions

* fix urls

* fix shift_tokens_right

* apply review suggestions

* fix

* fix doc

* add lang code to id

* remove unused function

* update checkpoint names

* fix copy

* fix tokenizer

* fix checkpoint names

* fix merge issue

* style

f6e74a63

Temporarily disable stale bot · fd011044
Lysandre authored Mar 06, 2021

fd011044

offline mode for firewalled envs (#10407) · 88a951e3

Stas Bekman authored Mar 05, 2021



* offline mode start

* add specific values

* fix fallback

* add test

* better values check and range

* test that actually works

* document the offline mode

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* more strict check

* cleaner test

* pt-only test

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

88a951e3

05 Mar, 2021 1 commit

Refactoring checkpoint names for multiple models (#10527) · 90ecc296

Daniel Hug authored Mar 05, 2021

* Refactor checkpoint name in ALBERT and ALBERT_tf

* Refactor checkpoint name in BART and BART_tf

* Refactor checkpoint name in BERT generation

* Refactor checkpoint name in Blenderbot_tf

* Refactor checkpoint name in Blenderbot_small_tf

* Refactor checkpoint name in ConvBERT AND CONVBERT_TF

* Refactor checkpoint name in CTRL AND CTRL_TF

* Refactor checkpoint name in DistilBERT AND DistilBERT_TF

* Refactor checkpoint name in DistilBERT redo

* Refactor checkpoint name in Electra and Electra_tf

* Refactor checkpoint name in FlauBERT and FlauBERT_tf

* Refactor checkpoint name in FSMT

* Refactor checkpoint name in GPT2 and GPT2_tf

* Refactor checkpoint name in IBERT

* Refactor checkpoint name in LED and LED_tf

* Refactor checkpoint name in Longformer and Longformer_tf

* Refactor checkpoint name in Lxmert and Lxmert_tf

* Refactor checkpoint name in Marian_tf

* Refactor checkpoint name in MBART and MBART_tf

* Refactor checkpoint name in MobileBERT and MobileBERT_tf

* Refactor checkpoint name in mpnet and mpnet_tf

* Refactor checkpoint name in openai and openai_tf

* Refactor checkpoint name in pegasus_tf

* Refactor checkpoint name in reformer

* Refactor checkpoint name in Roberta and Roberta_tf

* Refactor checkpoint name in SqueezeBert

* Refactor checkpoint name in Transformer_xl and Transformer_xl_tf

* Refactor checkpoint name in XLM and XLM_tf

* Refactor checkpoint name in XLNET and XLNET_tf

* Refactor checkpoint name in BERT_tf

* run make tests, style, quality, fixup

90ecc296