Commits · 9a25c5bd3afeab85a80acb1a5348beec1d2cbbfd · chenpangpang / transformers

18 Dec, 2020 7 commits

Add new run_swag example (#9175) · 9a25c5bd

Sylvain Gugger authored Dec 18, 2020



* Add new run_swag example

* Add check

* Add sample

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Very important change to make Lysandre happy
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

9a25c5bd

Fix typo · 3e56e2ce
Sylvain Gugger authored Dec 18, 2020

3e56e2ce
Fix link to old SQUAD fine-tuning script (#9181) · 077a5dce
Manuel Romero authored Dec 18, 2020

077a5dce

[setup] correct transformers version format (#9176) · 84d5879e

Stas Bekman authored Dec 18, 2020

setuptools has a pretty fixed expectation of version numbers.

This PR fixes the dev version number and adds a comment with correct formats for the future editors

This fix removes this warning on `make fixup|style|etc` or any other time `setup.py` is being run.
```
setuptools/dist.py:452: UserWarning: Normalizing '4.2.0dev0' to '4.2.0.dev0'
  warnings.warn(tmpl.format(**locals()))
```
and the alternative:
```
/setuptools/dist.py:452: UserWarning: Normalizing '4.0.0-rc-1' to '4.0.0rc1
```

Fixes: #8749

@LysandreJik, @sgugger

84d5879e

fixed JSON error in run_qa with fp16 (#9186) · fd7b6a52
Wissam Antoun authored Dec 18, 2020

fd7b6a52
Fix link to old NER fine-tuning script (#9182) · 66a14a2f
Manuel Romero authored Dec 18, 2020

66a14a2f
[trainer] apex fixes and tests (#9180) · f06d0fad
Stas Bekman authored Dec 17, 2020

f06d0fad

17 Dec, 2020 12 commits
- Added TF CTRL Sequence Classification (#9151) · 467e9158
  sandip authored Dec 18, 2020
```
* Added TF CTRL Sequence Classification

* code refactor
```
  467e9158
- add tests for the new sharded ddp fairscale integration (#9177) · 63841c55
  Stas Bekman authored Dec 17, 2020
  
  63841c55
- setup.py development version · bf713cde
  Lysandre authored Dec 17, 2020
  
  bf713cde
- v4.1.1 docs · bd40345d
  Lysandre authored Dec 17, 2020
  
  bd40345d
- Release: v4.1.1 · bfa4ccf7
  Lysandre authored Dec 17, 2020
  
  bfa4ccf7
- Fix TAPAS doc · e0790cca
  Lysandre authored Dec 17, 2020
  
  e0790cca
- Put all models in the constants (#9170) · 6d2e864d
  Sylvain Gugger authored Dec 17, 2020
```
* Put all models in the constants

* Add Google AI mention in the main README
```
  6d2e864d
- v4.1.0 docs · f83d9c8d
  Lysandre authored Dec 17, 2020
  
  f83d9c8d
- Release: v4.1.0 · f5438ab8
  Lysandre authored Dec 17, 2020
  
  f5438ab8
- Remove erroneous character · ac2c7e39
  Lysandre authored Dec 17, 2020
  
  ac2c7e39
- Fix gradient clipping for Sharded DDP (#9168) · 77d6941e
  Sylvain Gugger authored Dec 17, 2020
```
* Fix gradient clipping for Sharded DDP

* Fix typos in comments
```
  77d6941e
- Add disclaimer to TAPAS rst file (#9167) · 1aca3d6a
  Lysandre Debut authored Dec 17, 2020
```
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
```
  1aca3d6a
16 Dec, 2020 10 commits

Torch scatter with torch 1.7.0 · dc9f2454
Lysandre authored Dec 16, 2020

dc9f2454

Experimental support for fairscale ShardedDDP (#9139) · 9a671853

Sylvain Gugger authored Dec 16, 2020

* Experimental stupport for fairscale ShardedDDP

* Add import error if fairscale not available

* Address review comments

* Fix seq2seq trainer

9a671853

TableQuestionAnsweringPipeline (#9145) · 1c1a2ffb

Lysandre Debut authored Dec 16, 2020



* AutoModelForTableQuestionAnswering

* TableQuestionAnsweringPipeline

* Apply suggestions from Patrick's code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Sylvain and Patrick comments

* Better PyTorch/TF error message

* Add integration tests

* Argument Handler naming
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

* Fix docs to appease the documentation gods
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

1c1a2ffb

AutoModelForTableQuestionAnswering (#9154) · 07384baf

Lysandre Debut authored Dec 16, 2020

* AutoModelForTableQuestionAnswering

* Update src/transformers/models/auto/modeling_auto.py

* Style

07384baf

Add message to documentation that longformer doesn't support token_type_ids (#9152) · 34334662
Hayden Housen authored Dec 16, 2020
```
* Add message to documentation that longformer doesn't support token_type_ids

* Format changes
```
34334662
hotfix torch scatter version · 2f918def
Lysandre authored Dec 16, 2020

2f918def
Update notebook table and transformers intro notebook (#9136) · 4d489735
Sylvain Gugger authored Dec 16, 2020

4d489735

Support for private models from huggingface.co (#9141) · fb650df8

Julien Chaumond authored Dec 16, 2020



* minor wording tweaks

* Create private model repo + exist_ok flag

* file_utils: `use_auth_token`

* Update src/transformers/file_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Propagate doc from @sgugger
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fb650df8

DistilBertForSequenceClassification (#9148) · c69d19fa
AndreaSottana authored Dec 16, 2020
```
fix small shape error in comments
```
c69d19fa

[Flax] Align FlaxBertForMaskedLM with BertForMaskedLM, implement from_pretrained, init (#9054) · 640e6fe1

Patrick von Platen authored Dec 16, 2020



* save intermediate

* save intermediate

* save intermediate

* correct flax bert model file

* new module / model naming

* make style

* almost finish BERT

* finish roberta

* make fix-copies

* delete keys file

* last refactor

* fixes in run_mlm_flax.py

* remove pooled from run_mlm_flax.py`

* fix gelu | gelu_new

* remove Module from inits

* splits

* dirty print

* preventing warmup_steps == 0

* smaller splits

* make fix-copies

* dirty print

* dirty print

* initial_evaluation argument

* declaration order fix

* proper model initialization/loading

* proper initialization

* run_mlm_flax improvements: improper model inputs bugfix + automatic dataset splitting + tokenizers parallelism warning + avoiding warmup_steps=0 bug

* removed tokenizers warning hack, fixed model re-initialization

* reverted training_args.py changes

* fix flax from pretrained

* improve test in flax

* apply sylvains tips

* update init

* make 0.3.0 compatible

* revert tevens changes

* revert tevens changes 2

* finalize revert

* fix bug

* add docs

* add pretrained to init

* Update src/transformers/modeling_flax_utils.py

* fix copies

* final improvements
Co-authored-by: TevenLeScao <teven.lescao@gmail.com>

640e6fe1

15 Dec, 2020 11 commits

Fix fp16_backend field · 51adb97c
Sylvain Gugger authored Dec 15, 2020

51adb97c

[WIP] Tapas v4 (tres) (#9117) · 1551e2dc

NielsRogge authored Dec 15, 2020



* First commit: adding all files from tapas_v3

* Fix multiple bugs including soft dependency and new structure of the library

* Improve testing by adding torch_device to inputs and adding dependency on scatter

* Use Python 3 inheritance rather than Python 2

* First draft model cards of base sized models

* Remove model cards as they are already on the hub

* Fix multiple bugs with integration tests

* All model integration tests pass

* Remove print statement

* Add test for convert_logits_to_predictions method of TapasTokenizer

* Incorporate suggestions by Google authors

* Fix remaining tests

* Change position embeddings sizes to 512 instead of 1024

* Comment out positional embedding sizes

* Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES

* Added more model names

* Fix truncation when no max length is specified

* Disable torchscript test

* Make style & make quality

* Quality

* Address CI needs

* Test the Masked LM model

* Fix the masked LM model

* Truncate when overflowing

* More much needed docs improvements

* Fix some URLs

* Some more docs improvements

* Test PyTorch scatter

* Set to slow + minify

* Calm flake8 down

* First commit: adding all files from tapas_v3

* Fix multiple bugs including soft dependency and new structure of the library

* Improve testing by adding torch_device to inputs and adding dependency on scatter

* Use Python 3 inheritance rather than Python 2

* First draft model cards of base sized models

* Remove model cards as they are already on the hub

* Fix multiple bugs with integration tests

* All model integration tests pass

* Remove print statement

* Add test for convert_logits_to_predictions method of TapasTokenizer

* Incorporate suggestions by Google authors

* Fix remaining tests

* Change position embeddings sizes to 512 instead of 1024

* Comment out positional embedding sizes

* Update PRETRAINED_VOCAB_FILES_MAP and PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES

* Added more model names

* Fix truncation when no max length is specified

* Disable torchscript test

* Make style & make quality

* Quality

* Address CI needs

* Test the Masked LM model

* Fix the masked LM model

* Truncate when overflowing

* More much needed docs improvements

* Fix some URLs

* Some more docs improvements

* Add add_pooling_layer argument to TapasModel

Fix comments by @sgugger and @patrickvonplaten

* Fix issue in docs + fix style and quality

* Clean up conversion script and add task parameter to TapasConfig

* Revert the task parameter of TapasConfig

Some minor fixes

* Improve conversion script and add test for absolute position embeddings

* Improve conversion script and add test for absolute position embeddings

* Fix bug with reset_position_index_per_cell arg of the conversion cli

* Add notebooks to the examples directory and fix style and quality

* Apply suggestions from code review

* Move from `nielsr/` to `google/` namespace

* Apply Sylvain's comments
Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Rogge Niels <niels.rogge@howest.be>
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: sgugger <sylvain.gugger@gmail.com>

1551e2dc

Add possibility to switch between APEX and AMP in Trainer (#9137) · ad895af9

Sylvain Gugger authored Dec 15, 2020



* Add possibility to switch between APEX and AMP in Trainer

* Update src/transformers/training_args.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Address review comments

* Update src/transformers/training_args.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

ad895af9

Add large model config (#9140) · 0b2f46fa
Lysandre Debut authored Dec 15, 2020

0b2f46fa

[Examples] Add automatic dataset splitting in language-modeling examples (#9133) · 2a7e8e16

Teven authored Dec 15, 2020

* replaced jnp.split + removing textual model inputs + ensuring warmup_steps > 0

* Add automatic dataset splitting in language-modeling examples

2a7e8e16

Fix add order (#9129) · e7717497
Julien Plu authored Dec 15, 2020

e7717497
Fix Bart Shift (#9135) · 18ecd36f
Patrick von Platen authored Dec 15, 2020
```
* correct mistake in order

* fix tensor copy

* clone tensor correctly
```
18ecd36f
correct mistake in order (#9134) · d018622d
Patrick von Platen authored Dec 15, 2020

d018622d
fix bart loss masking (#9131) · 80bdb9c3
Patrick von Platen authored Dec 15, 2020

80bdb9c3
Fix typo in trainer_tf.py (#9132) · 3caba8d3
Manbish authored Dec 15, 2020

3caba8d3

[TF Bart] Refactor TFBart (#9029) · abc573f5

Patrick von Platen authored Dec 15, 2020

* reorder file

* delete unnecesarry function

* make style

* save intermediate

* fix attention masks

* correct tf bart past key values

* solve merge conflict bug

* correct tensor dims

* save intermediate tf

* change attn layer

* fix typo re-order past

* inputs_embeds

* make fix copies

* finish tests

* fix graph mode

* appyl lysandres suggestions

abc573f5