Commits · c949516695f74d59ab9051aabfafbf1e388b68a7 · chenpangpang / transformers

13 Jan, 2021 10 commits

Fix slow tests v4.2.0 (#9561) · c9495166

Lysandre Debut authored Jan 13, 2021

* Fix conversational pipeline test

* LayoutLM

* ProphetNet

* BART

* Blenderbot & small

* Marian

* mBART

* Pegasus

* Tapas tokenizer

* BERT2BERT test

* Style

* Example requirements

* TF BERT2BERT test

c9495166

Fix data parallelism in Trainer (#9566) · 04dc65e5

Sylvain Gugger authored Jan 13, 2021



* Fix data parallelism in Trainer

* Update src/transformers/training_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

04dc65e5

use correct deps for torchhub (#9552) · b2dfcc56
Stas Bekman authored Jan 13, 2021

b2dfcc56

Update run_glue for do_predict with local test data (#9442) (#9486) · eabad8fd

Yusuke Mori authored Jan 13, 2021

* Update run_glue for do_predict with local test data (#9442)

* Update run_glue (#9442): fix comments ('files' to 'a file')

* Update run_glue (#9442): reflect the code review

* Update run_glue (#9442): auto format

* Update run_glue (#9442): reflect the code review

eabad8fd

Speed up TopKLogitsWarper and TopPLogitsWarper (pytorch) (#9557) · 0c9f01a8
LSinev authored Jan 13, 2021
```
* make TopKLogitsWarper faster

* make TopPLogitsWarper faster
```
0c9f01a8
Fix classification script: enable dynamic padding with truncation (#9554) · 27d0e01d
Pavel Tarashkevich authored Jan 13, 2021
```
Co-authored-by: Pavel Tarashkevich <Pavel.Tarashkievich@orange.com>
```
27d0e01d
Fix barthez tokenizer (#9562) · 245cdb46
Lysandre Debut authored Jan 13, 2021

245cdb46

Doc: Update pretrained_models wording (#9545) · 247a7b20

Julien Chaumond authored Jan 13, 2021

* Update pretrained_models.rst

To clarify things cf. this tweet for instance https://twitter.com/RTomMcCoy/status/1349094111505211395

* format

247a7b20

fix BlenderbotSmallTokenizer (#9538) · 69ed3606
Suraj Patil authored Jan 13, 2021
```
* add model_input_names

* fix test
```
69ed3606

[trainer] deepspeed integration (#9211) · 2df34f4a

Stas Bekman authored Jan 12, 2021



* deepspeed integration

* style

* add test

* ds wants to do its own backward

* fp16 assert

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

* for clarity extract what args are being passed to deepspeed

* introduce the concept of self.wrapped_model

* s/self.wrapped_model/self.model_wrapped/

* complete transition to self.wrapped_model / self.model

* fix

* doc

* give ds its own init

* add custom overrides, handle bs correctly

* fix test

* clean up model_init logic, fix small bug

* complete fix

* collapse --deepspeed_config into --deepspeed

* style

* start adding doc notes

* style

* implement hf2ds optimizer and scheduler configuration remapping

* oops

* call get_num_training_steps absolutely when needed

* workaround broken auto-formatter

* deepspeed_config arg is no longer needed - fixed in deepspeed master

* use hf's fp16 args in config

* clean

* start on the docs

* rebase cleanup

* finish up --fp16

* clarify the supported stages

* big refactor thanks to discovering deepspeed.init_distributed

* cleanup

* revert fp16 part

* add checkpoint-support

* more init ds into integrations

* extend docs

* cleanup

* unfix docs

* clean up old code

* imports

* move docs

* fix logic

* make it clear which file it's referring to

* document nodes/gpus

* style

* wrong format

* style

* deepspeed handles gradient clipping

* easier to read

* major doc rewrite

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* docs

* switch to AdamW optimizer

* style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* clarify doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2df34f4a

12 Jan, 2021 13 commits

Use the right version of tokenizers (#9550) · 5f672103

Sylvain Gugger authored Jan 12, 2021

* Use the right version of tokenizers

* Try another way

* Try another way

* Deps are installed from there...

* Deps are installed from there...

* Revert last

* remove needless comment

5f672103

Refactor `prepare_seq2seq_batch` (#9524) · 063d8d27

Sylvain Gugger authored Jan 12, 2021

* Add target contextmanager and rework prepare_seq2seq_batch

* Fix tests, treat BART and Barthez

* Add last tokenizers

* Fix test

* Set src token before calling the superclass

* Remove special behavior for T5

* Remove needless imports

* Remove needless asserts

063d8d27

Revert, it was not the issue. · e6ecef71
Sylvain Gugger authored Jan 12, 2021

e6ecef71
Fix tokenizers install for now · 250f27f2
Sylvain Gugger authored Jan 12, 2021

250f27f2
topk -> top_k (#9541) · dfbf0f55
Lysandre Debut authored Jan 12, 2021

dfbf0f55
LayoutLM Config (#9539) · a1100fac
Lysandre Debut authored Jan 12, 2021

a1100fac

Improve LayoutLM (#9476) · e45eba3b

NielsRogge authored Jan 12, 2021



* Add LayoutLMForSequenceClassification and integration tests

Improve docs

Add LayoutLM notebook to list of community notebooks

* Make style & quality

* Address comments by @sgugger, @patrickvonplaten and @LysandreJik

* Fix rebase with master

* Reformat in one line

* Improve code examples as requested by @patrickvonplaten
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

e45eba3b

[T5] enable T5 fp16 (#9487) · ccd1923f
Suraj Patil authored Jan 12, 2021
```
* fix t5 fp16
```
ccd1923f
fix blenderbot tok (#9532) · 2aa9c2f2
Patrick von Platen authored Jan 12, 2021

2aa9c2f2
Shouldn't stale issues/PRs with feature request label (#9511) · 406cbf58
Lysandre Debut authored Jan 12, 2021

406cbf58
Update 'Develop on Windows' guidelines (#9519) · 3b67c5ab
Simon Brandeis authored Jan 12, 2021

3b67c5ab
[ProphetNet] Fix naming and wrong config (#9514) · a051d892
Patrick von Platen authored Jan 12, 2021
```
* fix naming issues

* better names
```
a051d892

[TFBart] Split TF-Bart (#9497) · 7f286132

Patrick von Platen authored Jan 12, 2021

* make templates ready

* make add_new_model_command_ready

* finish tf bart

* prepare tf mbart

* finish tf bart

* add tf mbart

* add marian

* prep pegasus

* add tf pegasus

* push blenderbot tf

* add blenderbot

* add blenderbot small

* clean-up

* make fix copy

* define blend bot tok

* fix

* up

* make style

* add to docs

* add copy statements

* overwrite changes

* improve

* fix docs

* finish

* fix last slow test

* fix missing git conflict line

* fix blenderbot

* up

* fix blenderbot small

* load changes

* finish copied from

* upload fix

7f286132

11 Jan, 2021 16 commits

[make docs] parallel build (#9522) · 0ecbb698

Stas Bekman authored Jan 11, 2021

After experimenting with different number of workers https://github.com/huggingface/transformers/issues/9496#issuecomment-758145868 4-5 workers seems to be the most optimal - let's go with 4 as surely we wouldn't find a cpu with less cores these days.

Fixes part of https://github.com/huggingface/transformers/issues/9496

@sgugger

0ecbb698

[trainer] round numbers in trainer state (#9491) · e6f211ca
Stas Bekman authored Jan 11, 2021
```
* round numbers

* style

* round only on logging
```
e6f211ca
Make doc styler behave properly on Windows (#9516) · 01a16840
Sylvain Gugger authored Jan 11, 2021

01a16840
Add link to forums thread · 6009668c
Sylvain Gugger authored Jan 11, 2021

6009668c
Fix cardinality (#9505) · ba702966
Julien Plu authored Jan 11, 2021

ba702966

[trainer] remove `--model_parallel` (#9451) · 33b74228

Stas Bekman authored Jan 11, 2021



* fix bad merge - dropped code

* remove --model_parallel

* Deal with TrainingArguments

* Use a private attr and fix batch sizes

* fix _n_gpu

* add is_parallel helper wrapper

* fix attribute

* introduce a new attribute is_model_parallel

* docs

* docs

* Put back init False and rearrange doc

* Ignore non-init args in HFArgumentParser
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

33b74228

[doc] How To Request Support document stab (#9288) · 6f635013

Stas Bekman authored Jan 11, 2021



* How To Request Support document stab

* integrate suggestions

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* small corrections

* expand on how to search for issues with examples

* address issues

* Update ISSUES.md
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* patrick's suggestion

* patrick's suggestion

* small fix
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

6f635013

Enable TruncationStrategy override for pipelines (#9432) · d20e9c72

Nicolas Patry authored Jan 11, 2021

* Enable TruncationStrategy override for pipelines

* Update isort.

* Fixing test

* Fixing text_generation pipeline.

* Using same DummyTok as other PR  for easier merge later.

* Some more import guards.

* Remove bogus file.

* Do not pass `generate_kwargs` to `_parse_and_tokenize`.
@patrickvonplaten

* Removed DummyTok.

* Doc quality.

d20e9c72

Make doc styler detect lists on rst (#9488) · 8d25df2c
Sylvain Gugger authored Jan 11, 2021

8d25df2c
New Updated DistilGPT-2 Finetuning and Generation (#9494) · 5a442a8d
Aakash Tripathi authored Jan 11, 2021
```
https://github.com/huggingface/transformers/pull/3177
```
5a442a8d
fix tf led pt test (#9513) · 6c8ec2a9
Patrick von Platen authored Jan 11, 2021

6c8ec2a9
Fix template (#9512) · 1e3c3622
Julien Plu authored Jan 11, 2021

1e3c3622
Remove tolerance + drop_rows_to_fit by default (#9507) · d415882b
Lysandre Debut authored Jan 11, 2021
```
* Remove tolerance + drop_rows_to_fit by default

* remove drop_rows_to_fit
```
d415882b

Full rework of the TF input/output embeddings and bias resizing (#9193) · 1243ee7d

Julien Plu authored Jan 11, 2021

* Start rework resizing

* Rework bias/decoder resizing

* Full resizing rework

* Full resizing rework

* Start to update the models with the new approach

* Finish to update the models

* Update all the tests

* Update the template

* Fix tests

* Fix tests

* Test a new approach

* Refactoring

* Refactoring

* Refactoring

* New rework

* Rework BART

* Rework bert+blenderbot

* Rework CTRL

* Rework Distilbert

* Rework DPR

* Rework Electra

* Rework Flaubert

* Rework Funnel

* Rework GPT2

* Rework Longformer

* Rework Lxmert

* Rework marian+mbart

* Rework mobilebert

* Rework mpnet

* Rework openai

* Rework pegasus

* Rework Roberta

* Rework T5

* Rework xlm+xlnet

* Rework template

* Fix TFT5EncoderOnly + DPRs

* Restore previous methods

* Fix Funnel

* Fix CTRL and TransforXL

* Apply style

* Apply Sylvain's comments

* Restore a test in DPR

* Address the comments

* Fix bug

* Apply style

* remove unused import

* Fix test

* Forgot a method

* missing test

* Trigger CI

* naming update

* Rebase

* Trigger CI

1243ee7d

Fix template (#9504) · cf416764
Julien Plu authored Jan 11, 2021

cf416764
fix-template (#9499) · 09926c8e
Richard Liaw authored Jan 10, 2021
```
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
```
09926c8e

10 Jan, 2021 1 commit
- Reformat (#9482) · 4f7022d6
  Julien Plu authored Jan 10, 2021
  
  4f7022d6