- 14 Jan, 2021 7 commits
-
Lysandre Debut authored
-
Lysandre Debut authored
* conda build -> conda-build * Syntax error * conda build -> conda-build + 4.2.0 * Prepare to merge in `master`
-
Stas Bekman authored
* note on how to get to deps from shell
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* fix text
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Julien Plu authored
-
Julien Plu authored
* Compliancy with tf-nightly * Add more version + restore min version check
-
Sylvain Gugger authored
* Switch metrics in run_ner to datasets * Add flag to return all metrics * Upstream (and rename) sortish_sampler * Revert "Upstream (and rename) sortish_sampler" This reverts commit e07d0dcf650c2bae36da011dd76c77a8bb4feb0d.
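For context, run_ner now takes its token-classification metrics from the datasets library instead of computing them by hand. A minimal sketch of the underlying call, assuming the seqeval metric (requires the seqeval package) and illustrative label sequences:

```python
from datasets import load_metric

# seqeval computes entity-level precision/recall/F1 from BIO-tagged sequences
metric = load_metric("seqeval")

predictions = [["O", "B-PER", "I-PER", "O"]]  # illustrative model output
references = [["O", "B-PER", "I-PER", "O"]]   # illustrative gold labels

results = metric.compute(predictions=predictions, references=references)
print(results["overall_f1"], results["overall_accuracy"])
```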
-
Sylvain Gugger authored
* Fix Trainer with a parallel model * More clean up
-
- 13 Jan, 2021 14 commits
-
Patrick von Platen authored
-
Lysandre authored
-
Lysandre authored
-
Lysandre authored
-
Lysandre Debut authored
* Fix conversational pipeline test
* LayoutLM
* ProphetNet
* BART
* Blenderbot & small
* Marian
* mBART
* Pegasus
* Tapas tokenizer
* BERT2BERT test
* Style
* Example requirements
* TF BERT2BERT test
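For reference, a minimal sketch of exercising the conversational pipeline whose test is fixed here, using its default DialoGPT checkpoint (model choice and prompts are illustrative):

```python
from transformers import Conversation, pipeline

# The conversational pipeline tracks past turns via the Conversation object
chatbot = pipeline("conversational", model="microsoft/DialoGPT-medium")

conversation = Conversation("Hi, can you recommend a movie for tonight?")
conversation = chatbot(conversation)
print(conversation.generated_responses[-1])

conversation.add_user_input("Something a bit older, please.")
conversation = chatbot(conversation)
print(conversation.generated_responses[-1])
```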
-
Sylvain Gugger authored
* Fix data parallelism in Trainer
* Update src/transformers/training_args.py
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
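A minimal sketch of the multi-GPU data-parallel setup this concerns: with more than one visible GPU the Trainer wraps the model in torch.nn.DataParallel itself, and per_device_train_batch_size is the batch size on each GPU, not the total (checkpoint and values illustrative):

```python
from transformers import AutoModelForSequenceClassification, Trainer, TrainingArguments

model = AutoModelForSequenceClassification.from_pretrained("bert-base-cased", num_labels=2)

# With several visible GPUs the Trainer handles the DataParallel wrapping;
# the effective train batch size is per_device_train_batch_size * n_gpu.
args = TrainingArguments(output_dir="out", per_device_train_batch_size=8)
trainer = Trainer(model=model, args=args)
```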
-
Stas Bekman authored
-
Yusuke Mori authored
* Update run_glue for do_predict with local test data (#9442)
* Update run_glue (#9442): fix comments ('files' to 'a file')
* Update run_glue (#9442): reflect the code review
* Update run_glue (#9442): auto format
* Update run_glue (#9442): reflect the code review
-
LSinev authored
* make TopKLogitsWarper faster * make TopPLogitsWarper faster
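These warpers filter the next-token distribution during sampling. A minimal sketch of what they do, with random logits standing in for real model scores (vocabulary size and thresholds illustrative):

```python
import torch
from transformers import TopKLogitsWarper, TopPLogitsWarper

input_ids = torch.tensor([[101]])  # dummy prompt ids
scores = torch.randn(1, 50265)     # fake next-token logits

# Keep only the 50 highest-scoring tokens, then restrict to the top-p nucleus;
# everything else is set to -inf so it can never be sampled.
scores = TopKLogitsWarper(top_k=50)(input_ids, scores)
scores = TopPLogitsWarper(top_p=0.9)(input_ids, scores)

probs = torch.softmax(scores, dim=-1)
next_token = torch.multinomial(probs, num_samples=1)
```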
-
Pavel Tarashkevich authored
Co-authored-by: Pavel Tarashkevich <Pavel.Tarashkievich@orange.com>
-
Lysandre Debut authored
-
Julien Chaumond authored
* Update pretrained_models.rst To clarify things cf. this tweet for instance https://twitter.com/RTomMcCoy/status/1349094111505211395 * format
-
Suraj Patil authored
* add model_input_names * fix test
-
Stas Bekman authored
* deepspeed integration
* style
* add test
* ds wants to do its own backward
* fp16 assert
* Update src/transformers/training_args.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* style
* for clarity extract what args are being passed to deepspeed
* introduce the concept of self.wrapped_model
* s/self.wrapped_model/self.model_wrapped/
* complete transition to self.wrapped_model / self.model
* fix
* doc
* give ds its own init
* add custom overrides, handle bs correctly
* fix test
* clean up model_init logic, fix small bug
* complete fix
* collapse --deepspeed_config into --deepspeed
* style
* start adding doc notes
* style
* implement hf2ds optimizer and scheduler configuration remapping
* oops
* call get_num_training_steps absolutely when needed
* workaround broken auto-formatter
* deepspeed_config arg is no longer needed - fixed in deepspeed master
* use hf's fp16 args in config
* clean
* start on the docs
* rebase cleanup
* finish up --fp16
* clarify the supported stages
* big refactor thanks to discovering deepspeed.init_distributed
* cleanup
* revert fp16 part
* add checkpoint-support
* more init ds into integrations
* extend docs
* cleanup
* unfix docs
* clean up old code
* imports
* move docs
* fix logic
* make it clear which file it's referring to
* document nodes/gpus
* style
* wrong format
* style
* deepspeed handles gradient clipping
* easier to read
* major doc rewrite
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* docs
* switch to AdamW optimizer
* style
* Apply suggestions from code review
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* clarify doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
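For context, the integration lets a DeepSpeed JSON config drive ZeRO optimization and fp16 settings for the Trainer. A rough sketch, assuming the single --deepspeed argument introduced here takes the path to that config and that the script is launched with the deepspeed launcher; all config values are illustrative:

```python
import json
from transformers import TrainingArguments

# Illustrative DeepSpeed config: ZeRO stage 2 with fp16 enabled
ds_config = {
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "train_micro_batch_size_per_gpu": 8,
}
with open("ds_config.json", "w") as f:
    json.dump(ds_config, f)

# The Trainer picks the config up through TrainingArguments; the script itself
# is then typically launched via `deepspeed your_script.py --deepspeed ds_config.json`.
args = TrainingArguments(output_dir="out", fp16=True, deepspeed="ds_config.json")
```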
-
- 12 Jan, 2021 13 commits
-
Sylvain Gugger authored
* Use the right version of tokenizers * Try another way * Try another way * Deps are installed from there... * Deps are installed from there... * Revert last * remove needless comment
-
Sylvain Gugger authored
* Add target contextmanager and rework prepare_seq2seq_batch * Fix tests, treat BART and Barthez * Add last tokenizers * Fix test * Set src token before calling the superclass * Remove special behavior for T5 * Remove needless imports * Remove needless asserts
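This reworks how the seq2seq tokenizers encode target text; a rough sketch of the usage, assuming the new context manager is as_target_tokenizer (checkpoint and sentences illustrative):

```python
from transformers import MarianTokenizer

tok = MarianTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-ro")

src_texts = ["UN Chief says there is no military solution in Syria"]
tgt_texts = ["Şeful ONU declară că nu există o soluţie militară în Siria"]

batch = tok(src_texts, padding=True, return_tensors="pt")
# Inside the context manager the tokenizer switches to target-language mode,
# so the labels get the right special tokens and vocabulary.
with tok.as_target_tokenizer():
    labels = tok(tgt_texts, padding=True, return_tensors="pt")
batch["labels"] = labels["input_ids"]
```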
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Lysandre Debut authored
-
Lysandre Debut authored
-
NielsRogge authored
* Add LayoutLMForSequenceClassification and integration tests
  Improve docs
  Add LayoutLM notebook to list of community notebooks
* Make style & quality
* Address comments by @sgugger, @patrickvonplaten and @LysandreJik
* Fix rebase with master
* Reformat in one line
* Improve code examples as requested by @patrickvonplaten
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
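LayoutLM classifies a document from its tokens plus their positions on the page. A rough sketch of the new sequence-classification head, with made-up words, boxes, and label (boxes are normalized to a 0-1000 coordinate space):

```python
import torch
from transformers import LayoutLMForSequenceClassification, LayoutLMTokenizer

tokenizer = LayoutLMTokenizer.from_pretrained("microsoft/layoutlm-base-uncased")
model = LayoutLMForSequenceClassification.from_pretrained(
    "microsoft/layoutlm-base-uncased", num_labels=2
)

words = ["Invoice", "total:", "1200"]
word_boxes = [[72, 60, 180, 85], [190, 60, 260, 85], [270, 60, 330, 85]]  # illustrative

# Repeat each word's box for every sub-word token, and add boxes for [CLS]/[SEP]
token_boxes = []
for word, box in zip(words, word_boxes):
    token_boxes += [box] * len(tokenizer.tokenize(word))
token_boxes = [[0, 0, 0, 0]] + token_boxes + [[1000, 1000, 1000, 1000]]

encoding = tokenizer(" ".join(words), return_tensors="pt")
outputs = model(
    input_ids=encoding["input_ids"],
    bbox=torch.tensor([token_boxes]),
    attention_mask=encoding["attention_mask"],
    token_type_ids=encoding["token_type_ids"],
    labels=torch.tensor([1]),  # illustrative class label
)
print(outputs.loss, outputs.logits)
```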
-
Suraj Patil authored
* fix t5 fp16
-
Patrick von Platen authored
-
Lysandre Debut authored
-
Simon Brandeis authored
-
Patrick von Platen authored
* fix naming issues * better names
-
Patrick von Platen authored
* make templates ready
* make add_new_model_command_ready
* finish tf bart
* prepare tf mbart
* finish tf bart
* add tf mbart
* add marian
* prep pegasus
* add tf pegasus
* push blenderbot tf
* add blenderbot
* add blenderbot small
* clean-up
* make fix copy
* define blend bot tok
* fix
* up
* make style
* add to docs
* add copy statements
* overwrite changes
* improve
* fix docs
* finish
* fix last slow test
* fix missing git conflict line
* fix blenderbot
* up
* fix blenderbot small
* load changes
* finish copied from
* upload fix
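With the TF versions of the BART family in place, they can be used like their PyTorch counterparts; a minimal sketch with an illustrative summarization checkpoint and input:

```python
from transformers import BartTokenizer, TFBartForConditionalGeneration

tokenizer = BartTokenizer.from_pretrained("facebook/bart-large-cnn")
model = TFBartForConditionalGeneration.from_pretrained("facebook/bart-large-cnn")

article = "The tower is 324 metres tall, about the same height as an 81-storey building."
inputs = tokenizer([article], return_tensors="tf", truncation=True)

# Beam-search summary with the TF generate() implementation
summary_ids = model.generate(inputs["input_ids"], max_length=40, num_beams=4)
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True)[0])
```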
-
- 11 Jan, 2021 6 commits
-
Stas Bekman authored
After experimenting with different numbers of workers (https://github.com/huggingface/transformers/issues/9496#issuecomment-758145868), 4-5 workers seem to be optimal - let's go with 4, as surely we wouldn't find a CPU with fewer cores these days. Fixes part of https://github.com/huggingface/transformers/issues/9496 @sgugger
-
Stas Bekman authored
* round numbers * style * round only on logging
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Julien Plu authored
-
Stas Bekman authored
* fix bad merge - dropped code
* remove --model_parallel
* Deal with TrainingArguments
* Use a private attr and fix batch sizes
* fix _n_gpu
* add is_parallel helper wrapper
* fix attribute
* introduce a new attribute is_model_parallel
* docs
* docs
* Put back init False and rearrange doc
* Ignore non-init args in HFArgumentParser
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
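This cleans up how the Trainer handles a model that has been split across GPUs with naive model parallelism. A rough sketch of the user side, assuming a parallelizable model such as GPT-2 and at least two GPUs (checkpoint and batch size illustrative):

```python
from transformers import GPT2LMHeadModel, Trainer, TrainingArguments

model = GPT2LMHeadModel.from_pretrained("gpt2")
model.parallelize()  # spreads the transformer blocks across the available GPUs

# The Trainer notices the model is already parallelized (the new is_model_parallel
# attribute mentioned above) and adjusts device placement and batch-size accounting.
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", per_device_train_batch_size=4),
)
```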
-