Commits · 6d3b688b04705f9faffefe8d7384db7e2b0bb3dc · chenpangpang / transformers

15 Jan, 2021 4 commits

Ignore lm_head decoder bias warning (#9615) · 6d3b688b

Lysandre Debut authored Jan 15, 2021

* Ignore lm_head decoder bias warning

* Revert "Ignore lm_head decoder bias warning"

This reverts commit f25177a9da6ca898e351f46c8b1515971de5c670.

* predictions -> lm_head

6d3b688b

Remove unused token_type_ids in MPNet (#9564) · 8eba1f8c

Julien Plu authored Jan 15, 2021



* Add warning

* Remove unused import

* Fix missing call

* Fix missing call

* Completely remove token_type_ids

* Apply style

* Remove unused import

* Update src/transformers/models/mpnet/modeling_tf_mpnet.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

8eba1f8c

[TF Led] Fix wrong decoder attention mask behavior (#9601) · 90ca8d36
Patrick von Platen authored Jan 15, 2021
```
* fix tf led

* remove loop file
```
90ca8d36
Revert "Gradient accumulation for TFTrainer (#9585)" · 85788bae
Kiyoung Kim authored Jan 15, 2021
```
This reverts commit 3f40070c.
```
85788bae

14 Jan, 2021 11 commits

[deepspeed doc] install issues + 1-gpu deployment (#9582) · 82498cbc

Stas Bekman authored Jan 14, 2021



* [doc] install + 1-gpu deployment

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* improvements
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

82498cbc

Upstream (and rename) sortish sampler (#9574) · 329fe274

Sylvain Gugger authored Jan 14, 2021



* Upstream (and rename) sortish sampler

* Use proper sampler

* Update src/transformers/trainer_pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

329fe274

Gradient accumulation for TFTrainer (#9585) · 3f40070c

Kiyoung Kim authored Jan 15, 2021



* gradient accumulation for tftrainer

* label naming
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* label naming
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3f40070c

v4.2.1 in docs · e43f3b61
Lysandre authored Jan 14, 2021

e43f3b61
BatchEncoding.to with device with tests (#9584) · 280db79a
Lysandre Debut authored Jan 14, 2021

280db79a

Fix conda build (#9589) · 8bf27075

Lysandre Debut authored Jan 14, 2021

* conda build -> conda-build

* Syntax error

* conda build -> conda-build + 4.2.0

* Prepare to merge in `master`

8bf27075

[setup.py] note on how to get to transformers exact dependencies from shell (#9553) · c99751dd

Stas Bekman authored Jan 14, 2021



* note on how to get to deps from shell

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* fix text
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c99751dd

Make logs tf compliant (#9565) · a26536f0
Julien Plu authored Jan 14, 2021

a26536f0
Compliancy with tf-nightly (#9570) · 14d677ca
Julien Plu authored Jan 14, 2021
```
* Compliancy with tf-nightly

* Add more version + restore min version check
```
14d677ca

Switch metrics in run_ner to datasets (#9567) · 46ed56cf

Sylvain Gugger authored Jan 14, 2021

* Switch metrics in run_ner to datasets

* Add flag to return all metrics

* Upstream (and rename) sortish_sampler

* Revert "Upstream (and rename) sortish_sampler"

This reverts commit e07d0dcf650c2bae36da011dd76c77a8bb4feb0d.

46ed56cf

Fix Trainer with a parallel model (#9578) · 5e1bea4f
Sylvain Gugger authored Jan 14, 2021
```
* Fix Trainer with a parallel model

* More clean up
```
5e1bea4f

13 Jan, 2021 14 commits

Update README.md · 126fd281
Patrick von Platen authored Jan 13, 2021

126fd281
v4.3.0.dev0 · e63cad79
Lysandre authored Jan 13, 2021

e63cad79
v4.2.0 documentation · 33a8497d
Lysandre authored Jan 13, 2021

33a8497d
Release: v4.2.0 · 7d9a9d0c
Lysandre authored Jan 13, 2021

7d9a9d0c

Fix slow tests v4.2.0 (#9561) · c9495166

Lysandre Debut authored Jan 13, 2021

* Fix conversational pipeline test

* LayoutLM

* ProphetNet

* BART

* Blenderbot & small

* Marian

* mBART

* Pegasus

* Tapas tokenizer

* BERT2BERT test

* Style

* Example requirements

* TF BERT2BERT test

c9495166

Fix data parallelism in Trainer (#9566) · 04dc65e5

Sylvain Gugger authored Jan 13, 2021



* Fix data parallelism in Trainer

* Update src/transformers/training_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

04dc65e5

use correct deps for torchhub (#9552) · b2dfcc56
Stas Bekman authored Jan 13, 2021

b2dfcc56

Update run_glue for do_predict with local test data (#9442) (#9486) · eabad8fd

Yusuke Mori authored Jan 13, 2021

* Update run_glue for do_predict with local test data (#9442)

* Update run_glue (#9442): fix comments ('files' to 'a file')

* Update run_glue (#9442): reflect the code review

* Update run_glue (#9442): auto format

* Update run_glue (#9442): reflect the code review

eabad8fd

Speed up TopKLogitsWarper and TopPLogitsWarper (pytorch) (#9557) · 0c9f01a8
LSinev authored Jan 13, 2021
```
* make TopKLogitsWarper faster

* make TopPLogitsWarper faster
```
0c9f01a8
Fix classification script: enable dynamic padding with truncation (#9554) · 27d0e01d
Pavel Tarashkevich authored Jan 13, 2021
```
Co-authored-by: Pavel Tarashkevich <Pavel.Tarashkievich@orange.com>
```
27d0e01d
Fix barthez tokenizer (#9562) · 245cdb46
Lysandre Debut authored Jan 13, 2021

245cdb46

Doc: Update pretrained_models wording (#9545) · 247a7b20

Julien Chaumond authored Jan 13, 2021

* Update pretrained_models.rst

To clarify things cf. this tweet for instance https://twitter.com/RTomMcCoy/status/1349094111505211395

* format

247a7b20

fix BlenderbotSmallTokenizer (#9538) · 69ed3606
Suraj Patil authored Jan 13, 2021
```
* add model_input_names

* fix test
```
69ed3606

[trainer] deepspeed integration (#9211) · 2df34f4a

Stas Bekman authored Jan 12, 2021



* deepspeed integration

* style

* add test

* ds wants to do its own backward

* fp16 assert

* Update src/transformers/training_args.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

* for clarity extract what args are being passed to deepspeed

* introduce the concept of self.wrapped_model

* s/self.wrapped_model/self.model_wrapped/

* complete transition to self.wrapped_model / self.model

* fix

* doc

* give ds its own init

* add custom overrides, handle bs correctly

* fix test

* clean up model_init logic, fix small bug

* complete fix

* collapse --deepspeed_config into --deepspeed

* style

* start adding doc notes

* style

* implement hf2ds optimizer and scheduler configuration remapping

* oops

* call get_num_training_steps absolutely when needed

* workaround broken auto-formatter

* deepspeed_config arg is no longer needed - fixed in deepspeed master

* use hf's fp16 args in config

* clean

* start on the docs

* rebase cleanup

* finish up --fp16

* clarify the supported stages

* big refactor thanks to discovering deepspeed.init_distributed

* cleanup

* revert fp16 part

* add checkpoint-support

* more init ds into integrations

* extend docs

* cleanup

* unfix docs

* clean up old code

* imports

* move docs

* fix logic

* make it clear which file it's referring to

* document nodes/gpus

* style

* wrong format

* style

* deepspeed handles gradient clipping

* easier to read

* major doc rewrite

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* docs

* switch to AdamW optimizer

* style

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* clarify doc
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2df34f4a

12 Jan, 2021 11 commits
- Use the right version of tokenizers (#9550) · 5f672103
  Sylvain Gugger authored Jan 12, 2021
```
* Use the right version of tokenizers

* Try another way

* Try another way

* Deps are installed from there...

* Deps are installed from there...

* Revert last

* remove needless comment
```
  5f672103
- Refactor `prepare_seq2seq_batch` (#9524) · 063d8d27
  Sylvain Gugger authored Jan 12, 2021
```
* Add target contextmanager and rework prepare_seq2seq_batch

* Fix tests, treat BART and Barthez

* Add last tokenizers

* Fix test

* Set src token before calling the superclass

* Remove special behavior for T5

* Remove needless imports

* Remove needless asserts
```
  063d8d27
- Revert, it was not the issue. · e6ecef71
  Sylvain Gugger authored Jan 12, 2021
  
  e6ecef71
- Fix tokenizers install for now · 250f27f2
  Sylvain Gugger authored Jan 12, 2021
  
  250f27f2
- topk -> top_k (#9541) · dfbf0f55
  Lysandre Debut authored Jan 12, 2021
  
  dfbf0f55
- LayoutLM Config (#9539) · a1100fac
  Lysandre Debut authored Jan 12, 2021
  
  a1100fac
- Improve LayoutLM (#9476) · e45eba3b
  NielsRogge authored Jan 12, 2021
```
* Add LayoutLMForSequenceClassification and integration tests

Improve docs

Add LayoutLM notebook to list of community notebooks

* Make style & quality

* Address comments by @sgugger, @patrickvonplaten and @LysandreJik

* Fix rebase with master

* Reformat in one line

* Improve code examples as requested by @patrickvonplaten
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
```
  e45eba3b
- [T5] enable T5 fp16 (#9487) · ccd1923f
  Suraj Patil authored Jan 12, 2021
```
* fix t5 fp16
```
  ccd1923f
- fix blenderbot tok (#9532) · 2aa9c2f2
  Patrick von Platen authored Jan 12, 2021
  
  2aa9c2f2
- Shouldn't stale issues/PRs with feature request label (#9511) · 406cbf58
  Lysandre Debut authored Jan 12, 2021
  
  406cbf58
- Update 'Develop on Windows' guidelines (#9519) · 3b67c5ab
  Simon Brandeis authored Jan 12, 2021
  
  3b67c5ab