Commits · f6e53e3c2bafb37c861db71a4b28c304403af92b · chenpangpang / transformers

19 Feb, 2021 14 commits
- Fix example links in the task summary (#10291) · f6e53e3c
  Sylvain Gugger authored Feb 19, 2021
  
  f6e53e3c
- Move the TF NER example (#10276) · 536aee99
  Julien Plu authored Feb 19, 2021
  
  536aee99
- Zero shot distillation script cuda patch (#10284) · cbadb524
  Joe Davison authored Feb 19, 2021
  
  cbadb524
- Kill any run-away pytest processes (#10281) · f1299f50
  Stas Bekman authored Feb 19, 2021
  
  f1299f50
- Introduce logging_strategy training argument (#10267) · 709c86b5
  Tanmay Garg authored Feb 19, 2021
```
Introduce logging_strategy training argument
in TrainingArguments and TFTrainingArguments. (#9838)
```
  709c86b5
- Making TF OpenAI GPT model compliant with AMP and XLA (#10261) · 34df26ec
  Julien Plu authored Feb 19, 2021
```
* Fix AMP and XLA

* Remove useless var
```
  34df26ec
- Making TF TransfoXL model compliant with AMP (#10264) · 3e116ed3
  Julien Plu authored Feb 19, 2021
```
* Fix AMP

* Apply style

* Remove unused import
```
  3e116ed3
- Fix XLA and AMP (#10262) · 86caeb76
  Julien Plu authored Feb 19, 2021
  
  86caeb76
- Making TF MPNet model compliant with XLA (#10260) · 3d72d47f
  Julien Plu authored Feb 19, 2021
```
* Fix XLA

* Rework cast

* Apply style
```
  3d72d47f
- Making TF MobileBert model compliant with AMP (#10259) · fb56bf25
  Julien Plu authored Feb 19, 2021
```
* Fix AMP

* Trigger CI

* Rework cast
```
  fb56bf25
- Making TF Lxmert model compliant with AMP (#10257) · 2fc6284f
  Julien Plu authored Feb 19, 2021
```
* Fix AMP

* Rework cast

* Apply style
```
  2fc6284f
- [ISSUES.md] propose using google colab to reproduce problems (#10270) · d27b28d9
  Stas Bekman authored Feb 18, 2021
```
* propose using google colab to reproduce problems

* Update ISSUES.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  d27b28d9
- [trainer] implement support for full fp16 in evaluation/predict (#10268) · 4eddc459
  Stas Bekman authored Feb 18, 2021
```
* implement --fp16_full_eval

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style

* add test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  4eddc459
- fix func signature (#10271) · d9a81fc0
  Stas Bekman authored Feb 18, 2021
  
  d9a81fc0
18 Feb, 2021 6 commits

- Script for distilling zero-shot classifier to more efficient student (#10244) · c6fe1755
  Joe Davison authored Feb 18, 2021

* add zero-shot distillation script

* readme wordsmithing

* clean up code

* add multi-gpu teacher inference
plus tidying up more code

* add use_fast_tokenizer arg

* update results in readme

* more readme wordsmithing

* style

* Add handle to readme
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* fix code block

* add error+docs about distributed & tpu

* add @sgugger format requests

* xla -> tpu

* support fp16 for teacher preds

* no checkpoint by default

* add demo colab link

* add model sharing prompt + model link

* correct resulting acc of example
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

c6fe1755

- [Trainer] memory tracker metrics (#10225) · 97e688bc
  Stas Bekman authored Feb 18, 2021

* memory tracker metrics

* go back to eval for somewhat consistency

* handle no-gpu case

* deal with stackable eval calls

* restore callback order

* style

* simplify the API

* add test

* docs

* consistently use eval_ prefix

* improve docs

* Update src/transformers/trainer_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* rename method

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

97e688bc

- Introduce warmup_ratio training argument (#10229) · d7f38c5d
  Tanmay Garg authored Feb 18, 2021

Introduce warmup_ratio training argument in both
TrainingArguments and TFTrainingArguments classes (#6673)

d7f38c5d

- Reduce the time spent for the TF slow tests (#10152) · 2acae50a
  Julien Plu authored Feb 18, 2021
```
* rework savedmodel slow test

* Improve savedmodel tests

* Remove useless content
```
  2acae50a
- Fix AMP (#10216) · 14ed3b97
  Julien Plu authored Feb 18, 2021
  
  14ed3b97
- Making TF GPT2 compliant with XLA and AMP (#10230) · bdf1669e
  Julien Plu authored Feb 18, 2021
```
* Fix XLA and AMP

* Fix AMP and XLA

* Apply style

* Apply Patrick's comment
```
  bdf1669e

17 Feb, 2021 8 commits
- update to new script; notebook notes (#10241) · 5da7c78e
  Stas Bekman authored Feb 17, 2021
  
  5da7c78e
- [trainer] refactor place_model_on_device logic, add deepspeed (#10243) · dee876ce
  Stas Bekman authored Feb 17, 2021
```
* refactor place_model_on_device logic, add deepspeed

* doc

* style
```
  dee876ce
- [CI] 2 fixes (#10248) · d1eb88f4
  Stas Bekman authored Feb 17, 2021
```
* fix invalid port

* missing requirements
```
  d1eb88f4
- Make TF CTRL compliant with XLA and AMP (#10209) · 7246785a
  Julien Plu authored Feb 17, 2021
```
* Fix XLA and AMP

* Apply style

* Remove useless cast
```
  7246785a
- Making TF XLM-like models XLA and AMP compliant (#10211) · fdb2351e
  Julien Plu authored Feb 17, 2021
```
* Fix Flaubert and XLM

* Remove useless cast

* Tiny fix

* Tiny fix
```
  fdb2351e
- Making TF BART-like models XLA and AMP compliant (#10191) · 83d803ba
  Julien Plu authored Feb 17, 2021
```
* Update BART

* Update Blenderbot

* Update BlenderbotSmall

* Update Marian

* Update MBart

* Update MBart

* Update Pegasus

* Update template

* Fix Marian and Pegasus

* Apply style

* Default initializer

* Default initializer

* Default initializer

* Remove int32 casts

* Fix template

* Remove more cast
```
  83d803ba
- Fix head masking for TFT5 (#9877) · 8d79e5ca
  Daniel Stancl authored Feb 17, 2021
```
* Fix head_mask and decoder_head_mask in TFT5 models

* Enable test_headmasking for both the TFT5 tester
and the TFT5EncoderOnly tester
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
```
  8d79e5ca
- Factor out methods (#10215) · 4b919657
  Lysandre Debut authored Feb 17, 2021
  
  4b919657
16 Feb, 2021 5 commits
- [trainer] fix ignored columns logger (#10219) · e94d63f6
  Stas Bekman authored Feb 16, 2021
```
* [trainer] fix ignored columns logger

This PR fixes a confusing log entry that says:
  The following columns in the evaluation set don't have a corresponding argument in `T5ForConditionalGeneration.forward` and have been ignored: .
when everything is in order.

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  e94d63f6
- fix add_token_positions fn (#10217) · 4210cd96
  Joe Davison authored Feb 16, 2021
  
  4210cd96
- Store FLOS as floats to avoid overflow. (#10213) · 7169d1ea
  Sylvain Gugger authored Feb 16, 2021
  
  7169d1ea
- set tgt_lang of MBart Tokenizer for summarization (#10205) · df1b0fb5
  Zhang Cheng authored Feb 16, 2021
  
  df1b0fb5
- Unlock XLA test for convbert (#10207) · 5c2d66a2
  Julien Plu authored Feb 16, 2021
  
  5c2d66a2
15 Feb, 2021 7 commits

- [WIP][examples/seq2seq] move old s2s scripts to legacy (#10136) · 1c8c2d9a
  Suraj Patil authored Feb 16, 2021

* move old s2s scripts to legacy

* add the tests back

* proper rename

* restore

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

1c8c2d9a

- make the sub-group of tests run always (#10196) · 96897a35
  Stas Bekman authored Feb 15, 2021
  
  96897a35

- Specify dataset dtype (#10195) · 8cbd0bd1
  Lysandre Debut authored Feb 15, 2021

Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>

8cbd0bd1

- fix run_seq2seq.py; porting trainer tests to it (#10162) · 0b1f552a
  Stas Bekman authored Feb 15, 2021

* fix run_seq2seq.py; porting DeepSpeed tests to it

* unrefactor

* defensive programming

* defensive programming 2

* port the rest of the trainer tests

* style

* a cleaner scripts dir finder

* cleanup

0b1f552a

- Add AMP for Albert (#10141) · 31b0560a
  Julien Plu authored Feb 15, 2021
  
  31b0560a

- Add mBART-50 (#10154) · 6fc940ed
  Suraj Patil authored Feb 15, 2021

* add tokenizer for mBART-50

* update tokenizers

* make src_lang and tgt_lang optional

* update tokenizer test

* add setter

* update docs

* update conversion script

* update docs

* update conversion script

* update tokenizer

* update test

* update docs

* doc

* address Sylvain's suggestions

* fix test

* fix formatting

* nits

6fc940ed

- Fix TF template (#10189) · 57021887
  Julien Plu authored Feb 15, 2021
```
* Fix template

* Update Seq2Seq tests
```
  57021887