Commits · ac5bcf236e471d523c5ae1c68922e37b8da76509 · chenpangpang / transformers

12 Aug, 2020 1 commit
- Disabled pabee test (#6431) · 4ffea5ce
  Lysandre Debut authored Aug 12, 2020
  
  4ffea5ce
11 Aug, 2020 9 commits

[examples] add pytest dependency (#6425) · 3f071c4b
Sam Shleifer authored Aug 11, 2020

3f071c4b

lr_schedulers: add get_polynomial_decay_schedule_with_warmup (#6361) · ece0903e

Stas Bekman authored Aug 11, 2020



* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* [model_cards] electra-base-turkish-cased-ner (#6350)

* for electra-base-turkish-cased-ner

* Add metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Temporarily de-activate TPU CI

* Update modeling_tf_utils.py (#6372)

fix typo: ckeckpoint->checkpoint

* the test now works again (#6371)

* correct pl link in readme (#6364)

* refactor almost identical tests (#6339)

* refactor almost identical tests

* important to add a clear assert error message

* make the assert error even more descriptive than the original bt

* Small docfile fixes (#6328)

* Patch models (#6326)

* TFAlbertFor{TokenClassification, MultipleChoice}

* Patch models

* BERT and TF BERT info


s

* Update check_repo

* Ci GitHub caching (#6382)

* Cache Github Actions CI

* Remove useless file

* Colab button (#6389)

* Add colab button

* Add colab link for tutorials

* Fix links for open in colab (#6391)

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* [wip] add get_polynomial_decay_schedule_with_warmup

* style

* add assert

* change lr_end to a much smaller default number

* check for exact equality

* Update src/transformers/optimization.py

consistently use lr_end=1e-7 default
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* remove dup (leftover from merge)

* convert the test into the new refactored format

* stick to using the current_step as is, without ++
Co-authored-by: M. Yusuf Sarıgöz <yusufsarigoz@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Alexander Measure <ameasure@gmail.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ece0903e

[pl] restore lr logging behavior for glue, ner examples (#6314) · 0203d651
Stas Bekman authored Aug 11, 2020

0203d651
rename prepare_translation_batch -> prepare_seq2seq_batch (#6103) · be1520d3
Sam Shleifer authored Aug 11, 2020

be1520d3
PegasusForConditionalGeneration (torch version) (#6340) · 66fa8cea
Sam Shleifer authored Aug 11, 2020
```
Co-authored-by: Jingqing  Zhang <jingqing.zhang15@imperial.ac.uk>
```
66fa8cea
[s2s] wmt download script use less ram (#6405) · f6cb0f80
Stas Bekman authored Aug 11, 2020

f6cb0f80
pl version: examples/requirements.txt is single source of truth (#6309) · 7c6a085e
Stas Bekman authored Aug 11, 2020

7c6a085e

add pl_glue example test (#6034) · f6c0680d

Stas Bekman authored Aug 11, 2020

* add pl_glue example test

* for now just test that it runs, next validate results of eval or predict?

* complete the run_pl_glue test to validate the actual outcome

* worked on my machine, CI gets less accuracy - trying higher epochs

* match run_pl.sh hparms

* more epochs?

* trying higher lr

* for now just test that the script runs to a completion

* correct the comment

* if cuda is available, add --fp16 --gpus=1 to cover more bases

* style

f6c0680d

[s2s] Script to save wmt data to disk (#6403) · b9ecd92e
Sam Shleifer authored Aug 10, 2020

b9ecd92e

10 Aug, 2020 2 commits
- correct pl link in readme (#6364) · 35eb96de
  Rohit Gupta authored Aug 10, 2020
  
  35eb96de
- the test now works again (#6371) · 0830e795
  Stas Bekman authored Aug 09, 2020
  
  0830e795
09 Aug, 2020 1 commit
- [s2s] fix --gpus clarg collision (#6358) · 9a5ef837
  Sam Shleifer authored Aug 08, 2020
  
  9a5ef837
08 Aug, 2020 3 commits
- [s2s] fix label_smoothed_nll_loss (#6344) · 9bed3554
  Suraj Patil authored Aug 08, 2020
  
  9bed3554
- [s2s] tiny QOL improvement: run_eval prints scores (#6341) · 99f73bcc
  Sam Shleifer authored Aug 08, 2020
  
  99f73bcc
- remove a TODO item to use a tiny model (#6338) · 322dffc6
  Stas Bekman authored Aug 07, 2020
```
as discussed with @sshleifer, removing this TODO to switch to a tiny model, since it won't be able to test the results of the evaluation (i.e. the results are meaningless).
```
  322dffc6
07 Aug, 2020 3 commits
- Add setup for TPU CI to run every hour. (#6219) · 1b8a7ffc
  zcain117 authored Aug 07, 2020
```
* Add setup for TPU CI to run every hour.

* Re-organize config.yml
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  1b8a7ffc
- [examples] consistently use --gpus, instead of --n_gpu (#6315) · 6695450a
  Stas Bekman authored Aug 07, 2020
  
  6695450a
- fix the shuffle agrument usage and the default (#6307) · 175cd45e
  Stas Bekman authored Aug 06, 2020
  
  175cd45e
06 Aug, 2020 4 commits

[Fix] text-classification PL example (#6027) · ffceef20
Bhashithe Abeysinghe authored Aug 06, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
ffceef20
Remove redundant line in run_pl_glue.py (#6305) · eb2bd8d6
xujiaze13 authored Aug 06, 2020

eb2bd8d6
[s2s]Use prepare_translation_batch for Marian finetuning (#6293) · 2804fff8
Sam Shleifer authored Aug 06, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
2804fff8

Adds comet_ml to the list of auto-experiment loggers (#6176) · b923871b

Doug Blank authored Aug 06, 2020



* Support for Comet.ml

* Need to import comet first

* Log this model, not the one in the backprop step

* Log args as hyperparameters; use framework to allow fine control

* Log hyperparameters with context

* Apply black formatting

* isort fix integrations

* isort fix __init__

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Address review comments

* Style + Quality, remove Tensorboard import test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

b923871b

05 Aug, 2020 1 commit

[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232) · 376c02e9

Stas Bekman authored Aug 05, 2020

* support --lr_scheduler with multiple possibilities

* correct the error message

* add a note about supported schedulers

* cleanup

* cleanup2

* needs the argument default

* style

* add another assert in the test

* implement requested changes

* cleanups

* fix relative import

* cleanup

376c02e9

03 Aug, 2020 3 commits
- [s2s] Document better mbart finetuning command (#6229) · 57eb1cb6
  Sam Shleifer authored Aug 03, 2020
```
* Document better MT command

* improve multigpu command
```
  57eb1cb6
- correct label extraction + add note on discrepancies on trained MNLI model and HANS (#6221) · 0513f8d2
  Victor SANH authored Aug 03, 2020
  
  0513f8d2
- s2s: fix LR logging, remove some dead code. (#6205) · b6b2f227
  Sam Shleifer authored Aug 03, 2020
  
  b6b2f227
01 Aug, 2020 1 commit
- [s2s] clean up + doc (#6184) · d8dbf3b7
  Stas Bekman authored Aug 01, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  d8dbf3b7
31 Jul, 2020 1 commit

enable easy checkout switch (#5645) · f250beb8

Stas Bekman authored Jul 31, 2020

* enable easy checkout switch

allow having multiple repository checkouts and not needing to remember to rerun 'pip install -e .[dev]' when switching between checkouts and running tests.

* make isort happy

* examples needs one too

f250beb8

30 Jul, 2020 2 commits

Switch from return_tuple to return_dict (#6138) · 91cb9546

Sylvain Gugger authored Jul 30, 2020



* Switch from return_tuple to return_dict

* Fix test

* [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614)

* Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests

* AutoModels


Tiny tweaks

* Style

* Final changes before merge

* Re-order for simpler review

* Final fixes

* Addressing @sgugger's comments

* Test MultipleChoice

* Rework TF trainer (#6038)

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre comments

* Trigger CI

* Remove unused import

* Switch from return_tuple to return_dict

* Fix test

* Add recent model
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Plu <plu.julien@gmail.com>

91cb9546

[s2s] add support for overriding config params (#6149) · 3212b885
Stas Bekman authored Jul 29, 2020

3212b885

29 Jul, 2020 2 commits

Rework TF trainer (#6038) · 54f9fbef

Julien Plu authored Jul 29, 2020

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre comments

* Trigger CI

* Remove unused import

54f9fbef

XLNet PLM Readme (#6121) · 641b873c
Lysandre Debut authored Jul 29, 2020

641b873c

28 Jul, 2020 5 commits
- Fix deebert tests (#6102) · 92f8ce2e
  Sam Shleifer authored Jul 28, 2020
  
  92f8ce2e
- [s2s] Delete useless method, log tokens_per_batch (#6081) · dafa296c
  Sam Shleifer authored Jul 28, 2020
  
  dafa296c
- link to README.md (#6068) · f0c70085
  Stas Bekman authored Jul 28, 2020
```
* add a link to README.md

* Update README.md
```
  f0c70085
- MBART: support summarization tasks where max_src_len > max_tgt_len (#6003) · 3c7fbf35
  Sam Shleifer authored Jul 28, 2020
```
* MBART: support summarization tasks

* fix test

* Style

* add tokenizer test
```
  3c7fbf35
- [s2s] Don't mention packed data in README (#6079) · 7a68d401
  Sam Shleifer authored Jul 27, 2020
  
  7a68d401
27 Jul, 2020 2 commits
- [s2s] dont document packing because it hurts performance (#6077) · 1e00ef68
  Sam Shleifer authored Jul 27, 2020
  
  1e00ef68
- CL util to convert models to fp16 before upload (#5953) · 11792d78
  Sam Shleifer authored Jul 27, 2020
  
  11792d78