Commits · 9a5ef83748d656e65013567f6feedb201d44258b · chenpangpang / transformers

09 Aug, 2020 1 commit
- [s2s] fix --gpus clarg collision (#6358) · 9a5ef837
  Sam Shleifer authored Aug 08, 2020
  
  9a5ef837
08 Aug, 2020 3 commits
- [s2s] fix label_smoothed_nll_loss (#6344) · 9bed3554
  Suraj Patil authored Aug 08, 2020
  
  9bed3554
- [s2s] tiny QOL improvement: run_eval prints scores (#6341) · 99f73bcc
  Sam Shleifer authored Aug 08, 2020
  
  99f73bcc
- remove a TODO item to use a tiny model (#6338) · 322dffc6
  Stas Bekman authored Aug 07, 2020
```
as discussed with @sshleifer, removing this TODO to switch to a tiny model, since it won't be able to test the results of the evaluation (i.e. the results are meaningless).
```
  322dffc6
07 Aug, 2020 3 commits
- Add setup for TPU CI to run every hour. (#6219) · 1b8a7ffc
  zcain117 authored Aug 07, 2020
```
* Add setup for TPU CI to run every hour.

* Re-organize config.yml
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  1b8a7ffc
- [examples] consistently use --gpus, instead of --n_gpu (#6315) · 6695450a
  Stas Bekman authored Aug 07, 2020
  
  6695450a
- fix the shuffle agrument usage and the default (#6307) · 175cd45e
  Stas Bekman authored Aug 06, 2020
  
  175cd45e
06 Aug, 2020 4 commits

[Fix] text-classification PL example (#6027) · ffceef20
Bhashithe Abeysinghe authored Aug 06, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
ffceef20
Remove redundant line in run_pl_glue.py (#6305) · eb2bd8d6
xujiaze13 authored Aug 06, 2020

eb2bd8d6
[s2s]Use prepare_translation_batch for Marian finetuning (#6293) · 2804fff8
Sam Shleifer authored Aug 06, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
2804fff8

Adds comet_ml to the list of auto-experiment loggers (#6176) · b923871b

Doug Blank authored Aug 06, 2020



* Support for Comet.ml

* Need to import comet first

* Log this model, not the one in the backprop step

* Log args as hyperparameters; use framework to allow fine control

* Log hyperparameters with context

* Apply black formatting

* isort fix integrations

* isort fix __init__

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Address review comments

* Style + Quality, remove Tensorboard import test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

b923871b

05 Aug, 2020 1 commit

[WIP] lightning_base: support --lr_scheduler with multiple possibilities (#6232) · 376c02e9

Stas Bekman authored Aug 05, 2020

* support --lr_scheduler with multiple possibilities

* correct the error message

* add a note about supported schedulers

* cleanup

* cleanup2

* needs the argument default

* style

* add another assert in the test

* implement requested changes

* cleanups

* fix relative import

* cleanup

376c02e9

03 Aug, 2020 3 commits
- [s2s] Document better mbart finetuning command (#6229) · 57eb1cb6
  Sam Shleifer authored Aug 03, 2020
```
* Document better MT command

* improve multigpu command
```
  57eb1cb6
- correct label extraction + add note on discrepancies on trained MNLI model and HANS (#6221) · 0513f8d2
  Victor SANH authored Aug 03, 2020
  
  0513f8d2
- s2s: fix LR logging, remove some dead code. (#6205) · b6b2f227
  Sam Shleifer authored Aug 03, 2020
  
  b6b2f227
01 Aug, 2020 1 commit
- [s2s] clean up + doc (#6184) · d8dbf3b7
  Stas Bekman authored Aug 01, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
  d8dbf3b7
31 Jul, 2020 1 commit

enable easy checkout switch (#5645) · f250beb8

Stas Bekman authored Jul 31, 2020

* enable easy checkout switch

allow having multiple repository checkouts and not needing to remember to rerun 'pip install -e .[dev]' when switching between checkouts and running tests.

* make isort happy

* examples needs one too

f250beb8

30 Jul, 2020 2 commits

Switch from return_tuple to return_dict (#6138) · 91cb9546

Sylvain Gugger authored Jul 30, 2020



* Switch from return_tuple to return_dict

* Fix test

* [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleC… (#5614)

* Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests

* AutoModels


Tiny tweaks

* Style

* Final changes before merge

* Re-order for simpler review

* Final fixes

* Addressing @sgugger's comments

* Test MultipleChoice

* Rework TF trainer (#6038)

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre comments

* Trigger CI

* Remove unused import

* Switch from return_tuple to return_dict

* Fix test

* Add recent model
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Plu <plu.julien@gmail.com>

91cb9546

[s2s] add support for overriding config params (#6149) · 3212b885
Stas Bekman authored Jul 29, 2020

3212b885

29 Jul, 2020 2 commits

Rework TF trainer (#6038) · 54f9fbef

Julien Plu authored Jul 29, 2020

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre comments

* Trigger CI

* Remove unused import

54f9fbef

XLNet PLM Readme (#6121) · 641b873c
Lysandre Debut authored Jul 29, 2020

641b873c

28 Jul, 2020 5 commits
- Fix deebert tests (#6102) · 92f8ce2e
  Sam Shleifer authored Jul 28, 2020
  
  92f8ce2e
- [s2s] Delete useless method, log tokens_per_batch (#6081) · dafa296c
  Sam Shleifer authored Jul 28, 2020
  
  dafa296c
- link to README.md (#6068) · f0c70085
  Stas Bekman authored Jul 28, 2020
```
* add a link to README.md

* Update README.md
```
  f0c70085
- MBART: support summarization tasks where max_src_len > max_tgt_len (#6003) · 3c7fbf35
  Sam Shleifer authored Jul 28, 2020
```
* MBART: support summarization tasks

* fix test

* Style

* add tokenizer test
```
  3c7fbf35
- [s2s] Don't mention packed data in README (#6079) · 7a68d401
  Sam Shleifer authored Jul 27, 2020
  
  7a68d401
27 Jul, 2020 4 commits
- [s2s] dont document packing because it hurts performance (#6077) · 1e00ef68
  Sam Shleifer authored Jul 27, 2020
  
  1e00ef68
- CL util to convert models to fp16 before upload (#5953) · 11792d78
  Sam Shleifer authored Jul 27, 2020
  
  11792d78
- [pack_dataset] don't sort before packing, only pack train (#5954) · 4302ace5
  Sam Shleifer authored Jul 27, 2020
  
  4302ace5
- [examples (seq2seq)] fix preparing decoder_input_ids for T5 (#5994) · d1d15d6f
  Suraj Patil authored Jul 27, 2020
  
  d1d15d6f
24 Jul, 2020 1 commit
- [CI] Don't test apex (#6021) · c69ea5ef
  Sam Shleifer authored Jul 24, 2020
  
  c69ea5ef
22 Jul, 2020 2 commits
- [test] partial coverage for train_mbart_enro_cc25.sh (#5976) · c3206eef
  Sam Shleifer authored Jul 22, 2020
  
  c3206eef
- [docs] Add integration test example to copy pasta template (#5961) · feeb956a
  Sam Shleifer authored Jul 22, 2020
```
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  feeb956a
21 Jul, 2020 4 commits
- seq2seq/run_eval.py can take decoder_start_token_id (#5949) · 9dab39fe
  Sam Shleifer authored Jul 21, 2020
  
  9dab39fe
- [examples/seq2seq]: add --label_smoothing option (#5919) · 5b193b39
  Sam Shleifer authored Jul 21, 2020
  
  5b193b39
- [Doc] explaining romanian postprocessing for MBART BLEU hacking (#5943) · 95d1962b
  Sam Shleifer authored Jul 21, 2020
  
  95d1962b
- typos in seq2seq/readme (#5937) · ccbf74a6
  Aditya Soni authored Jul 21, 2020
  
  ccbf74a6
20 Jul, 2020 3 commits

DataParallel fix: multi gpu evaluation (#5926) · 8e0bcb56

Qingqing Cao authored Jul 20, 2020

The DataParallel training was fixed in https://github.com/huggingface/transformers/pull/5733, this commit also fixes the evaluation. It's more convenient when the user enables both `do_train` and `do_eval`.

8e0bcb56

[Fix] seq2seq pack_dataset.py actually packs (#5913) · f1a4e06f
Sam Shleifer authored Jul 20, 2020
```
Huge MT speedup!
```
f1a4e06f

DataParallel fixes (#5733) · 35cb101e

Stas Bekman authored Jul 20, 2020

* DataParallel fixes:

1. switched to a more precise check
-        if self.args.n_gpu > 1:
+        if isinstance(model, nn.DataParallel):

2. fix tests - require the same fixup under DataParallel as the training module

* another fix

35cb101e