Commits · cdc48ce92ddf50e7ad871376be651638268b2e9a · chenpangpang / transformers

30 Oct, 2020 1 commit

Sylvain Gugger authored Oct 30, 2020



* Finish the cleanup of the language-modeling examples

* Update main README

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Propagate changes
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

cdc48ce9

27 Oct, 2020 3 commits
- Remove header · 1e01db35
  Sylvain Gugger authored Oct 27, 2020
  
- Fix typo · b715e40c
  Sylvain Gugger authored Oct 27, 2020
  
- Move installation instructions to the top (#8106) · 41cc5f3f
  Sylvain Gugger authored Oct 27, 2020
  
30 Sep, 2020 1 commit
- [doc] rm Azure buttons as not implemented yet · 0acd1ffa
  Julien Chaumond authored Sep 30, 2020
  
25 Sep, 2020 1 commit
- doc changes (#7385) · 415071b4
  Suraj Patil authored Sep 25, 2020
  
10 Aug, 2020 1 commit
- correct pl link in readme (#6364) · 35eb96de
  Rohit Gupta authored Aug 10, 2020
  
06 Aug, 2020 1 commit

Adds comet_ml to the list of auto-experiment loggers (#6176) · b923871b

Doug Blank authored Aug 06, 2020



* Support for Comet.ml

* Need to import comet first

* Log this model, not the one in the backprop step

* Log args as hyperparameters; use framework to allow fine control

* Log hyperparameters with context

* Apply black formatting

* isort fix integrations

* isort fix __init__

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/trainer_tf.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Address review comments

* Style + Quality, remove Tensorboard import test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>


29 Jul, 2020 1 commit

Rework TF trainer (#6038) · 54f9fbef

Julien Plu authored Jul 29, 2020

* Fully rework training/prediction loops

* fix method name

* Fix variable name

* Fix property name

* Fix scope

* Fix method name

* Fix tuple index

* Fix tuple index

* Fix indentation

* Fix variable name

* fix eval before log

* Add drop remainder for test dataset

* Fix step number + fix logging datetime

* fix eval loss value

* use global step instead of step + fix logging at step 0

* Fix logging datetime

* Fix global_step usage

* Fix breaking loop + logging datetime

* Fix step in prediction loop

* Fix step breaking

* Fix train/test loops

* Force TF at least 2.2 for the trainer

* Use assert_cardinality to facilitate the dataset size computation

* Log steps per epoch

* Make tfds compliant with TPU

* Make tfds compliant with TPU

* Use TF dataset enumerate instead of the Python one

* revert previous commit

* Fix data_dir

* Apply style

* rebase on master

* Address Sylvain's comments

* Address Sylvain's and Lysandre's comments

* Trigger CI

* Remove unused import


14 Jul, 2020 1 commit

docs(wandb): explain how to use W&B integration (#5607) · 4d5a8d65

Boris Dayma authored Jul 14, 2020



* docs(wandb): explain how to use W&B integration

fix #5262

* Also mention TensorBoard
Co-authored-by: Julien Chaumond <chaumond@gmail.com>


10 Jul, 2020 1 commit

Update The Big Table of Tasks · 201d23f2

Julien Chaumond authored Jul 10, 2020


Co-Authored-By: Suraj Patil <surajp815@gmail.com>
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>


01 Jul, 2020 1 commit
- Fix examples titles and optimization doc page (#5408) · 4ade7491
  Sylvain Gugger authored Jul 01, 2020
  
28 Jun, 2020 1 commit
- [examples] fix example links (#5344) · 12dfbd4f
  Suraj Patil authored Jun 28, 2020
  
25 Jun, 2020 1 commit
- Closes #5218 · 7cc15bdd
  Lysandre Debut authored Jun 25, 2020
  
24 Jun, 2020 1 commit
- Add hugs (#5225) · 7c41057d
  Sylvain Gugger authored Jun 24, 2020
  
16 Jun, 2020 1 commit
- Convert hans to Trainer (#5025) · d5477baf
  Sylvain Gugger authored Jun 16, 2020
```
* Convert hans to Trainer

* Tick box
```
05 Jun, 2020 1 commit
- [doc] Make it clearer that `text-generation` does not involve training · b9109f2d
  Julien Chaumond authored Jun 05, 2020
  
02 Jun, 2020 1 commit
- Specify PyTorch versions for examples (#4710) · 88762a2f
  Lysandre Debut authored Jun 02, 2020
  
27 May, 2020 1 commit

per_device instead of per_gpu/error thrown when argument unknown (#4618) · 6a176880

Lysandre Debut authored May 27, 2020



* per_device instead of per_gpu/error thrown when argument unknown

* [docs] Restore examples.md symlink

* Correct absolute links so that symlink to the doc works correctly

* Update src/transformers/hf_argparser.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Warning + reorder

* Docs

* Style

* not for squad
Co-authored-by: Julien Chaumond <chaumond@gmail.com>


15 May, 2020 1 commit
- [examples] Streamline doc · af2e6bf8
  Julien Chaumond authored May 14, 2020
  
13 May, 2020 1 commit

Question Answering for TF trainer (#4320) · ca136186

Julien Plu authored May 13, 2020

* Add QA trainer example for TF

* Make data_dir optional

* Fix parameter logic

* Fix feature convert

* Update the READMEs to add the question-answering task

* Apply style

* Change 'sequence-classification' to 'text-classification' and prefix with 'eval' all the metric names

* Apply style

* Apply style


12 May, 2020 1 commit

Add MultipleChoice to TFTrainer [WIP] (#4270) · e4512aab

Viktor Alm authored May 12, 2020



* catch gpu len 1 set to gpu0

* Add mpc to trainer

* Add MPC for TF

* fix TF automodel for MPC and add Albert

* Apply style

* Fix import

* Note to self: double check

* Make shape None, None for datasetgenerator output shapes

* Add from_pt bool which doesn't seem to work

* Original checkpoint dir

* Fix docstrings for automodel

* Update readme and apply style

* Colab should probably not be from users

* Colabs should probably not be from users

* Add colab

* Update README.md

* Update README.md

* Cleanup __init__

* Cleanup flake8 trailing comma

* Update src/transformers/training_args_tf.py

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>


08 May, 2020 1 commit

[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) · 7b75aa9f

Julien Chaumond authored May 08, 2020

* [TPU] Doc, fix xla_spawn.py, only preprocess dataset once

* Update examples/README.md

* [xla_spawn] Add `_mp_fn` to other Trainer scripts

* [TPU] Fix: eval dataloader was None


07 May, 2020 4 commits
- [doc] Fix broken links + remove crazy big notebook · c99fe038
  Julien Chaumond authored May 07, 2020
  
- [examples] Add column for pytorch-lightning support · 6669915b
  Julien Chaumond authored May 07, 2020
  
- Examples readme.md (#4215) · 612fa1b1
  Julien Chaumond authored May 07, 2020
```
* README

* Update README.md
```
- BIG Reorganize examples (#4213) · 0ae96ff8
  Julien Chaumond authored May 07, 2020
```
* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around
```
22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e


10 Apr, 2020 2 commits

Add `run_glue_tpu.py` that trains models on TPUs (#3702) · 551b4505

Jin Young Sohn authored Apr 10, 2020

* Initial commit to get BERT + run_glue.py on TPU

* Add README section for TPU and address comments.

* Cleanup TPU bits from run_glue.py (#3)

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.

We plan to upstream this directly into `huggingface/transformers`
(either the `master` or the `tpu` branch) once it's been more thoroughly tested.

* Cleanup TPU bits from run_glue.py

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.

We plan to upstream this directly into `huggingface/transformers`
(either the `master` or the `tpu` branch) once it's been more thoroughly tested.

* No need to call `xm.mark_step()` explicitly (#4)

For gradient accumulation we accumulate over batches from a
`ParallelLoader` instance, which marks the step itself on `next()`.

* Resolve R/W conflicts from multiprocessing (#5)

* Add XLNet in list of models for `run_glue_tpu.py` (#6)

* Add RoBERTa to list of models in TPU GLUE (#7)

* Add RoBERTa and DistilBert to list of models in TPU GLUE (#8)

* Use barriers to reduce duplicate work/resources (#9)

* Shard eval dataset and aggregate eval metrics (#10)

* Shard eval dataset and aggregate eval metrics

Also, instead of calling `eval_loss.item()` every time, do the summation with
tensors on device.

* Change defaultdict to float

* Reduce the pred, label tensors instead of metrics

As brought up during review, some metrics like f1 cannot be aggregated
via averaging. GLUE task metrics depend largely on the dataset, so
instead we sync the prediction and label tensors so that the metrics can
be computed accurately on those.

* Only use tb_writer from master (#11)

* Apply huggingface black code formatting

* Style

* Remove `--do_lower_case` as example uses cased

* Add option to specify tensorboard logdir

This is needed for our testing framework, which checks for regressions
against key metrics written by the summary writer.

* Using configuration for `xla_device`

* Prefix TPU specific comments.

* num_cores clarification and namespace eval metrics

* Cache features file under `args.cache_dir`

Instead of under `args.data_dir`. This is needed as our test infra uses
data_dir with a read-only filesystem.

* Rename `run_glue_tpu` to `run_tpu_glue`
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
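
Note on the "Reduce the pred, label tensors instead of metrics" item above: a minimal sketch, in plain Python, of why per-shard F1 scores cannot simply be averaged. The `f1` helper and the two shards are hypothetical illustrations, not code from `run_tpu_glue.py`; they only show that the mean of shard-level F1 differs from F1 computed on the gathered predictions and labels.

```python
def f1(preds, labels):
    """Binary F1 for the positive class (label 1); 0.0 when undefined."""
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Two hypothetical eval shards (preds, labels), as if each came from one core.
shard_a = ([1, 1, 1, 1], [1, 1, 1, 1])  # every prediction correct
shard_b = ([1, 0, 0, 0], [0, 1, 0, 0])  # one false positive, one false negative

# Option 1: compute the metric per shard, then average the metric values.
mean_of_shard_f1 = (f1(*shard_a) + f1(*shard_b)) / 2

# Option 2: gather predictions and labels first, then compute the metric once.
all_preds = shard_a[0] + shard_b[0]
all_labels = shard_a[1] + shard_b[1]
pooled_f1 = f1(all_preds, all_labels)

print(mean_of_shard_f1)  # 0.5
print(pooled_f1)         # 0.8
```

The two numbers disagree because F1 is a ratio of counts rather than a per-example average, which is consistent with the commit's choice to sync prediction and label tensors before computing metrics.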


[docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738) · cbad305c
Julien Chaumond authored Apr 10, 2020


09 Mar, 2020 1 commit
- cased -> uncased in BERT SQuAD example · eb3e6cb0
  Lysandre authored Mar 09, 2020
```
closes #3183
```
25 Feb, 2020 1 commit
- missing ner link (#2967) · 7a7ee28c
  Jhuo IH authored Feb 25, 2020
  
22 Feb, 2020 1 commit
- fix hardcoded path in examples readme · cafc4dfc
  saippuakauppias authored Feb 22, 2020
  
20 Feb, 2020 1 commit

Support for torch-lightning in NER examples (#2890) · b662f0e6

srush authored Feb 20, 2020



* initial pytorch lightning commit

* tested multigpu

* Fix learning rate schedule

* black formatting

* fix flake8

* isort

* isort

* .
Co-authored-by: Check your git settings! <chris@chris-laptop>


17 Feb, 2020 1 commit
- fix typo in hans example call · 0dbddba6
  VictorSanh authored Feb 17, 2020
  
07 Feb, 2020 2 commits
- [examples] rename run_lm_finetuning to run_language_modeling · 42f08e59
  Julien Chaumond authored Feb 06, 2020
  
- [examples] Fix broken markdown · 4f7bdb09
  Julien Chaumond authored Feb 06, 2020
  
30 Jan, 2020 1 commit
- Correct documentation · 71a38231
  Jared Nielsen authored Jan 30, 2020
  
24 Jan, 2020 1 commit
- update correct eval metrics (distilbert & co) · 1ce3fb5c
  VictorSanh authored Jan 24, 2020
  
16 Jan, 2020 1 commit
- adding details in readme · 258ed2ea
  thomwolf authored Jan 16, 2020
  