Commits · 4d5a8d65576236b8adc1ada86a06ad11da93063f · chenpangpang / transformers

14 Jul, 2020 1 commit

docs(wandb): explain how to use W&B integration (#5607) · 4d5a8d65

Boris Dayma authored Jul 14, 2020



* docs(wandb): explain how to use W&B integration

fix #5262

* Also mention TensorBoard
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

4d5a8d65

10 Jul, 2020 1 commit

Update The Big Table of Tasks · 201d23f2

Julien Chaumond authored Jul 10, 2020


Co-Authored-By: Suraj Patil <surajp815@gmail.com>
Co-Authored-By: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

201d23f2

01 Jul, 2020 1 commit
- Fix examples titles and optimization doc page (#5408) · 4ade7491
  Sylvain Gugger authored Jul 01, 2020
  
  4ade7491
28 Jun, 2020 1 commit
- [examples] fix example links (#5344) · 12dfbd4f
  Suraj Patil authored Jun 28, 2020
  
  12dfbd4f
25 Jun, 2020 1 commit
- Closes #5218 · 7cc15bdd
  Lysandre Debut authored Jun 25, 2020
  
  7cc15bdd
24 Jun, 2020 1 commit
- Add hugs (#5225) · 7c41057d
  Sylvain Gugger authored Jun 24, 2020
  
  7c41057d
16 Jun, 2020 1 commit
- Convert hans to Trainer (#5025) · d5477baf
  Sylvain Gugger authored Jun 16, 2020
```
* Convert hans to Trainer

* Tick box
```
  d5477baf
05 Jun, 2020 1 commit
- [doc] Make it clearer that `text-generation` does not involve training · b9109f2d
  Julien Chaumond authored Jun 05, 2020
  
  b9109f2d
02 Jun, 2020 1 commit
- Specify PyTorch versions for examples (#4710) · 88762a2f
  Lysandre Debut authored Jun 02, 2020
  
  88762a2f
27 May, 2020 1 commit

per_device instead of per_gpu/error thrown when argument unknown (#4618) · 6a176880

Lysandre Debut authored May 27, 2020



* per_device instead of per_gpu/error thrown when argument unknown

* [docs] Restore examples.md symlink

* Correct absolute links so that symlink to the doc works correctly

* Update src/transformers/hf_argparser.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Warning + reorder

* Docs

* Style

* not for squad
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

6a176880

15 May, 2020 1 commit
- [examples] Streamline doc · af2e6bf8
  Julien Chaumond authored May 14, 2020
  
  af2e6bf8
13 May, 2020 1 commit

Question Answering for TF trainer (#4320) · ca136186

Julien Plu authored May 13, 2020

* Add QA trainer example for TF

* Make data_dir optional

* Fix parameter logic

* Fix feature convert

* Update the READMEs to add the question-answering task

* Apply style

* Change 'sequence-classification' to 'text-classification' and prefix with 'eval' all the metric names

* Apply style

* Apply style

ca136186

12 May, 2020 1 commit

Add MultipleChoice to TFTrainer [WIP] (#4270) · e4512aab

Viktor Alm authored May 12, 2020



* catch gpu len 1 set to gpu0

* Add mpc to trainer

* Add MPC for TF

* fix TF automodel for MPC and add Albert

* Apply style

* Fix import

* Note to self: double check

* Make shape None, None for datasetgenerator output shapes

* Add from_pt bool which doesnt seem to work

* Original checkpoint dir

* Fix docstrings for automodel

* Update readme and apply style

* Colab should probably not be from users

* Colabs should probably not be from users

* Add colab

* Update README.md

* Update README.md

* Cleanup __intit__

* Cleanup flake8 trailing comma

* Update src/transformers/training_args_tf.py

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

e4512aab

08 May, 2020 1 commit

[TPU] Doc, fix xla_spawn.py, only preprocess dataset once (#4223) · 7b75aa9f

Julien Chaumond authored May 08, 2020

* [TPU] Doc, fix xla_spawn.py, only preprocess dataset once

* Update examples/README.md

* [xla_spawn] Add `_mp_fn` to other Trainer scripts

* [TPU] Fix: eval dataloader was None

7b75aa9f

07 May, 2020 4 commits
- [doc] Fix broken links + remove crazy big notebook · c99fe038
  Julien Chaumond authored May 07, 2020
  
  c99fe038
- [examples] Add column for pytorch-lightning support · 6669915b
  Julien Chaumond authored May 07, 2020
  
  6669915b
- Examples readme.md (#4215) · 612fa1b1
  Julien Chaumond authored May 07, 2020
```
* README

* Update README.md
```
  612fa1b1
- BIG Reorganize examples (#4213) · 0ae96ff8
  Julien Chaumond authored May 07, 2020
```
* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around
```
  0ae96ff8
22 Apr, 2020 1 commit

Trainer (#3800) · dd9d483d

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

10 Apr, 2020 2 commits

Add `run_glue_tpu.py` that trains models on TPUs (#3702) · 551b4505

Jin Young Sohn authored Apr 10, 2020

* Initial commit to get BERT + run_glue.py on TPU

* Add README section for TPU and address comments.

* Cleanup TPU bits from run_glue.py (#3)

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.

We plan to upstream this directly into `huggingface/transformers`
(either `master` or `tpu`) branch once it's been more thoroughly tested.

* Cleanup TPU bits from run_glue.py

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py

.

We plan to upstream this directly into `huggingface/transformers`
(either `master` or `tpu`) branch once it's been more thoroughly tested.

* No need to call `xm.mark_step()` explicitly (#4)

Since for gradient accumulation we're accumulating on batches from
`ParallelLoader` instance which on next() marks the step itself.

* Resolve R/W conflicts from multiprocessing (#5)

* Add XLNet in list of models for `run_glue_tpu.py` (#6)

* Add RoBERTa to list of models in TPU GLUE (#7)

* Add RoBERTa and DistilBert to list of models in TPU GLUE (#8)

* Use barriers to reduce duplicate work/resources (#9)

* Shard eval dataset and aggregate eval metrics (#10)

* Shard eval dataset and aggregate eval metrics

Also, instead of calling `eval_loss.item()` every time do summation with
tensors on device.

* Change defaultdict to float

* Reduce the pred, label tensors instead of metrics

As brought up during review some metrics like f1 cannot be aggregated
via averaging. GLUE task metrics depends largely on the dataset, so
instead we sync the prediction and label tensors so that the metrics can
be computed accurately on those instead.

* Only use tb_writer from master (#11)

* Apply huggingface black code formatting

* Style

* Remove `--do_lower_case` as example uses cased

* Add option to specify tensorboard logdir

This is needed for our testing framework which checks regressions
against key metrics writtern by the summary writer.

* Using configuration for `xla_device`

* Prefix TPU specific comments.

* num_cores clarification and namespace eval metrics

* Cache features file under `args.cache_dir`

Instead of under `args.data_dir`. This is needed as our test infra uses
data_dir with a read-only filesystem.

* Rename `run_glue_tpu` to `run_tpu_glue`
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>

551b4505

[docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738) · cbad305c
Julien Chaumond authored Apr 10, 2020

cbad305c

09 Mar, 2020 1 commit
- cased -> uncased in BERT SQuAD example · eb3e6cb0
  Lysandre authored Mar 09, 2020
```
closes #3183
```
  eb3e6cb0
25 Feb, 2020 1 commit
- missing ner link (#2967) · 7a7ee28c
  Jhuo IH authored Feb 25, 2020
  
  7a7ee28c
22 Feb, 2020 1 commit
- fix hardcoded path in examples readme · cafc4dfc
  saippuakauppias authored Feb 22, 2020
  
  cafc4dfc
20 Feb, 2020 1 commit

Support for torch-lightning in NER examples (#2890) · b662f0e6

srush authored Feb 20, 2020



* initial pytorch lightning commit

* tested multigpu

* Fix learning rate schedule

* black formatting

* fix flake8

* isort

* isort

* .
Co-authored-by: Check your git settings! <chris@chris-laptop>

b662f0e6

17 Feb, 2020 1 commit
- fix typo in hans example call · 0dbddba6
  VictorSanh authored Feb 17, 2020
  
  0dbddba6
07 Feb, 2020 2 commits
- [examples] rename run_lm_finetuning to run_language_modeling · 42f08e59
  Julien Chaumond authored Feb 06, 2020
  
  42f08e59
- [examples] Fix broken markdown · 4f7bdb09
  Julien Chaumond authored Feb 06, 2020
  
  4f7bdb09
30 Jan, 2020 1 commit
- Correct documentation · 71a38231
  Jared Nielsen authored Jan 30, 2020
  
  71a38231
24 Jan, 2020 1 commit
- update correct eval metrics (distilbert & co) · 1ce3fb5c
  VictorSanh authored Jan 24, 2020
  
  1ce3fb5c
16 Jan, 2020 3 commits
- adding details in readme · 258ed2ea
  thomwolf authored Jan 16, 2020
  
  258ed2ea
- updating readme · e25b6fe3
  thomwolf authored Jan 06, 2020
  
  e25b6fe3
- adding details in readme - moving file · 27c7b990
  thomwolf authored Jan 06, 2020
  
  27c7b990
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
24 Dec, 2019 1 commit

Remove [--editable] in install instructions. · a8d34e53

Aymeric Augustin authored Dec 24, 2019

Use -e only in docs targeted at contributors.

If a user copy-pastes  command line with [--editable], they will hit
an error. If they don't know the --editable option, we're giving them
a choice to make before they can move forwards, but this isn't a choice
they need to make right now.

a8d34e53

21 Dec, 2019 1 commit
- move example to mm-imdb folder · 344126fe
  thomwolf authored Dec 21, 2019
  
  344126fe
19 Dec, 2019 1 commit
- Updated typo on the link · 284572ef
  Ejar authored Dec 18, 2019
```
Updated documentation due to typo
```
  284572ef
10 Dec, 2019 2 commits
- remove misplaced summarization documentation · 4b82c485
  Rémi Louf authored Dec 10, 2019
  
  4b82c485
- Add MMBT Model to Transformers Repo · df396112
  Suvrat Bhooshan authored Dec 09, 2019
  
  df396112