Commits · c99fe0386be118bceaab1c85cdb8309eb8cb8208 · chenpangpang / transformers

07 May, 2020 4 commits
- [doc] Fix broken links + remove crazy big notebook · c99fe038
  Julien Chaumond authored May 07, 2020
  
  c99fe038
- [examples] Add column for pytorch-lightning support · 6669915b
  Julien Chaumond authored May 07, 2020
  
  6669915b
- Examples readme.md (#4215) · 612fa1b1
  Julien Chaumond authored May 07, 2020
```
* README

* Update README.md
```
  612fa1b1
- BIG Reorganize examples (#4213) · 0ae96ff8
  Julien Chaumond authored May 07, 2020
```
* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around
```
  0ae96ff8
22 Apr, 2020 1 commit

Julien Chaumond authored Apr 21, 2020

* doc

* [tests] Add sample files for a regression task

* [HUGE] Trainer

* Feedback from @sshleifer

* Feedback from @thomwolf + logging tweak

* [file_utils] when downloading concurrently, get_from_cache will use the cached file for subsequent processes

* [glue] Use default max_seq_length of 128 like before

* [glue] move DataTrainingArguments around

* [ner] Change interface of InputExample, and align run_{tf,pl}

* Re-align the pl scripts a little bit

* ner

* [ner] Add integration test

* Fix language_modeling with API tweak

* [ci] Tweak loss target

* Don't break console output

* amp.initialize: model must be on right device before

* [multiple-choice] update for Trainer

* Re-align to 827d6d6e

dd9d483d

10 Apr, 2020 2 commits

Add `run_glue_tpu.py` that trains models on TPUs (#3702) · 551b4505

Jin Young Sohn authored Apr 10, 2020

* Initial commit to get BERT + run_glue.py on TPU

* Add README section for TPU and address comments.

* Cleanup TPU bits from run_glue.py (#3)

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py.

We plan to upstream this directly into `huggingface/transformers`
(either `master` or `tpu`) branch once it's been more thoroughly tested.

* Cleanup TPU bits from run_glue.py

TPU runner is currently implemented in:
https://github.com/pytorch-tpu/transformers/blob/tpu/examples/run_glue_tpu.py

.

We plan to upstream this directly into `huggingface/transformers`
(either `master` or `tpu`) branch once it's been more thoroughly tested.

* No need to call `xm.mark_step()` explicitly (#4)

Since for gradient accumulation we're accumulating on batches from
`ParallelLoader` instance which on next() marks the step itself.

* Resolve R/W conflicts from multiprocessing (#5)

* Add XLNet in list of models for `run_glue_tpu.py` (#6)

* Add RoBERTa to list of models in TPU GLUE (#7)

* Add RoBERTa and DistilBert to list of models in TPU GLUE (#8)

* Use barriers to reduce duplicate work/resources (#9)

* Shard eval dataset and aggregate eval metrics (#10)

* Shard eval dataset and aggregate eval metrics

Also, instead of calling `eval_loss.item()` every time do summation with
tensors on device.

* Change defaultdict to float

* Reduce the pred, label tensors instead of metrics

As brought up during review some metrics like f1 cannot be aggregated
via averaging. GLUE task metrics depends largely on the dataset, so
instead we sync the prediction and label tensors so that the metrics can
be computed accurately on those instead.

* Only use tb_writer from master (#11)

* Apply huggingface black code formatting

* Style

* Remove `--do_lower_case` as example uses cased

* Add option to specify tensorboard logdir

This is needed for our testing framework which checks regressions
against key metrics writtern by the summary writer.

* Using configuration for `xla_device`

* Prefix TPU specific comments.

* num_cores clarification and namespace eval metrics

* Cache features file under `args.cache_dir`

Instead of under `args.data_dir`. This is needed as our test infra uses
data_dir with a read-only filesystem.

* Rename `run_glue_tpu` to `run_tpu_glue`
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>

551b4505

[docs] The use of `do_lower_case` in scripts is on its way to deprecation (#3738) · cbad305c
Julien Chaumond authored Apr 10, 2020

cbad305c

09 Mar, 2020 1 commit
- cased -> uncased in BERT SQuAD example · eb3e6cb0
  Lysandre authored Mar 09, 2020
```
closes #3183
```
  eb3e6cb0
25 Feb, 2020 1 commit
- missing ner link (#2967) · 7a7ee28c
  Jhuo IH authored Feb 25, 2020
  
  7a7ee28c
22 Feb, 2020 1 commit
- fix hardcoded path in examples readme · cafc4dfc
  saippuakauppias authored Feb 22, 2020
  
  cafc4dfc
20 Feb, 2020 1 commit

Support for torch-lightning in NER examples (#2890) · b662f0e6

srush authored Feb 20, 2020



* initial pytorch lightning commit

* tested multigpu

* Fix learning rate schedule

* black formatting

* fix flake8

* isort

* isort

* .
Co-authored-by: Check your git settings! <chris@chris-laptop>

b662f0e6

17 Feb, 2020 1 commit
- fix typo in hans example call · 0dbddba6
  VictorSanh authored Feb 17, 2020
  
  0dbddba6
07 Feb, 2020 2 commits
- [examples] rename run_lm_finetuning to run_language_modeling · 42f08e59
  Julien Chaumond authored Feb 06, 2020
  
  42f08e59
- [examples] Fix broken markdown · 4f7bdb09
  Julien Chaumond authored Feb 06, 2020
  
  4f7bdb09
30 Jan, 2020 1 commit
- Correct documentation · 71a38231
  Jared Nielsen authored Jan 30, 2020
  
  71a38231
24 Jan, 2020 1 commit
- update correct eval metrics (distilbert & co) · 1ce3fb5c
  VictorSanh authored Jan 24, 2020
  
  1ce3fb5c
16 Jan, 2020 3 commits
- adding details in readme · 258ed2ea
  thomwolf authored Jan 16, 2020
  
  258ed2ea
- updating readme · e25b6fe3
  thomwolf authored Jan 06, 2020
  
  e25b6fe3
- adding details in readme - moving file · 27c7b990
  thomwolf authored Jan 06, 2020
  
  27c7b990
06 Jan, 2020 2 commits
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
  
  dd4df80f
24 Dec, 2019 1 commit

Remove [--editable] in install instructions. · a8d34e53

Aymeric Augustin authored Dec 24, 2019

Use -e only in docs targeted at contributors.

If a user copy-pastes  command line with [--editable], they will hit
an error. If they don't know the --editable option, we're giving them
a choice to make before they can move forwards, but this isn't a choice
they need to make right now.

a8d34e53

21 Dec, 2019 1 commit
- move example to mm-imdb folder · 344126fe
  thomwolf authored Dec 21, 2019
  
  344126fe
19 Dec, 2019 1 commit
- Updated typo on the link · 284572ef
  Ejar authored Dec 18, 2019
```
Updated documentation due to typo
```
  284572ef
10 Dec, 2019 4 commits
- remove misplaced summarization documentation · 4b82c485
  Rémi Louf authored Dec 10, 2019
  
  4b82c485
- Add MMBT Model to Transformers Repo · df396112
  Suvrat Bhooshan authored Dec 09, 2019
  
  df396112
- add README · c0707a85
  Rémi Louf authored Dec 06, 2019
  
  c0707a85
- update the docs · 693606a7
  Rémi Louf authored Dec 05, 2019
  
  693606a7
05 Dec, 2019 2 commits
- Add few tests on the TF optimization file with some info in the documentation. Complete the README. · 9200a759
  Julien Plu authored Dec 05, 2019
  
  9200a759
- fix #1450 - add doc · 75a97af6
  thomwolf authored Dec 05, 2019
  
  75a97af6
27 Nov, 2019 4 commits
- update readme · 3e7656f7
  VictorSanh authored Nov 05, 2019
  
  3e7656f7
- mbert reproducibility results · 84a0b522
  VictorSanh authored Oct 29, 2019
  
  84a0b522
- add xnli examples/README.md · d52e98ff
  VictorSanh authored Oct 29, 2019
  
  d52e98ff
- Uniformize #1952 · b5d884d2
  Julien Chaumond authored Nov 27, 2019
  
  b5d884d2
23 Nov, 2019 1 commit
- [doc] homogenize instructions slightly · 176cd1ce
  Julien Chaumond authored Nov 23, 2019
  
  176cd1ce
21 Nov, 2019 1 commit
- update the documentation · 26db31e0
  Rémi Louf authored Nov 20, 2019
  
  26db31e0
15 Nov, 2019 1 commit
- Update example readme · ca99a2d5
  Xu Hongshen authored Nov 15, 2019
  
  ca99a2d5
14 Nov, 2019 1 commit
- added small comparison between BERT, RoBERTa and DistilBERT · 05db5bc1
  Thomas Wolf authored Nov 14, 2019
  
  05db5bc1
04 Nov, 2019 1 commit
- Update example readme · ad908686
  thomwolf authored Nov 04, 2019
  
  ad908686
30 Oct, 2019 1 commit
- rename seq2seq to encoder_decoder · 3b0d2fa3
  Rémi Louf authored Oct 30, 2019
  
  3b0d2fa3