- 20 Jul, 2020 14 commits
-
-
Sam Shleifer authored
Huge MT speedup!
-
Sylvain Gugger authored
* Improve doc of use_cache
* Update src/transformers/configuration_xlnet.py

Co-authored-by: Teven <teven.lescao@gmail.com>
-
Clement authored
-
Clement authored
-
Clement authored
-
Clement authored
-
Clement authored
-
Sam Shleifer authored
-
Stas Bekman authored
* DataParallel fixes:
  1. switched to a more precise check:
     - if self.args.n_gpu > 1:
     + if isinstance(model, nn.DataParallel):
  2. fix tests: require the same fixup under DataParallel as the training module
* another fix
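A minimal sketch of the distinction this commit relies on (the helper name `mean_if_parallel` is illustrative, not actual Trainer code): `n_gpu > 1` only says several GPUs are visible, while `isinstance(model, nn.DataParallel)` confirms the model was actually wrapped, which is what determines whether the per-replica losses need averaging.

```python
import torch
import torch.nn as nn

def mean_if_parallel(loss: torch.Tensor, model: nn.Module) -> torch.Tensor:
    # Under nn.DataParallel, loss comes back as a vector with one entry
    # per GPU replica, so it must be averaged before backward().
    # Checking the wrapper type is more precise than `n_gpu > 1`:
    # several GPUs can be visible without the model being wrapped.
    if isinstance(model, nn.DataParallel):
        return loss.mean()
    return loss
```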
-
Pradhy729 authored
* Don't pass sampler for iterable dataset
* Added check for test and eval dataloaders.
* Formatting
* Cleaner if nesting.
* Added test for trainer and iterable dataset
* Formatting for test
* Fixed import when torch is available only.
* Added require torch decorator to helper class
* Moved dataset class inside unittest
* Removed nested if and changed model in test
* Checking torch availability for IterableDataset
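A hedged sketch of the rule behind the first item (the `build_dataloader` helper is illustrative, not the Trainer's actual method): an `IterableDataset` defines its own iteration order, and `DataLoader` raises a `ValueError` if a sampler is supplied for one, so a sampler should only be built for map-style datasets.

```python
from torch.utils.data import DataLoader, IterableDataset, RandomSampler

def build_dataloader(dataset, batch_size: int) -> DataLoader:
    # IterableDataset yields examples in its own order; DataLoader
    # rejects any sampler for it, so only map-style datasets get one.
    if isinstance(dataset, IterableDataset):
        return DataLoader(dataset, batch_size=batch_size)
    return DataLoader(dataset, batch_size=batch_size, sampler=RandomSampler(dataset))
```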
-
Julien Chaumond authored
cc @lhoestq @thomwolf. Also cc'ing model author @nreimers. => Model pages now properly link to the dataset pages (and, in the future, eval results, etc.)
-
Manuel Romero authored
-
Alan deLevie authored
-
Alan deLevie authored
-
- 18 Jul, 2020 8 commits
-
-
Sam Shleifer authored
Co-authored-by: Pradhy729 <49659913+Pradhy729@users.noreply.github.com>
-
Teven authored
Slightly breaking change that alters the behavior of `use_cache` in XLNet: if `use_cache` is True and `mem_len` is 0 or None (as in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.
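A rough illustration of the described round-trip, assuming the transformers 3.x tuple-output API (the real `generate()` additionally builds XLNet's permutation/target mapping, which is omitted here):

```python
import torch
from transformers import XLNetLMHeadModel, XLNetTokenizer

tokenizer = XLNetTokenizer.from_pretrained("xlnet-base-cased")
model = XLNetLMHeadModel.from_pretrained("xlnet-base-cased")
model.eval()

input_ids = tokenizer("The quick brown fox", return_tensors="pt")["input_ids"]

with torch.no_grad():
    # With use_cache=True and mem_len left at its base-config value
    # (0/None), the outputs now include mems, analogous to GPT-2's past.
    logits, mems = model(input_ids, use_cache=True)[:2]

    # Next step: pass only the new token plus the cached mems
    # instead of re-encoding the whole prefix.
    next_token = logits[:, -1, :].argmax(dim=-1, keepdim=True)
    logits, mems = model(next_token, mems=mems, use_cache=True)[:2]
```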
-
Teven authored
Slightly breaking change that alters the behavior of `use_cache` in XLNet: if `use_cache` is True and `mem_len` is 0 or None (as in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.
-
Sebastian authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Nathan Raw authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 17 Jul, 2020 11 commits
-
-
Teven authored
Slightly breaking change that alters the behavior of `use_cache` in XLNet: if `use_cache` is True and `mem_len` is 0 or None (as in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.
-
Jannes authored
-
Julien Chaumond authored
-
Gianpaolo Di Pietro authored
* Added model card for neuraly/bert-base-italian-cased-sentiment
* Update model_cards/neuraly/bert-base-italian-cased-sentiment/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Gianpy15 <g.dipietro@neuraly.ai>
-
Patrick von Platen authored
Add Rouge2 results
-
Patrick von Platen authored
* fix merge rebase
* add intermediate reformer code
* save intermediate caching results
* save intermediate results
* upload next step
* fix generate tests
* make tests work
* add named tuple output
* Apply suggestions from code review
* fix use_cache for False case
* fix tensor to gpu
* refactor and make style
-
Patrick von Platen authored
* Create README.md
* Update README.md
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Bayartsogt Yadamsuren authored
* language tag addition on albert-mongolian
* Update model_cards/bayartsogt/albert-mongolian/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 16 Jul, 2020 7 commits
-
-
Manuel Romero authored
-
Nick Doiron authored
-
Sam Shleifer authored
-
Manuel Romero authored
Fix missing "-" in metadata
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-