Commits · 63e3827c6bc5af9807b77e07fdcdae74b7d57161 · chenpangpang / transformers

21 Dec, 2019 40 commits
- Remove empty file. · 63e3827c
  Aymeric Augustin authored Dec 21, 2019
```
Likely it was added by accident.
```
  63e3827c
- Merge pull request #2254 from huggingface/fix-tfroberta · 645713e2
  Thomas Wolf authored Dec 21, 2019
```
adding positional embeds masking to TFRoBERTa
```
  645713e2
- Merge pull request #2115 from suvrat96/add_mmbt_model · 73f6e981
  Thomas Wolf authored Dec 21, 2019
```
[WIP] Add MMBT Model to Transformers Repo
```
  73f6e981
- adding positional embeds masking to TFRoBERTa · 77676c27
  thomwolf authored Dec 21, 2019
  
  77676c27
- move example to mm-imdb folder · 344126fe
  thomwolf authored Dec 21, 2019
  
  344126fe
- Merge pull request #2134 from bkkaggle/saving-and-resuming · 5b7fb6a4
  Thomas Wolf authored Dec 21, 2019
```
closes #1960 Add saving and resuming functionality for remaining examples
```
  5b7fb6a4
- Merge pull request #2130 from huggingface/ignored-index-coherence · 6f68d559
  Thomas Wolf authored Dec 21, 2019
```
[BREAKING CHANGE] Setting all ignored index to the PyTorch standard
```
  6f68d559
- Merge branch 'master' into pr/2115 · 1ab25c49
  thomwolf authored Dec 21, 2019
  
  1ab25c49
- fix merge · b03872aa
  thomwolf authored Dec 21, 2019
  
  b03872aa
- Merge branch 'master' into saving-and-resuming · 518ba748
  Thomas Wolf authored Dec 21, 2019
  
  518ba748
- Merge pull request #2173 from erenup/master · 18601c3b
  Thomas Wolf authored Dec 21, 2019
```
run_squad with roberta
```
  18601c3b
- Merge pull request #2203 from gthb/patch-1 · 6e7102cf
  Thomas Wolf authored Dec 21, 2019
```
fix: wrong architecture count in README
```
  6e7102cf
- Merge pull request #2177 from mandubian/issue-2106 · deceb001
  Thomas Wolf authored Dec 21, 2019
```
:zip: #2106 tokenizer.tokenize speed improvement (3-8x) by caching added_tokens in a Set
```
  deceb001
- Merge branch 'master' into saving-and-resuming · eeb70cdd
  Thomas Wolf authored Dec 21, 2019
  
  eeb70cdd
- Merge pull request #1840 from huggingface/generation_sampler · ed9b8481
  Thomas Wolf authored Dec 21, 2019
```
[WIP] Sampling sequence generator for transformers
```
  ed9b8481
- update doc · f86ed231
  thomwolf authored Dec 21, 2019
  
  f86ed231
- Merge branch 'master' into generation_sampler · cfa03805
  thomwolf authored Dec 21, 2019
  
  cfa03805
- fixing run_generation example - using torch.no_grad · 300ec300
  thomwolf authored Dec 21, 2019
  
  300ec300
- fixing run_generation · 1c377468
  thomwolf authored Dec 21, 2019
  
  1c377468
- Merge pull request #1803 from importpandas/fix-xlnet-squad2.0 · 7e17f09f
  Thomas Wolf authored Dec 21, 2019
```
fix run_squad.py during fine-tuning xlnet on squad2.0
```
  7e17f09f
- fix merge · 8a2be93b
  thomwolf authored Dec 21, 2019
  
  8a2be93b
- Merge branch 'master' into fix-xlnet-squad2.0 · 562f8640
  Thomas Wolf authored Dec 21, 2019
  
  562f8640
- Merge pull request #1736 from huggingface/fix-tf-xlnet · 8618bf15
  Thomas Wolf authored Dec 21, 2019
```
Fix TFXLNet
```
  8618bf15
- Merge pull request #1586 from enzoampil/include_special_tokens_in_bert_examples · 2fa8737c
  Thomas Wolf authored Dec 21, 2019
```
Add special tokens to documentation for bert examples to resolve issue: #1561
```
  2fa8737c
- Merge pull request #1764 from DomHudson/bug-fix-1761 · f15f0871
  Thomas Wolf authored Dec 21, 2019
```
Bug-fix: Roberta Embeddings Not Masked
```
  f15f0871
- Merge pull request #2217 from aaugustin/test-parallelization · fae4d1c2
  Thomas Wolf authored Dec 21, 2019
```
Support running tests in parallel
```
  fae4d1c2
- Restore test. · b8e924e1
  Aymeric Augustin authored Dec 21, 2019
```
This looks like debug code accidentally committed in b18509c2.

Refs #2250.
```
  b8e924e1
- Fix typo in model name. · 767bc3ca
  Aymeric Augustin authored Dec 21, 2019
```
This looks like a copy/paste mistake. Probably this test was never run.

Refs #2250.
```
  767bc3ca
- Run examples separately from tests. · 343c094f
  Aymeric Augustin authored Dec 20, 2019
```
This optimizes the total run time of the Circle CI test suite.
```
  343c094f
- Prevent excessive parallelism in PyTorch. · 80caf79d
  Aymeric Augustin authored Dec 20, 2019
```
We're already using as many processes in parallel as we have CPU cores.
Furthermore, the number of core may be incorrectly calculated as 36
(we've seen this in pytest-xdist) which make compound the problem.

PyTorch performance craters without this.
```
  80caf79d
- Distribute tests from the same file to the same worker. · bb3bfa2d
  Aymeric Augustin authored Dec 20, 2019
```
This should prevent two issues:

- hitting API rate limits for tests that hit the HF API
- multiplying the cost of expensive test setups
```
  bb3bfa2d
- Parallelize tests on Circle CI. · 29cbab98
  Aymeric Augustin authored Dec 20, 2019
```
Set the number of CPUs manually based on the Circle CI resource class,
or else we're getting 36 CPUs, which is far too much (perhaps that's
the underlying hardware and not what Circle CI allocates to us).

Don't parallelize the custom tokenizers tests because they take less
than one second to run and parallelization actually makes them slower.
```
  29cbab98
- Prevent parallel downloads of the same file with a lock. · a4c9338b
  Aymeric Augustin authored Dec 20, 2019
```
Since the file is written to the filesystem, a filesystem lock is the
way to go here. Add a dependency on the third-party filelock library to
get cross-platform functionality.
```
  a4c9338b
- Take advantage of the cache when running tests. · b670c266
  Aymeric Augustin authored Dec 20, 2019
```
Caching models across test cases and across runs of the test suite makes
slow tests somewhat more bearable.

Use gettempdir() instead of /tmp in tests. This makes it easier to
change the location of the cache with semi-standard TMPDIR/TEMP/TMP
environment variables.

Fix #2222.
```
  b670c266
- Download models directly to cache_dir. · b67fa1a8
  Aymeric Augustin authored Dec 20, 2019
```
This allows moving the file instead of copying it, which is more
reliable. Also it avoids writing large amounts of data to /tmp,
which may not be large enough to accomodate it.

Refs #2222.
```
  b67fa1a8
- Use a random temp dir for writing pruned models in tests. · 286d5bb6
  Aymeric Augustin authored Dec 20, 2019
  
  286d5bb6
- Use a random temp dir for writing file in tests. · 478e456e
  Aymeric Augustin authored Dec 20, 2019
  
  478e456e
- Remove redundant torch.jit.trace in tests. · 12726f85
  Aymeric Augustin authored Dec 20, 2019
```
This looks like it could be expensive, so don't run it twice.
```
  12726f85
- [doc] move distilroberta to more appropriate place · ac1b449c
  Julien Chaumond authored Dec 21, 2019
```
cc @lysandrejik
```
  ac1b449c
- [RoBERTa] Embeddings: fix dimensionality bug · 3e52915f
  Julien Chaumond authored Dec 20, 2019
  
  3e52915f