Commits · bb3bfa2d293589af0b3141c6f7235beba1c6bb44 · chenpangpang / transformers

21 Dec, 2019 9 commits

Distribute tests from the same file to the same worker. · bb3bfa2d

Aymeric Augustin authored Dec 20, 2019

This should prevent two issues:

- hitting API rate limits for tests that hit the HF API
- multiplying the cost of expensive test setups

bb3bfa2d

Parallelize tests on Circle CI. · 29cbab98

Aymeric Augustin authored Dec 20, 2019

Set the number of CPUs manually based on the Circle CI resource class,
or else we're getting 36 CPUs, which is far too much (perhaps that's
the underlying hardware and not what Circle CI allocates to us).

Don't parallelize the custom tokenizers tests because they take less
than one second to run and parallelization actually makes them slower.

29cbab98

Prevent parallel downloads of the same file with a lock. · a4c9338b

Aymeric Augustin authored Dec 20, 2019

Since the file is written to the filesystem, a filesystem lock is the
way to go here. Add a dependency on the third-party filelock library to
get cross-platform functionality.

a4c9338b

Take advantage of the cache when running tests. · b670c266

Aymeric Augustin authored Dec 20, 2019

Caching models across test cases and across runs of the test suite makes
slow tests somewhat more bearable.

Use gettempdir() instead of /tmp in tests. This makes it easier to
change the location of the cache with semi-standard TMPDIR/TEMP/TMP
environment variables.

Fix #2222.

b670c266

Download models directly to cache_dir. · b67fa1a8

Aymeric Augustin authored Dec 20, 2019

This allows moving the file instead of copying it, which is more
reliable. Also it avoids writing large amounts of data to /tmp,
which may not be large enough to accomodate it.

Refs #2222.

b67fa1a8

Use a random temp dir for writing pruned models in tests. · 286d5bb6
Aymeric Augustin authored Dec 20, 2019

286d5bb6
Use a random temp dir for writing file in tests. · 478e456e
Aymeric Augustin authored Dec 20, 2019

478e456e
Remove redundant torch.jit.trace in tests. · 12726f85
Aymeric Augustin authored Dec 20, 2019
```
This looks like it could be expensive, so don't run it twice.
```
12726f85
[doc] move distilroberta to more appropriate place · ac1b449c
Julien Chaumond authored Dec 21, 2019
```
cc @lysandrejik
```
ac1b449c

20 Dec, 2019 31 commits
- small refactoring (only esthetic, not functional) · a80778f4
  Francesco authored Dec 18, 2019
  
  a80778f4
- - Create the output directory (whose name is passed by the user in the... · 3df1d2d1
  Francesco authored Dec 17, 2019
```
- Create the output directory (whose name is passed by the user in the "save_directory" parameter) where it will be saved encoder and decoder, if not exists.
- Empty the output directory, if it contains any files or subdirectories.
- Create the "encoder" directory inside "save_directory", if not exists.
- Create the "decoder" directory inside "save_directory", if not exists.
- Save the encoder and the decoder in the previous two directories, respectively.
```
  3df1d2d1
- Release: v2.3.0 · a436574b
  Lysandre authored Dec 20, 2019
  
  a436574b
- Merge pull request #2244 from huggingface/fix-tok-pipe · d0f8b9a9
  Thomas Wolf authored Dec 20, 2019
```
Fix Camembert and XLM-R `decode` method- Fix NER pipeline alignement
```
  d0f8b9a9
- Merge pull request #2191 from huggingface/fix_sp_np · a557836a
  Thomas Wolf authored Dec 20, 2019
```
Numpy compatibility for sentence piece
```
  a557836a
- clean up · 655fd068
  thomwolf authored Dec 20, 2019
  
  655fd068
- clean up debug and less verbose tqdm · e5812462
  thomwolf authored Dec 20, 2019
  
  e5812462
- add overwrite - fix ner decoding · 4775ec35
  thomwolf authored Dec 20, 2019
  
  4775ec35
- Numpy compatibility for sentence piece · cb6d54bf
  Lysandre authored Dec 20, 2019
```
convert to int earlier
```
  cb6d54bf
- fix NER pipeline · f79a7dc6
  thomwolf authored Dec 20, 2019
  
  f79a7dc6
- fix pipeline NER · a2410110
  thomwolf authored Dec 20, 2019
  
  a2410110
- fix camembert and XLM-R tokenizer · e37ca8e1
  thomwolf authored Dec 20, 2019
  
  e37ca8e1
- fix mc loading · ceae85ad
  thomwolf authored Dec 20, 2019
  
  ceae85ad
- update link in readme · 71883b6d
  thomwolf authored Dec 20, 2019
  
  71883b6d
- Merge pull request #2243 from huggingface/fix-xlm-roberta · 8d5a47c7
  Thomas Wolf authored Dec 20, 2019
```
fixing xlm-roberta tokenizer max_length and automodels
```
  8d5a47c7
- update serving API · 79e4a6a2
  thomwolf authored Dec 20, 2019
  
  79e4a6a2
- fixing CLI pipeline · bbaaec04
  thomwolf authored Dec 20, 2019
  
  bbaaec04
- fixing xlm-roberta tokenizer max_length and automodels · 1c12ee0e
  thomwolf authored Dec 20, 2019
  
  1c12ee0e
- Clean special tokens test · 65c75fc5
  Lysandre authored Dec 20, 2019
  
  65c75fc5
- Added test for all special tokens · fb393ad9
  Lysandre authored Dec 20, 2019
  
  fb393ad9
- Keep even the first of the special tokens intact while lowercasing. · 90debb9f
  Dirk Groeneveld authored Dec 19, 2019
  
  90debb9f
- Added pipelines quick tour in README · b98ff885
  Morgan Funtowicz authored Dec 20, 2019
  
  b98ff885
- Merge pull request #1548 from huggingface/cli · 3a2c4e6f
  Thomas Wolf authored Dec 20, 2019
```
[2.2] - Command-line interface - Pipeline class
```
  3a2c4e6f
- add example for Model2Model in quickstart · 4e3f745b
  Rémi Louf authored Dec 20, 2019
  
  4e3f745b
- defaults models for tf and pt - update tests · db0795b5
  thomwolf authored Dec 20, 2019
  
  db0795b5
- Fix leading axis added when saving through the command run · 7f740845
  Morgan Funtowicz authored Dec 20, 2019
  
  7f740845
- clean up PT <=> TF 2.0 conversion and config loading · c37815f1
  thomwolf authored Dec 20, 2019
  
  c37815f1
- update serving command · 73fcebf7
  thomwolf authored Dec 20, 2019
  
  73fcebf7
- Merge pull request #2189 from stefan-it/xlmr · 59941c5d
  Thomas Wolf authored Dec 20, 2019
```
Add support for XLM-RoBERTa
```
  59941c5d
- remove python 2 tests for circle-ci cc @aaugustin @julien-c @LysandreJik · 15dda5ea
  thomwolf authored Dec 20, 2019
  
  15dda5ea
- update tests to remove unittest.patch · 01ffc65e
  thomwolf authored Dec 20, 2019
  
  01ffc65e