- 21 Dec, 2019 9 commits
-
-
Aymeric Augustin authored
This should prevent two issues:
  - hitting API rate limits for tests that hit the HF API
  - multiplying the cost of expensive test setups
-
Aymeric Augustin authored
Set the number of CPUs manually based on the Circle CI resource class, or else we're getting 36 CPUs, which is far too much (perhaps that's the underlying hardware and not what Circle CI allocates to us). Don't parallelize the custom tokenizers tests because they take less than one second to run and parallelization actually makes them slower.
-
Aymeric Augustin authored
Since the file is written to the filesystem, a filesystem lock is the way to go here. Add a dependency on the third-party filelock library to get cross-platform functionality.
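A minimal sketch of the filesystem-lock idea described in this commit message, assuming the third-party `filelock` package is installed. The helper name `write_once` and the `.lock` suffix are hypothetical choices for illustration, not taken from the commit itself.

```python
import os
import tempfile

from filelock import FileLock  # third-party: pip install filelock


def write_once(path, data):
    """Write `data` to `path` unless the file already exists (hypothetical helper).

    The lock file sits next to the target, so every process that wants to
    write the same path contends for the same lock.
    """
    lock_path = path + ".lock"  # assumed naming convention
    with FileLock(lock_path):
        if os.path.exists(path):
            # Another caller got here first; nothing to do.
            return False
        with open(path, "wb") as f:
            f.write(data)
        return True


cache_dir = tempfile.mkdtemp()
target = os.path.join(cache_dir, "model.bin")
first = write_once(target, b"weights")   # creates the file
second = write_once(target, b"other")    # sees the existing file and skips
```

Checking for the file only after acquiring the lock is what keeps two concurrent callers from both writing it.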
-
Aymeric Augustin authored
Caching models across test cases and across runs of the test suite makes slow tests somewhat more bearable. Use gettempdir() instead of /tmp in tests. This makes it easier to change the location of the cache with semi-standard TMPDIR/TEMP/TMP environment variables. Fix #2222.
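As a rough illustration of the `gettempdir()` approach, the sketch below builds a cache directory under the platform's temporary directory. The function name `cache_dir_for_tests` and the directory name are hypothetical, not the ones used in the test suite.

```python
import os
import tempfile


def cache_dir_for_tests(name="transformers_test_cache"):
    """Return a cache directory under the system temp dir (hypothetical helper)."""
    # gettempdir() consults TMPDIR, TEMP and TMP before falling back to a
    # platform default such as /tmp, so the location can be changed without
    # editing the tests.
    path = os.path.join(tempfile.gettempdir(), name)
    os.makedirs(path, exist_ok=True)
    return path


cache_path = cache_dir_for_tests()
```

Note that `tempfile` caches the result of the first `gettempdir()` call, so the environment variable has to be set before the process first asks for it.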
-
Aymeric Augustin authored
This allows moving the file instead of copying it, which is more reliable. Also it avoids writing large amounts of data to /tmp, which may not be large enough to accommodate it. Refs #2222.
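The commit subject is not shown here, so the following is only a sketch of one way to get "move instead of copy": write the temporary file inside the destination directory, then rename it into place. The helper `store_in_cache` and its arguments are hypothetical.

```python
import os
import tempfile


def store_in_cache(cache_dir, filename, chunks):
    """Write `chunks` to a temp file in `cache_dir`, then move it into place."""
    os.makedirs(cache_dir, exist_ok=True)
    # Creating the temp file in the destination directory keeps both paths on
    # the same filesystem, so the final step is a rename rather than a copy,
    # and nothing large is ever written to /tmp.
    with tempfile.NamedTemporaryFile(dir=cache_dir, delete=False) as tmp:
        for chunk in chunks:
            tmp.write(chunk)
        tmp_path = tmp.name
    final_path = os.path.join(cache_dir, filename)
    os.replace(tmp_path, final_path)
    return final_path


cache = tempfile.mkdtemp()
stored = store_in_cache(cache, "weights.bin", [b"ab", b"cd"])
```

Because the rename happens only after the file is fully written and closed, a reader never sees a partially written file under the final name.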
-
Aymeric Augustin authored
-
Aymeric Augustin authored
-
Aymeric Augustin authored
This looks like it could be expensive, so don't run it twice.
-
Julien Chaumond authored
cc @lysandrejik
-
- 20 Dec, 2019 31 commits
-
-
Francesco authored
-
Francesco authored
  - Create the output directory (whose name the user passes in the "save_directory" parameter) where the encoder and decoder will be saved, if it does not exist.
  - Empty the output directory if it contains any files or subdirectories.
  - Create the "encoder" directory inside "save_directory", if it does not exist.
  - Create the "decoder" directory inside "save_directory", if it does not exist.
  - Save the encoder and the decoder in these two directories, respectively.
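The directory-handling steps listed in this commit message can be sketched as follows. This is an illustration under stated assumptions, not the committed code: the function name `prepare_save_directory` is hypothetical, and the final save step is left to the caller.

```python
import shutil
import tempfile
from pathlib import Path


def prepare_save_directory(save_directory):
    """Create and empty `save_directory`, then add encoder/decoder subdirectories."""
    root = Path(save_directory)
    root.mkdir(parents=True, exist_ok=True)
    # Remove whatever is already there, files and subdirectories alike.
    for entry in root.iterdir():
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)
        else:
            entry.unlink()
    encoder_dir = root / "encoder"
    decoder_dir = root / "decoder"
    encoder_dir.mkdir(exist_ok=True)
    decoder_dir.mkdir(exist_ok=True)
    return encoder_dir, decoder_dir


# Demonstration on a scratch directory containing one stale file.
demo_root = Path(tempfile.mkdtemp()) / "out"
demo_root.mkdir()
(demo_root / "stale.txt").write_text("old")
enc_dir, dec_dir = prepare_save_directory(demo_root)
```

A caller would then save each sub-model into its own directory, for example by passing `enc_dir` and `dec_dir` to the respective `save_pretrained` calls. Since the directory is emptied unconditionally, pointing it at a path that holds unrelated files would delete them.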
-
Lysandre authored
-
Thomas Wolf authored
Fix Camembert and XLM-R `decode` method - Fix NER pipeline alignment
-
Thomas Wolf authored
Numpy compatibility for sentence piece
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
Lysandre authored
convert to int earlier
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
fixing xlm-roberta tokenizer max_length and automodels
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
Lysandre authored
-
Lysandre authored
-
Dirk Groeneveld authored
-
Morgan Funtowicz authored
-
Thomas Wolf authored
[2.2] - Command-line interface - Pipeline class
-
Rémi Louf authored
-
thomwolf authored
-
Morgan Funtowicz authored
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
Add support for XLM-RoBERTa
-
thomwolf authored
-
thomwolf authored
-