- 21 Dec, 2019 40 commits
-
-
Aymeric Augustin authored
Likely it was added by accident.
-
Thomas Wolf authored
adding positional embeds masking to TFRoBERTa
-
Thomas Wolf authored
[WIP] Add MMBT Model to Transformers Repo
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
closes #1960 Add saving and resuming functionality for remaining examples
-
Thomas Wolf authored
[BREAKING CHANGE] Setting all ignored index to the PyTorch standard
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
-
Thomas Wolf authored
run_squad with roberta
-
Thomas Wolf authored
fix: wrong architecture count in README
-
Thomas Wolf authored
:zip: #2106 tokenizer.tokenize speed improvement (3-8x) by caching added_tokens in a Set
-
Thomas Wolf authored
-
Thomas Wolf authored
[WIP] Sampling sequence generator for transformers
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
fix run_squad.py during fine-tuning xlnet on squad2.0
-
thomwolf authored
-
Thomas Wolf authored
-
Thomas Wolf authored
Fix TFXLNet
-
Thomas Wolf authored
Add special tokens to documentation for bert examples to resolve issue: #1561
-
Thomas Wolf authored
Bug-fix: Roberta Embeddings Not Masked
-
Thomas Wolf authored
Support running tests in parallel
-
Aymeric Augustin authored
This looks like debug code accidentally committed in b18509c2. Refs #2250.
-
Aymeric Augustin authored
This looks like a copy/paste mistake. Probably this test was never run. Refs #2250.
-
Aymeric Augustin authored
This optimizes the total run time of the Circle CI test suite.
-
Aymeric Augustin authored
We're already using as many processes in parallel as we have CPU cores. Furthermore, the number of core may be incorrectly calculated as 36 (we've seen this in pytest-xdist) which make compound the problem. PyTorch performance craters without this.
-
Aymeric Augustin authored
This should prevent two issues: - hitting API rate limits for tests that hit the HF API - multiplying the cost of expensive test setups
-
Aymeric Augustin authored
Set the number of CPUs manually based on the Circle CI resource class, or else we're getting 36 CPUs, which is far too much (perhaps that's the underlying hardware and not what Circle CI allocates to us). Don't parallelize the custom tokenizers tests because they take less than one second to run and parallelization actually makes them slower.
-
Aymeric Augustin authored
Since the file is written to the filesystem, a filesystem lock is the way to go here. Add a dependency on the third-party filelock library to get cross-platform functionality.
-
Aymeric Augustin authored
Caching models across test cases and across runs of the test suite makes slow tests somewhat more bearable. Use gettempdir() instead of /tmp in tests. This makes it easier to change the location of the cache with semi-standard TMPDIR/TEMP/TMP environment variables. Fix #2222.
-
Aymeric Augustin authored
This allows moving the file instead of copying it, which is more reliable. Also it avoids writing large amounts of data to /tmp, which may not be large enough to accomodate it. Refs #2222.
-
Aymeric Augustin authored
-
Aymeric Augustin authored
-
Aymeric Augustin authored
This looks like it could be expensive, so don't run it twice.
-
Julien Chaumond authored
cc @lysandrejik
-
Julien Chaumond authored
-