- 03 Oct, 2019 2 commits
-
-
Simon Layton authored
Attention output was in bnij ordering instead of the ijbn ordering that everything else expects. This was an oversight on my part; the fix keeps the attention inputs/outputs identical to the original code. Also moved back from tensor slicing to index_select in rel_shift_bnij to make the tracer happy.
-
Simon Layton authored
-
- 02 Oct, 2019 1 commit
-
-
Simon Layton authored
Significant performance boost over the original orderings. On an already somewhat optimised branch, this gave me > 2x end-to-end throughput on a SQuAD XLNet fine-tuning task (batch 8, seq-length 612, fp16).
-
- 01 Oct, 2019 6 commits
-
-
thomwolf authored
-
Thomas Wolf authored
Typo with LM Fine tuning script
-
thomwolf authored
-
Thomas Wolf authored
Fix fp16 masking in PoolerEndLogits
-
thomwolf authored
Merge branch 'pooler_end_logits_fp16_fix' of https://github.com/slayton58/pytorch-transformers into pr/1284
-
DenysNahurnyi authored
-
- 30 Sep, 2019 1 commit
-
-
Santosh Gupta authored
Lines 183 - 200, fixed indentation. Line 198, replaced `tokenizer_class` with `BertTokenizer`, since `tokenizer_class` is not defined in the loop it belongs to.
-
- 29 Sep, 2019 1 commit
-
-
VictorSanh authored
-
- 28 Sep, 2019 2 commits
-
-
Thomas Wolf authored
fix link
-
wangfei authored
-
- 27 Sep, 2019 18 commits
-
-
Ikuya Yamada authored
-
Julien Chaumond authored
-
Julien Chaumond authored
-
Thomas Wolf authored
Fix some typos
-
Thomas Wolf authored
Fix tensorflow_dataset glue support
-
Thomas Wolf authored
Update run_lm_finetuning.py
-
Denny authored
The previous method, just as phrased, did not exist in the class.
-
Agrin Hilmkil authored
-
Agrin Hilmkil authored
`glue_convert_examples_to_features` assumed that tensorflow_dataset examples contain the features `'sentence1'` and `'sentence2'`. This commit encapsulates the choice of features in the glue processor and uses that to parse examples.
-
pj authored
-
Thomas Wolf authored
Just some typos
-
Gabriel Luiz Freitas Almeida authored
-
Thomas Wolf authored
faster dataset building
-
Thomas Wolf authored
Add small note about the output of hidden states (closes #1332)
-
BramVanroy authored
-
thomwolf authored
-
thomwolf authored
-
VictorSanh authored
-
- 26 Sep, 2019 9 commits
-
-
Julien Chaumond authored
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
Julien Chaumond authored
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
LysandreJik authored
-
LysandreJik authored
-
thomwolf authored
-
-
thomwolf authored
-
LysandreJik authored
-
LysandreJik authored
-