Commits · 899883644fcccc3482370994740d5882f15d3609 · chenpangpang / transformers

03 Oct, 2019 2 commits

Simon Layton authored Oct 03, 2019

Attention output was in bnij ordering instead of ijbn which everything
else will expect. This was an oversight on my part, and keeps the
attention inputs/outputs identical to the original code.

Also moved back from tensor slicing to index_select in rel_shift_bnij to
make the tracer happy.

89988364

Fix missed head transpose · 9ffda216
Simon Layton authored Oct 03, 2019

9ffda216

02 Oct, 2019 1 commit

Re-order attention head outputs for better perf · d51b5894

Simon Layton authored Sep 18, 2019

Significant performance boost over the original orderings
on an already somewhat optimised branch this gave me > 2x end-to-end
throughput on a squad xlnet fine-tuning task (batch 8, seq-length 612,
fp16)

d51b5894

01 Oct, 2019 6 commits
- fix #1260 - remove special logic for decoding pairs of sequence · 391db836
  thomwolf authored Oct 01, 2019
  
  391db836
- Merge pull request #1288 from echan00/master · 963529e2
  Thomas Wolf authored Oct 01, 2019
```
Typo with LM Fine tuning script
```
  963529e2
- use format instead of f-strings · f7978f70
  thomwolf authored Oct 01, 2019
  
  f7978f70
- Merge pull request #1284 from slayton58/pooler_end_logits_fp16_fix · 1e4a1913
  Thomas Wolf authored Oct 01, 2019
```
Fix fp16 masking in PoolerEndLogits
```
  1e4a1913
- Merge branch 'pooler_end_logits_fp16_fix' of... · c50783e3
  thomwolf authored Oct 01, 2019
```
Merge branch 'pooler_end_logits_fp16_fix' of https://github.com/slayton58/pytorch-transformers into pr/1284
```
  c50783e3
- Fix syntax typo in README.md · 6971556a
  DenysNahurnyi authored Oct 01, 2019
  
  6971556a
30 Sep, 2019 1 commit

Update README.md · 5c3b32d4

Santosh Gupta authored Sep 28, 2019

Lines 183 - 200, fixed indentation. Line 198, replaced `tokenizer_class` with `BertTokenizer`, since `tokenizer_class` is not defined in the loop it belongs to.

5c3b32d4

29 Sep, 2019 1 commit
- fix unknown imports (*ForMultipleChoice) in run_multiple_choice · 2dc8cb87
  VictorSanh authored Sep 29, 2019
  
  2dc8cb87
28 Sep, 2019 2 commits
- Merge pull request #1362 from FeiWang96/doc · ae50ad91
  Thomas Wolf authored Sep 28, 2019
```
fix link
```
  ae50ad91
- Fix link in readme · 60f79163
  wangfei authored Sep 28, 2019
  
  60f79163
27 Sep, 2019 18 commits
- fix padding_idx of RoBERTa model · a6a6d9e6
  Ikuya Yamada authored Sep 12, 2019
  
  a6a6d9e6
- 6 -> 8 models · d8b641c8
  Julien Chaumond authored Sep 27, 2019
  
  d8b641c8
- Close #1304 · c6acbdd5
  Julien Chaumond authored Sep 27, 2019
  
  c6acbdd5
- Merge pull request #1353 from wendingp/patch-1 · df7cd9e4
  Thomas Wolf authored Sep 27, 2019
```
Fix some typos
```
  df7cd9e4
- Merge pull request #1355 from agrinh/master · 6a17b3c5
  Thomas Wolf authored Sep 27, 2019
```
Fix tensorflow_dataset glue support
```
  6a17b3c5
- Merge pull request #1359 from dennymarcels/patch-1 · 04e9a6f5
  Thomas Wolf authored Sep 27, 2019
```
Update run_lm_finetuning.py
```
  04e9a6f5
- Update run_lm_finetuning.py · 94785906
  Denny authored Sep 27, 2019
```
The previous method, just as phrased, did not exist in the class.
```
  94785906
- Add docstring for processor method · 795b3e76
  Agrin Hilmkil authored Sep 27, 2019
  
  795b3e76
- Fix tensorflow_dataset glue support · e31a4728
  Agrin Hilmkil authored Sep 27, 2019
```
`glue_convert_examples_to_features` assumed that tensorflow_dataset
examples contains the features `'sentence1'` and `'sentence2'`. This
commit encapsulates the choice of features in the glue processor and
uses that to parse examples.
```
  e31a4728
- Fix some typos · 4f2b6579
  pj authored Sep 27, 2019
  
  4f2b6579
- Merge pull request #1349 from ogabrielluiz/master · ca559826
  Thomas Wolf authored Sep 27, 2019
```
Just some typos
```
  ca559826
- Just some typos · d2de5b9d
  Gabriel Luiz Freitas Almeida authored Sep 27, 2019
  
  d2de5b9d
- Merge pull request #1337 from mgrankin/fastdataset · d83d2957
  Thomas Wolf authored Sep 27, 2019
```
faster dataset building
```
  d83d2957
- Merge pull request #1346 from BramVanroy/documentation · f6de0003
  Thomas Wolf authored Sep 27, 2019
```
Add small  note about the output of hidden states (closes #1332)
```
  f6de0003
- Add small note about the output of hidden states · 15749bfc
  BramVanroy authored Sep 27, 2019
  
  15749bfc
- clean up a little run_tf_glue · da2e47ad
  thomwolf authored Sep 27, 2019
  
  da2e47ad
- clean up run_tf_glue · 528c288f
  thomwolf authored Sep 27, 2019
  
  528c288f
- fix input in run_glue for distilbert · 702f5898
  VictorSanh authored Sep 27, 2019
  
  702f5898
26 Sep, 2019 9 commits
- [docs] Fix doc auto-deploy · 22d2fded
  Julien Chaumond authored Sep 26, 2019
```
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
```
  22d2fded
- [docs] Doc tweaks · fc9faa8a
  Julien Chaumond authored Sep 26, 2019
```
Co-Authored-By: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
```
  fc9faa8a
- Update RoBERTa and GPT-2 Tokenizer documentation (fix #1343) · ecfddc60
  LysandreJik authored Sep 26, 2019
  
  ecfddc60
- Repository link in the documentation · 93f0c5fc
  LysandreJik authored Sep 26, 2019
  
  93f0c5fc
- typo in readme/doc · 6c3b1315
  thomwolf authored Sep 26, 2019
  
  6c3b1315
- Merge branch 'master' of https://github.com/huggingface/pytorch-transformers · f83b35b7
  thomwolf authored Sep 26, 2019
  
  f83b35b7
- update installation instructions in readme · 4e63c907
  thomwolf authored Sep 26, 2019
  
  4e63c907
- [Doc] XLM + Torch in documentation · 7e957237
  LysandreJik authored Sep 26, 2019
  
  7e957237
- Doc building requirements [TF2] · 302a4813
  LysandreJik authored Sep 26, 2019
  
  302a4813