- 14 Jun, 2019 5 commits
-
-
Thomas Wolf authored
Fixing issue "Training beyond specified 't_total' steps with schedule 'warmup_linear'" reported in #556
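A minimal sketch of a 'warmup_linear'-style schedule of the kind this fix concerns, assuming an illustrative function name and a clamp at zero once training runs past t_total; it is not the library's exact implementation.
```python
# Illustrative "warmup_linear"-style multiplier (not the library's exact code):
# ramps up linearly during warmup, decays linearly afterwards, and is clamped
# at zero so steps beyond t_total no longer yield a negative learning rate.
def warmup_linear_multiplier(step, t_total, warmup=0.1):
    x = step / t_total                      # training progress, may exceed 1.0
    if x < warmup:
        return x / warmup                   # linear warmup phase
    return max((1.0 - x) / (1.0 - warmup), 0.0)  # linear decay, clamped at 0
```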
-
Thomas Wolf authored
GPT-2 (medium size model, special_tokens, fine-tuning, attention) + repo code coverage metric
-
Thomas Wolf authored
-
thomwolf authored
-
thomwolf authored
-
- 12 Jun, 2019 1 commit
-
-
Thomas Wolf authored
[hotfix] Fix frozen pooler parameters in SWAG example.
-
- 11 Jun, 2019 2 commits
-
-
Meet Pragnesh Shah authored
-
Thomas Wolf authored
apply Whole Word Masking technique
-
- 10 Jun, 2019 1 commit
-
-
jeonsworld authored
apply Whole Word Masking technique, referring to [create_pretraining_data.py](https://github.com/google-research/bert/blob/master/create_pretraining_data.py)
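A minimal sketch of the Whole Word Masking idea referenced above, assuming WordPiece-style "##" continuation tokens; the helper below is illustrative and not the create_pretraining_data.py code (special tokens such as [CLS]/[SEP] are not handled).
```python
import random

# Group WordPiece sub-tokens into whole-word spans and make the masking
# decision per word, so every sub-token of a chosen word is masked together.
def whole_word_mask(tokens, mask_prob=0.15, mask_token="[MASK]"):
    word_spans = []
    for i, tok in enumerate(tokens):
        if word_spans and tok.startswith("##"):
            word_spans[-1].append(i)        # continuation piece joins the word
        else:
            word_spans.append([i])          # start of a new word

    output = list(tokens)
    labels = [None] * len(tokens)
    for span in word_spans:
        if random.random() < mask_prob:
            for i in span:                  # mask every sub-token of the word
                labels[i] = output[i]
                output[i] = mask_token
    return output, labels
```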
-
- 06 Jun, 2019 6 commits
-
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
- 03 Jun, 2019 3 commits
-
-
thomwolf authored
-
VictorSanh authored
This reverts commit de5e5682.
-
VictorSanh authored
-
- 31 May, 2019 8 commits
-
-
Thomas Wolf authored
Add GPT* compatibility to torchhub
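A hedged sketch of loading a GPT-2 model through torch.hub; the entry-point names passed to torch.hub.load are assumptions about the repository's hubconf, not verified identifiers.
```python
import torch

# Entry-point names ('gpt2Tokenizer', 'gpt2LMHeadModel') are assumed for
# illustration; torch.hub.load resolves them from the repo's hubconf.py.
tokenizer = torch.hub.load('huggingface/pytorch-pretrained-BERT', 'gpt2Tokenizer', 'gpt2')
model = torch.hub.load('huggingface/pytorch-pretrained-BERT', 'gpt2LMHeadModel', 'gpt2')
model.eval()
```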
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
VictorSanh authored
-
- 30 May, 2019 3 commits
-
-
VictorSanh authored
-
Victor SANH authored
-
VictorSanh authored
-
- 13 May, 2019 1 commit
-
-
samuelbroscheit authored
-
- 11 May, 2019 2 commits
-
-
samuel.broscheit authored
-
samuel.broscheit authored
Fixes https://github.com/huggingface/pytorch-pretrained-BERT/issues/556. The issue was that the number of optimization steps was computed from the example count, which differs from the actual size of the dataloader when an example is chunked into multiple instances. The solution in this pull request is to compute num_optimization_steps directly from len(data_loader).
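A minimal sketch of the change described above, with illustrative placeholder data and hyper-parameters: the step count is derived from len(train_dataloader) rather than from the number of raw examples.
```python
import torch
from torch.utils.data import DataLoader, RandomSampler, TensorDataset

# Illustrative setup: a dummy dataset standing in for the chunked training
# instances (in the real example scripts these come from the feature files).
train_data = TensorDataset(torch.zeros(1000, 128, dtype=torch.long))
train_batch_size = 32
gradient_accumulation_steps = 1
num_train_epochs = 3

train_dataloader = DataLoader(train_data, sampler=RandomSampler(train_data),
                              batch_size=train_batch_size)

# Count optimization steps from len(train_dataloader) (the actual number of
# batches) instead of from the number of raw examples, which differs once an
# example is chunked into several training instances.
num_optimization_steps = (len(train_dataloader) // gradient_accumulation_steps
                          * num_train_epochs)
```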
-
- 10 May, 2019 1 commit
-
-
Thomas Wolf authored
Updating learning rate with special warm up in examples
-
- 09 May, 2019 3 commits
-
-
burcturkoglu authored
-
-
burcturkoglu authored
-
- 08 May, 2019 4 commits
-
-
thomwolf authored
-
thomwolf authored
-
Thomas Wolf authored
Make the epsilon of LayerNorm configurable.
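A short sketch of what a configurable LayerNorm epsilon looks like; the config class and the layer_norm_eps attribute name here are assumptions for illustration.
```python
import torch.nn as nn

# The epsilon is read from the model config instead of being hard-coded.
class SimpleConfig:
    def __init__(self, hidden_size=768, layer_norm_eps=1e-12):
        self.hidden_size = hidden_size
        self.layer_norm_eps = layer_norm_eps

config = SimpleConfig(layer_norm_eps=1e-5)
layer_norm = nn.LayerNorm(config.hidden_size, eps=config.layer_norm_eps)
```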
-
Thomas Wolf authored
move pytorch_pretrained_bert cache folder under the same path as torch
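A sketch of resolving the cache folder under torch's cache location; the environment-variable fallbacks shown are illustrative of the approach rather than a verbatim copy of the library code.
```python
import os

# Resolve torch's cache home (TORCH_HOME, else XDG_CACHE_HOME/torch, else
# ~/.cache/torch) and place the pytorch_pretrained_bert cache inside it.
torch_cache_home = os.path.expanduser(
    os.getenv('TORCH_HOME',
              os.path.join(os.getenv('XDG_CACHE_HOME', '~/.cache'), 'torch')))
default_cache_path = os.path.join(torch_cache_home, 'pytorch_pretrained_bert')
```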
-