Commits · 1f5fc95b6831a2b41c45f51b2eaf9aa92d100ab7 · chenpangpang / transformers

30 Apr, 2019 2 commits
- add code coverage · 1f5fc95b
  thomwolf authored Apr 30, 2019
  
- add special tokens to gpt-2 · c30139a0
  thomwolf authored Apr 30, 2019
  
25 Apr, 2019 7 commits
- Release: 0.6.2 · b832d5bb
  thomwolf authored Apr 25, 2019
  
- Merge pull request #488 from dhpollack/fix_multichoice · e6cf62d4
  Thomas Wolf authored Apr 25, 2019
```
fixed BertForMultipleChoice model init and forward pass
```
- Merge pull request #533 from lukovnikov/master · 1cc1c3c3
  Thomas Wolf authored Apr 25, 2019
```
Docs for new learning rate code
```
- Merge pull request #518 from huggingface/schedules_in_examples · dee8af4e
  Thomas Wolf authored Apr 25, 2019
```
Fix training schedules in examples to match new API
```
- - replaced OpenAIGPTAdam with OpenAIAdam in docs · 56a47ce2
  lukovnikov authored Apr 25, 2019
  
- - replaced OpenAIGPTAdam with OpenAIAdam in docs · 331a46ff
  lukovnikov authored Apr 25, 2019
  
- - updated docs for new LR API · 704037ad
  lukovnikov authored Apr 25, 2019
```
- added some images for illustration
- updated comments in optimization
```
24 Apr, 2019 2 commits
- Merge pull request #506 from ailzhang/hubconf · d76a57b0
  Thomas Wolf authored Apr 24, 2019
```
Hubconf
```
- revert BertForMultipleChoice linear classifier · 80f995a1
  thomwolf authored Apr 24, 2019
  
23 Apr, 2019 4 commits
- fix training schedules in examples to match new API · d94c6b01
  thomwolf authored Apr 23, 2019
  
- Merge pull request #515 from Rocketknight1/master · c36cca07
  Thomas Wolf authored Apr 23, 2019
```
Fix --reduce_memory in finetune_on_pregenerated
```
- Merge pull request #512 from cynthia/master · 99e02c34
  Thomas Wolf authored Apr 23, 2019
```
Fix indentation weirdness in GPT-2 example.
```
- Merge pull request #445 from lukovnikov/master · 98cb7b2c
  Thomas Wolf authored Apr 23, 2019
```
Learning rate schedules improvement + extension
```
22 Apr, 2019 2 commits
- Made --reduce_memory actually do something in finetune_on_pregenerated · b8e2a9c5
  Matthew Carrigan authored Apr 22, 2019
  
- Merge pull request #1 from huggingface/master · af8a0384
  Matt authored Apr 22, 2019
```
Pulling commits from main repo
```
21 Apr, 2019 4 commits
- Fix indentation weirdness in GPT-2 example. · 14b1f719
  Sangwhan Moon authored Apr 22, 2019
  
- python 2 compat · 69850b40
  lukovnikov authored Apr 21, 2019
  
- - removed __all__ in optimization · bb7557d3
  lukovnikov authored Apr 21, 2019
```
- removed unused plotting code
- using ABC for LRSchedule
- added some schedule object init tests
```
- Merge remote-tracking branch 'upstream/master' · 34ccc8eb
  lukovnikov authored Apr 21, 2019
  
17 Apr, 2019 16 commits
- fix from_pretrained positional args · bfd6f6b2
  Ailing Zhang authored Apr 17, 2019
  
- add hubconf · ae4c9fee
  Ailing Zhang authored Apr 13, 2019
  
- Merge pull request #500 from huggingface/network · 68a889ee
  Thomas Wolf authored Apr 17, 2019
```
Updating network handling
```
- small clean up in tests · 34ae5bf8
  thomwolf authored Apr 17, 2019
  
- is python 2 happy now · 23d4554e
  thomwolf authored Apr 17, 2019
  
- relax network connection requirements · 265550ec
  thomwolf authored Apr 17, 2019
  
- fix file_utils on python 2 · fa765202
  thomwolf authored Apr 17, 2019
  
- fix #497 · bcde2c61
  thomwolf authored Apr 17, 2019
  
- fix #497 · 929579f3
  thomwolf authored Apr 17, 2019
  
- adding s3 model tests with --runslow · 31d38760
  thomwolf authored Apr 17, 2019
  
- Merge pull request #494 from SudoSharma/patch-1 · 8407429d
  Thomas Wolf authored Apr 17, 2019
```
Fix indentation for unconditional generation
```
- Merge pull request #495 from SudoSharma/patch-2 · 2e153930
  Thomas Wolf authored Apr 17, 2019
```
Fix gradient overflow issue during attention mask
```
- Merge pull request #496 from 8enmann/patch-1 · 46078e1b
  Thomas Wolf authored Apr 17, 2019
```
[run_gpt2.py] temperature should be a float, not int
```
- Merge pull request #498 from huggingface/GPT2_tokenization · b8686130
  Thomas Wolf authored Apr 17, 2019
```
Gpt2 tokenization
```
- fix GPT-2 tokenization to work also on python 3... · 5afa497c
  thomwolf authored Apr 17, 2019
  
- fixed GPT-2 tokenization on python 2 · bc70779b
  thomwolf authored Apr 17, 2019
  
16 Apr, 2019 3 commits

- [run_gpt2.py] temperature should be a float, not int · 87677fcc
  Ben Mann authored Apr 16, 2019
- Fix gradient overflow issue during attention mask · 9e666aaa
  Abhi Sharma authored Apr 16, 2019

  This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!
- Fix indentation for unconditional generation · 07154dad
  Abhi Sharma authored Apr 16, 2019
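
The 1e10 to 1e4 change described in commit 9e666aaa can be illustrated with a small sketch. It is not the repository's code: the helper names `to_fp16` and `mask_scores`, and the mask form `w * b - big * (1 - b)`, are assumptions chosen for illustration, and half precision is emulated with Python's `struct` module rather than a GPU. The point it shows is only that 1e10 lies outside the finite range of half precision (largest finite value 65504), while 1e4 is representable.

```python
import math
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value.

    struct raises OverflowError for finite values beyond the fp16 range;
    this sketch maps that case to +/-inf, as a rounding cast would.
    """
    try:
        return struct.unpack("e", struct.pack("e", x))[0]
    except OverflowError:
        return math.copysign(math.inf, x)


def mask_scores(scores, mask, big):
    """Hypothetical mask: w * b - big * (1 - b), rounded to fp16 at each step.

    mask entries are 1.0 for positions to keep and 0.0 for positions to hide.
    """
    big16 = to_fp16(big)  # the constant itself is rounded first (an assumption)
    out = []
    for w, b in zip(scores, mask):
        kept = to_fp16(w * b)
        penalty = to_fp16(big16 * (1.0 - b))  # inf * 0.0 is nan
        out.append(to_fp16(kept - penalty))
    return out


scores = [2.0, -1.0, 0.5]
mask = [1.0, 1.0, 0.0]

# With 1e10 the constant overflows to inf, so kept positions turn into nan.
print(mask_scores(scores, mask, 1e10))

# With 1e4 every value stays finite and kept positions are unchanged.
print(mask_scores(scores, mask, 1e4))
```

Under these assumptions the masked position still receives a large negative score with 1e4, which is consistent with the commit author's report that generation was unchanged.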