- 25 Apr, 2019 1 commit
lukovnikov authored
- added some images for illustration
- updated comments in optimization
- 21 Apr, 2019 2 commits
lukovnikov authored
lukovnikov authored
- removed unused plotting code
- using an abstract base class (ABC) for LRSchedule (see the sketch below)
- added some schedule object init tests
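For readers unfamiliar with the pattern, here is a minimal sketch of what moving a learning-rate schedule onto an abstract base class looks like. The names used (LRSchedule, WarmupLinearSchedule, get_lr, warmup) are illustrative placeholders under assumption, not necessarily the repository's actual API.

```python
from abc import ABC, abstractmethod

class LRSchedule(ABC):
    """Hypothetical base class: maps training progress in [0, 1] to an LR multiplier."""

    def __init__(self, warmup: float = 0.002):
        self.warmup = warmup

    @abstractmethod
    def get_lr(self, progress: float) -> float:
        """Return the learning-rate multiplier at the given training progress."""

class WarmupLinearSchedule(LRSchedule):
    """Linear warmup followed by linear decay to zero."""

    def get_lr(self, progress: float) -> float:
        if progress < self.warmup:
            return progress / self.warmup
        return max((1.0 - progress) / (1.0 - self.warmup), 0.0)

# LRSchedule() itself cannot be instantiated; the ABC forces subclasses to define get_lr.
schedule = WarmupLinearSchedule(warmup=0.1)
print(schedule.get_lr(0.05), schedule.get_lr(0.5))
```

The point of the ABC is that each concrete schedule only has to implement get_lr, while shared initialization (and any init-time validation the tests exercise) lives in the base class.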
- 17 Apr, 2019 8 commits
- 16 Apr, 2019 3 commits
Abhi Sharma authored
This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!
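A brief note on why the constant matters under mixed precision: fp16's largest finite value is about 65504, so a masking fill value of 1e10 overflows, while 1e4 is representable and still large enough to suppress masked positions after the softmax. Below is a minimal sketch of the effect, assuming a hypothetical masked_softmax helper; this is not the repository's actual attention code.

```python
import torch

def masked_softmax(scores: torch.Tensor, mask: torch.Tensor, big: torch.Tensor) -> torch.Tensor:
    # mask == 1 keeps a position, mask == 0 pushes its score far down before the softmax
    return torch.softmax(scores * mask - big * (1 - mask), dim=-1)

scores = torch.randn(1, 4)
mask = torch.tensor([[1.0, 1.0, 0.0, 1.0]])

# In fp32, 1e10 and 1e4 behave identically: the masked position gets ~0 probability.
print(masked_softmax(scores, mask, torch.tensor(1e10)))
print(masked_softmax(scores, mask, torch.tensor(1e4)))

# fp16 cannot represent 1e10, so the constant itself overflows to inf.
print(torch.tensor(1e10, dtype=torch.float16))  # tensor(inf, dtype=torch.float16)
print(torch.tensor(1e4, dtype=torch.float16))   # tensor(10000., dtype=torch.float16)

# With an inf fill value, inf * 0 = NaN poisons even the positions the mask kept;
# 1e4 stays finite and the masked slot simply sits at about -1e4.
h_scores, h_mask = scores.half(), mask.half()
big16 = torch.tensor(1e10, dtype=torch.float16)
small16 = torch.tensor(1e4, dtype=torch.float16)
print(h_scores * h_mask - big16 * (1 - h_mask))    # contains NaN
print(h_scores * h_mask - small16 * (1 - h_mask))  # finite
```

This is also consistent with the observation in the description that generation is unchanged: in fp32, subtracting 1e4 or 1e10 from a masked logit makes no practical difference after the softmax.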
thomwolf authored
thomwolf authored
- 15 Apr, 2019 9 commits
- 12 Apr, 2019 2 commits
Martin Boyanov authored
thomwolf authored
- 11 Apr, 2019 5 commits
- 03 Apr, 2019 7 commits
lukovnikov authored
lukovnikov authored
lukovnikov authored
lukovnikov authored
lukovnikov authored
thomwolf authored
thomwolf authored
- 01 Apr, 2019 1 commit
Mike Arpaia authored
- 27 Mar, 2019 1 commit
Catalin Voss authored
- 26 Mar, 2019 1 commit
Ikuya Yamada authored