1. 15 Jun, 2019 1 commit
  2. 14 Jun, 2019 2 commits
  3. 03 Jun, 2019 3 commits
  4. 01 Jun, 2019 3 commits
  5. 31 May, 2019 1 commit
  6. 30 May, 2019 1 commit
  7. 08 May, 2019 3 commits
  8. 07 May, 2019 3 commits
  9. 05 May, 2019 1 commit
  10. 02 May, 2019 1 commit
  11. 01 May, 2019 2 commits
  12. 30 Apr, 2019 3 commits
  13. 27 Apr, 2019 1 commit
  14. 25 Apr, 2019 2 commits
  15. 24 Apr, 2019 1 commit
  16. 21 Apr, 2019 2 commits
  17. 17 Apr, 2019 8 commits
  18. 16 Apr, 2019 2 commits
    • Abhi Sharma's avatar
      Fix gradient overflow issue during attention mask · 9e666aaa
      Abhi Sharma authored
      This fix is in reference to issue #382. GPT2 can now be trained in mixed precision, which I've confirmed with testing. I also tested unconditional generation on multiple seeds before and after changing 1e10 to 1e4 and there was no difference. Please let me know if there is anything else I can do to make this pull request better. Thanks for all your work!
      9e666aaa
    • thomwolf's avatar
      updating GPT tokenization · bdaba189
      thomwolf authored
      bdaba189