  1. 23 Feb, 2020 1 commit
  2. 22 Feb, 2020 1 commit
  3. 20 Feb, 2020 2 commits
  4. 19 Feb, 2020 3 commits
  5. 13 Feb, 2020 1 commit
    • Preserve spaces in GPT-2 tokenizers (#2778) · f1e8a51f
      Joe Davison authored
      * Preserve spaces in GPT-2 tokenizers
      
      Preserves spaces after special tokens in GPT-2 and inherited (RoBERTa)
      tokenizers, enabling correct BPE encoding. Automatically inserts a space
      in front of the first token in the encode function when adding special
      tokens.
      
      * Add tokenization preprocessing method
      
      * Add framework argument to pipeline factory
      
      Also fixes a pipeline test issue: each test input is now treated as a
      distinct sequence.
  6. 29 Jan, 2020 3 commits
  7. 16 Jan, 2020 3 commits
  8. 15 Jan, 2020 2 commits
  9. 14 Jan, 2020 2 commits
  10. 09 Jan, 2020 1 commit
  11. 08 Jan, 2020 1 commit
  12. 07 Jan, 2020 1 commit
  13. 06 Jan, 2020 2 commits
  14. 05 Jan, 2020 1 commit
  15. 26 Dec, 2019 4 commits
  16. 25 Dec, 2019 2 commits
  17. 24 Dec, 2019 2 commits
  18. 23 Dec, 2019 1 commit
  19. 22 Dec, 2019 7 commits
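
The f1e8a51f commit message above says that a space is inserted in front of the first token so that BPE encoding comes out right. The sketch below illustrates why that matters. It is not the code from that commit: `PRETOKENIZE` is a simplified stand-in for GPT-2's pre-tokenization pattern (the real one also covers contractions and Unicode classes), and `pretokenize`, `mark_spaces`, and the `add_prefix_space` parameter as used here are hypothetical names chosen for illustration.

```python
import re
from typing import List

# Assumption: a simplified approximation of GPT-2's pre-tokenization regex.
# A leading space is kept attached to the word that follows it.
PRETOKENIZE = re.compile(r" ?\w+| ?[^\w\s]+|\s+")


def pretokenize(text: str, add_prefix_space: bool = False) -> List[str]:
    """Split text into chunks; optionally prepend a space to the input."""
    if add_prefix_space and not text.startswith(" "):
        text = " " + text
    return PRETOKENIZE.findall(text)


def mark_spaces(chunks: List[str]) -> List[str]:
    """Show spaces as 'Ġ', the marker GPT-2's byte-level vocabulary uses."""
    return [chunk.replace(" ", "Ġ") for chunk in chunks]


# Without a prefix space, the first word has a different form from the
# same word appearing mid-sentence.
print(mark_spaces(pretokenize("Hello world")))
# → ['Hello', 'Ġworld']

# With a prefix space, every word carries the leading-space marker.
print(mark_spaces(pretokenize("Hello world", add_prefix_space=True)))
# → ['ĠHello', 'Ġworld']
```

Because the space is part of the chunk handed to BPE, `Hello` and ` Hello` can map to different vocabulary entries, which is presumably why the commit normalizes the first token.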