Commits · c9837a0d27632d3b4f8c27204e3f51e60c3abafc · chenpangpang / transformers

13 Feb, 2021 1 commit

Conversion from slow to fast for BPE spm vocabs contained an error. (#10120) · c9837a0d

Nicolas Patry authored Feb 13, 2021

* Conversion from slow to fast for BPE spm vocabs contained an error.

- Only one test (tokenizers + slow) currently exercises the modified path:
reformer, which does not modify any ids, so the bug stayed silent
until now.
- The real issue is that the `vocab` variable was overwritten by
SentencePieceExtractor, so slow-specific vocab oddities were
completely ignored.
- The bug was reported here: https://github.com/huggingface/transformers/issues/9518
- Ran the complete tokenization test suite, slow tests included, without error
(`RUN_SLOW=1 pytest -sv tests/test_tokenization_*`).

* Remove rebase error.

* Adding the fixture.
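
The variable-overwrite problem described in the message above can be sketched as follows. This is a minimal illustration, not the actual converter code: `extract_from_spm`, `convert_buggy`, `convert_fixed`, and the sample vocab are all hypothetical names and data invented for the example.

```python
def extract_from_spm(pieces):
    """Hypothetical stand-in for an extractor: rebuilds a vocab from raw pieces."""
    return {piece: idx for idx, piece in enumerate(pieces)}


def convert_buggy(slow_vocab, pieces):
    # Start from the slow tokenizer's vocab, including its id adjustments.
    vocab = dict(slow_vocab)
    # Bug: reusing the name `vocab` discards the slow-specific ids.
    vocab = extract_from_spm(pieces)
    return vocab


def convert_fixed(slow_vocab, pieces):
    vocab = dict(slow_vocab)
    # Keep the extractor output under a separate name so `vocab` survives.
    _extracted = extract_from_spm(pieces)
    return vocab


# Hypothetical data: the slow vocab shifts ids by one to make room for <pad>.
pieces = ["<unk>", "_he", "llo"]
slow_vocab = {"<pad>": 0, "<unk>": 1, "_he": 2, "llo": 3}

buggy = convert_buggy(slow_vocab, pieces)
fixed = convert_fixed(slow_vocab, pieces)
```

With a vocab that does not modify ids (as the message says of reformer), both functions would return the same mapping, which is how a bug of this shape can stay silent.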

09 Oct, 2020 1 commit
- [pegasus] Faster tokenizer tests (#7672) · b0f05e0c
  Stas Bekman authored Oct 09, 2020
06 Jan, 2020 2 commits
- GPU text generation: Moved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
- Moved the encoded_prompts to correct device · dd4df80f
  alberduris authored Dec 31, 2019
22 Dec, 2019 1 commit
- Move tests outside of library. · 067395d5
  Aymeric Augustin authored Dec 22, 2019
26 Sep, 2019 1 commit
- [BIG] pytorch-transformers => transformers · 31c23bd5
  thomwolf authored Sep 26, 2019
05 Jul, 2019 1 commit
- [BIG] name change · 0bab55d5
  thomwolf authored Jul 05, 2019
02 Jul, 2019 1 commit
- [LARGE] updating all tests and API · 1484d67d
  thomwolf authored Jul 02, 2019
21 Jun, 2019 1 commit
- add tokenizer and tests · 32da7548
  thomwolf authored Jun 21, 2019