• SaulLu's avatar
    update the arguments `add_prefix_space` and `trim_offsets` in... · c94c1b89
    SaulLu authored
    update the arguments `add_prefix_space` and `trim_offsets` in `backend_tokenizer.post_processor` of `RobertaTokenizerFast` (#14752)
    
    * add tests
    
    * change post-processor, pre-tokenizer and decoder (can't update decoder)
    
    * update test (remove decoder which doesn't depend on trim and add_prefix)
    
    * just update the post_processor
    
    * fix change
    
    * `trim_offsets` has no influence on `pre_tokenizer`
    
    * remove a test that need some input from the `tokenizers` lib maintainers
    
    * format
    
    * add new test offsets roberta
    
    * polish comments
    c94c1b89
test_tokenization_roberta.py 14.2 KB