    [`LlamaTokenizerFast`] Refactor default llama (#28881) · e34da3ee
    Arthur authored
    * push legacy to fast as well
    
    * super strange
    
    * Update src/transformers/convert_slow_tokenizer.py
    
    * make sure we are BC
    
    * fix Llama test
    
    * nit
    
    * revert
    
    * more test
    
    * style
    
    * update
    
    * small update w.r.t tokenizers
    
    * nit
    
    * don't split
    
    * lol
    
    * add a test for `add_prefix_space=False`
    
    * fix gemma tokenizer as well
    
    * update
    
    * fix gemma
    
    * nicer failures
    
    * fixup
    
    * update
    
    * fix the example for legacy = False
    
    * use `huggyllama/llama-7b` for the PR doctest
    
    * nit
    
    * use from_slow
    
    * fix llama
test_tokenization_llama.py 38.3 KB