1. 26 Apr, 2024 2 commits
  2. 25 Apr, 2024 18 commits
  3. 24 Apr, 2024 19 commits
  4. 23 Apr, 2024 1 commit
    • [`LlamaTokenizerFast`] Refactor default llama (#28881) · e34da3ee
      Arthur authored
      * push legacy to fast as well
      
      * super strange
      
      * Update src/transformers/convert_slow_tokenizer.py
      
      * make sure we are BC
      
      * fix Llama test
      
      * nit
      
      * revert
      
      * more test
      
      * style
      
      * update
      
      * small update w.r.t tokenizers
      
      * nit
      
      * don't split
      
      * lol
      
      * add a test for `add_prefix_space=False`
      
      * fix gemma tokenizer as well
      
      * update
      
      * fix gemma
      
      * nicer failures
      
      * fixup
      
      * update
      
      * fix the example for legacy = False
      
      * use `huggyllama/llama-7b` for the PR doctest
      
      * nit
      
      * use from_slow
      
      * fix llama