    [breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests... · f4e04cd2
    Thomas Wolf authored
    
    [breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests pipelines - Removing sentencepiece as a required dependency (#8073)
    
    * Fixing roberta for slow-fast tests
    
    * WIP getting equivalence on pipelines
    
    * slow-to-fast equivalence - working on question-answering pipeline
    
    * optional FAISS tests
    
    * Pipeline Q&A
    
    * Move pipeline tests to their own test job again
    
    * update tokenizer to add sequence id methods
    
    * update to tokenizers 0.9.4
    
    * set sentencepiece as optional
    
    * clean up squad
    
    * clean up pipelines to use sequence_ids
    
    * style/quality
    
    * wording
    
    * Switch to use_fast = True by default
    
    * update tests for use_fast at True by default
    
    * fix rag tokenizer test
    
    * removing protobuf from required dependencies
    
    * fix NER test for use_fast = True by default
    
    * fixing example tests (Q&A examples use slow tokenizers for now)
    
    * protobuf in main deps extras["sentencepiece"] and example deps
    
    * fix protobuf install test
    
    * try to fix seq2seq by switching to slow tokenizers for now
    
    * Update src/transformers/tokenization_utils_base.py
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    
    * Update src/transformers/tokenization_utils_base.py
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
    Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
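
    The "set sentencepiece as optional" and "removing protobuf from required dependencies" items above follow a common optional-dependency pattern. Below is a minimal sketch of that pattern, not the code from this commit: `is_package_available` and `require_package` are hypothetical helper names, and the error wording is an assumption.

```python
import importlib.util


def is_package_available(name: str) -> bool:
    """Return True if a top-level package can be located (hypothetical helper)."""
    # find_spec returns None when a top-level module cannot be found,
    # without actually importing it.
    return importlib.util.find_spec(name) is not None


def require_package(name: str, extra: str) -> None:
    """Raise a descriptive ImportError when an optional dependency is missing."""
    if not is_package_available(name):
        raise ImportError(
            f"{name} is not installed; install it via the '{extra}' extra "
            f"or with `pip install {name}`."
        )
```

    Under this sketch, code that needs a slow, sentencepiece-based tokenizer would call `require_package("sentencepiece", "sentencepiece")` at the point of use, so the package is only demanded when that code path actually runs.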
run_squad_trainer.py 6.4 KB