Unverified Commit ebfd7229 authored by Sylvain Gugger, committed by GitHub

Let inputs of fast tokenizers be tuples as well as lists (#19898)



* Let inputs of fast tokenizers be tuples as well as lists

* Update src/transformers/tokenization_utils_fast.py
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Style
Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>
parent 6c24443f
@@ -412,8 +412,10 @@ class PreTrainedTokenizerFast(PreTrainedTokenizerBase):
         verbose: bool = True,
     ) -> BatchEncoding:
-        if not isinstance(batch_text_or_text_pairs, list):
-            raise TypeError(f"batch_text_or_text_pairs has to be a list (got {type(batch_text_or_text_pairs)})")
+        if not isinstance(batch_text_or_text_pairs, (tuple, list)):
+            raise TypeError(
+                f"batch_text_or_text_pairs has to be a list or a tuple (got {type(batch_text_or_text_pairs)})"
+            )
         # Set the truncation and padding strategy and restore the initial configuration
         self.set_truncation_and_padding(
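A minimal usage sketch of what this change enables: batched input to a fast tokenizer may now be a tuple of texts, where previously only a list passed the type check in `_batch_encode_plus`. The checkpoint name below is just an illustrative choice, not part of this commit.

```python
from transformers import AutoTokenizer

# Any fast tokenizer works; bert-base-uncased is only an example checkpoint.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased", use_fast=True)

# Before this change, passing a tuple here raised:
#   TypeError: batch_text_or_text_pairs has to be a list (got <class 'tuple'>)
# After the change, tuples are accepted just like lists.
batch = ("Hello world", "Fast tokenizers accept tuples now")
encoding = tokenizer(batch, padding=True, truncation=True)

print(encoding["input_ids"])
```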