Unverified Commit a13c8145 authored by Omar Sanseviero's avatar Omar Sanseviero Committed by GitHub
Browse files

Fix docstring of train_new_from_iterator

parent 86a15472
......@@ -587,7 +587,7 @@ class PreTrainedTokenizerFast(PreTrainedTokenizerBase):
text_iterator (generator of :obj:`List[str]`):
The training corpus. Should be a generator of batches of texts, for instance a list of lists of texts
if you have everything in memory.
vocab_size (obj:`int`):
vocab_size (:obj:`int`):
The size of the vocabulary you want for your tokenizer.
new_special_tokens (list of :obj:`str` or :obj:`AddedToken`, `optional`):
A list of new special tokens to add to the tokenizer you are training.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment