Unverified commit 641adca5 authored by Victor Geislinger, committed by GitHub

Fix typo: Roberta -> RoBERTa (#25302)

parent 33da2db5
@@ -141,7 +141,7 @@ on.
 Byte-Pair Encoding (BPE) was introduced in [Neural Machine Translation of Rare Words with Subword Units (Sennrich et
 al., 2015)](https://arxiv.org/abs/1508.07909). BPE relies on a pre-tokenizer that splits the training data into
-words. Pretokenization can be as simple as space tokenization, e.g. [GPT-2](model_doc/gpt2), [Roberta](model_doc/roberta). More advanced pre-tokenization include rule-based tokenization, e.g. [XLM](model_doc/xlm),
+words. Pretokenization can be as simple as space tokenization, e.g. [GPT-2](model_doc/gpt2), [RoBERTa](model_doc/roberta). More advanced pre-tokenization include rule-based tokenization, e.g. [XLM](model_doc/xlm),
 [FlauBERT](model_doc/flaubert) which uses Moses for most languages, or [GPT](model_doc/gpt) which uses
 Spacy and ftfy, to count the frequency of each word in the training corpus.
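As an aside on the passage this diff touches: the space-based pre-tokenization and word-frequency counting it describes can be sketched in a few lines. The snippet below is an illustrative sketch, not code from the repository, and the toy corpus string is invented.

```python
from collections import Counter

# A rough sketch of the pre-tokenization step described above:
# split the raw training text on whitespace, then count how often
# each word occurs. BPE merge rules are later learned from these
# counts. The corpus below is an invented toy example.
corpus = "low low low lower lowest new newer newest"

word_freqs = Counter(corpus.split())
print(word_freqs)
# Counter({'low': 3, 'lower': 1, 'lowest': 1, 'new': 1, 'newer': 1, 'newest': 1})
```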