Update 01-training-tokenizers.ipynb (typo issue) (#3343)
I found two grammatical errors (typos) in the explanation of the encoding properties. The original sentences:
If your was made of multiple "parts" such as (question, context), then this would be a vector with for each token the segment it belongs to
If your has been truncated into multiple subparts because of a length limit (for BERT for example the sequence length is limited to 512), this will contain all the remaining overflowing parts.
I think "input" should be inserted after the phrase "If your" in both sentences.