Unverified Commit b9b11795 authored by Xin Wen's avatar Xin Wen Committed by GitHub
Browse files

Update model_summary.rst (#5737)

Add '-' to make the reference of Transformer-XL more accurate and formal.
parent b21993b3
...@@ -195,7 +195,7 @@ tokens in the sentence, then allows the model to use the last n tokens to predic ...@@ -195,7 +195,7 @@ tokens in the sentence, then allows the model to use the last n tokens to predic
with a mask, the sentence is actually fed in the model in the right order, but instead of masking the first n tokens with a mask, the sentence is actually fed in the model in the right order, but instead of masking the first n tokens
for n+1, XLNet uses a mask that hides the previous tokens in some given permutation of 1,...,sequence length. for n+1, XLNet uses a mask that hides the previous tokens in some given permutation of 1,...,sequence length.
XLNet also uses the same recurrence mechanism as TransformerXL to build long-term dependencies. XLNet also uses the same recurrence mechanism as Transformer-XL to build long-term dependencies.
The library provides a version of the model for language modeling, token classification, sentence classification, The library provides a version of the model for language modeling, token classification, sentence classification,
multiple choice classification and question answering. multiple choice classification and question answering.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment