Commit 12515c23 authored by Angela Fan

added clarification on the newline token we model

parent a8f0b752
......@@ -8,7 +8,7 @@ The dataset can be downloaded like this:
curl https://s3.amazonaws.com/fairseq-py/data/writingPrompts.tar.gz | tar xvzf -
```
-and contains a train, test, and valid split. The dataset is described here: https://arxiv.org/abs/1805.04833, where only the first 1000 words of each story are modeled.
+and contains a train, test, and valid split. The dataset is described here: https://arxiv.org/abs/1805.04833. We model only the first 1000 words of each story, including one <newLine> token.
Example usage:
......
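
For readers who want to reproduce the 1000-word cutoff mentioned in the changed line, here is a minimal sketch. It assumes one story per line and whitespace-separated tokens, so a `<newLine>` marker counts as a single word. The helper names `truncate_story` and `truncate_file` are hypothetical, and the files to run them on are left to the caller, since the commit does not name any.

```python
from pathlib import Path


def truncate_story(story: str, max_words: int = 1000) -> str:
    """Keep only the first `max_words` whitespace-separated tokens.

    Assumption: tokens are split on whitespace, so "<newLine>" is one token.
    """
    return " ".join(story.split()[:max_words])


def truncate_file(src: Path, dst: Path, max_words: int = 1000) -> None:
    """Write a copy of `src` to `dst` with every line truncated.

    Assumption: each line of `src` holds exactly one story.
    """
    with src.open(encoding="utf-8") as fin, dst.open("w", encoding="utf-8") as fout:
        for line in fin:
            fout.write(truncate_story(line, max_words) + "\n")
```

Stories shorter than the limit pass through unchanged apart from whitespace normalization.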