".github/vscode:/vscode.git/clone" did not exist on "b7b7b5d79a4630840ed32fef86f6d6e2d906f2b0"
Commit f41088a5 authored by John Pope's avatar John Pope Committed by Facebook Github Bot
Browse files

match examples/stories/writingPrompts scripts to correct folder

Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/290

Differential Revision: D12876759

Pulled By: myleott

fbshipit-source-id: 9f6d1c9de27dad29368a7edb923dfcf770355938
parent 672977c1
......@@ -5,6 +5,7 @@ The following commands provide an example of pre-processing data, training a mod
The dataset can be downloaded like this:
```
cd examples/stories
curl https://s3.amazonaws.com/fairseq-py/data/writingPrompts.tar.gz | tar xvzf -
```
......@@ -27,7 +28,7 @@ $ for line in stories:
$ o.write(line.strip() + "\n")
# Binarize the dataset:
$ TEXT=examples/stories/writingPrompts
$ export TEXT=examples/stories/writingPrompts
$ python preprocess.py --source-lang wp_source --target-lang wp_target \
--trainpref $TEXT/train --validpref $TEXT/valid --testpref $TEXT/test \
--destdir data-bin/writingPrompts --padding-factor 1 --thresholdtgt 10 --thresholdsrc 10
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment