specify in readme that both datasets are required

Commit 0d81fc85 authored Oct 15, 2019 by Rémi Louf

Showing with 7 additions and 3 deletions

examples/README.md +7 -3

--- a/examples/README.md
+++ b/examples/README.md
@@ -395,13 +395,17 @@ This fine-tuned model is available as a checkpoint under the reference

 Based on the script [`run_seq2seq_finetuning.py`](https://github.com/huggingface/transformers/blob/master/examples/run_seq2seq_finetuning.py).

-Before running this script you should download **both** CNN and Daily Mail datasets (the links next to "Stories") from [Kyunghyun Cho's website](https://cs.nyu.edu/~kcho/DMQA/) in the same folder. Then uncompress the archives by running:
+Before running this script you should download **both** CNN and Daily Mail
+datasets from [Kyunghyun Cho's website](https://cs.nyu.edu/~kcho/DMQA/) (the
+links next to "Stories") into the same folder. Then uncompress the archives by running:

 ```bash
 tar -xvf cnn_stories.tgz && tar -xvf dailymail_stories.tgz
 ```

-We will refer as `$DATA_PATH` the path to where you uncompressed both archive.
+Note that the finetuning script **will not work** if you do not download both
+datasets. We will refer to the path where you uncompressed both archives as
+`$DATA_PATH`.

 ## Bert2Bert and abstractive summarization

@@ -414,4 +418,4 @@ python run_seq2seq_finetuning.py \
    --model_name_or_path=bert2bert \
    --do_train \
    --data_path=$DATA_PATH \
-```
\ No newline at end of file
+```
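
Since the README change above stresses that the finetuning script needs both datasets, a quick pre-flight check of `$DATA_PATH` may help catch a missing archive before training starts. The following is a minimal sketch, not part of the repository: the helper name `missing_datasets` is hypothetical, and it assumes each archive unpacks into a `<name>/stories` directory containing `.story` files, which should be verified against the actual extracted layout.

```python
from pathlib import Path

# Assumption: cnn_stories.tgz and dailymail_stories.tgz unpack into
# "cnn/stories" and "dailymail/stories" under the data path.
EXPECTED_DATASETS = ("cnn", "dailymail")


def missing_datasets(data_path):
    """Return the expected dataset names that have no story files under data_path."""
    root = Path(data_path)
    missing = []
    for name in EXPECTED_DATASETS:
        stories_dir = root / name / "stories"
        # A dataset counts as present only if its directory exists
        # and holds at least one .story file.
        if not stories_dir.is_dir() or not any(stories_dir.glob("*.story")):
            missing.append(name)
    return missing
```

Calling `missing_datasets(os.environ["DATA_PATH"])` before launching `run_seq2seq_finetuning.py` would return an empty list when both datasets are in place, and the names of the absent ones otherwise.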