Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
297233fa
Unverified
Commit
297233fa
authored
Oct 08, 2020
by
Sam Shleifer
Committed by
GitHub
Oct 08, 2020
Browse files
[s2s] Switch README urls to cdn (#7670)
parent
a1ecc90d
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
5 additions
and
5 deletions
+5
-5
examples/seq2seq/README.md
examples/seq2seq/README.md
+4
-4
examples/seq2seq/finetune_bart_tiny.sh
examples/seq2seq/finetune_bart_tiny.sh
+1
-1
No files found.
examples/seq2seq/README.md
View file @
297233fa
...
@@ -19,7 +19,7 @@ For `bertabs` instructions, see [`bertabs/README.md`](bertabs/README.md).
...
@@ -19,7 +19,7 @@ For `bertabs` instructions, see [`bertabs/README.md`](bertabs/README.md).
#### XSUM:
#### XSUM:
```
bash
```
bash
cd
examples/seq2seq
cd
examples/seq2seq
wget https://
s3.amazonaws.com/
datasets.huggingface.co/summarization/xsum.tar.gz
wget https://
cdn-
datasets.huggingface.co/summarization/xsum.tar.gz
tar
-xzvf
xsum.tar.gz
tar
-xzvf
xsum.tar.gz
export
XSUM_DIR
=
${
PWD
}
/xsum
export
XSUM_DIR
=
${
PWD
}
/xsum
```
```
...
@@ -29,7 +29,7 @@ To use your own data, copy that files format. Each article to be summarized is o
...
@@ -29,7 +29,7 @@ To use your own data, copy that files format. Each article to be summarized is o
#### CNN/DailyMail
#### CNN/DailyMail
```
bash
```
bash
cd
examples/seq2seq
cd
examples/seq2seq
wget https://
s3.amazonaws.com/
datasets.huggingface.co/summarization/cnn_dm_v2.tgz
wget https://
cdn-
datasets.huggingface.co/summarization/cnn_dm_v2.tgz
tar
-xzvf
cnn_dm_v2.tgz
# empty lines removed
tar
-xzvf
cnn_dm_v2.tgz
# empty lines removed
mv
cnn_cln cnn_dm
mv
cnn_cln cnn_dm
export
CNN_DIR
=
${
PWD
}
/cnn_dm
export
CNN_DIR
=
${
PWD
}
/cnn_dm
...
@@ -39,7 +39,7 @@ this should make a directory called `cnn_dm/` with 6 files.
...
@@ -39,7 +39,7 @@ this should make a directory called `cnn_dm/` with 6 files.
#### WMT16 English-Romanian Translation Data:
#### WMT16 English-Romanian Translation Data:
download with this command:
download with this command:
```
bash
```
bash
wget https://
s3.amazonaws.com/
datasets.huggingface.co/translation/wmt_en_ro.tar.gz
wget https://
cdn-
datasets.huggingface.co/translation/wmt_en_ro.tar.gz
tar
-xzvf
wmt_en_ro.tar.gz
tar
-xzvf
wmt_en_ro.tar.gz
export
ENRO_DIR
=
${
PWD
}
/wmt_en_ro
export
ENRO_DIR
=
${
PWD
}
/wmt_en_ro
```
```
...
@@ -47,7 +47,7 @@ this should make a directory called `wmt_en_ro/` with 6 files.
...
@@ -47,7 +47,7 @@ this should make a directory called `wmt_en_ro/` with 6 files.
#### WMT English-German:
#### WMT English-German:
```
bash
```
bash
wget https://
s3.amazonaws.com/
datasets.huggingface.co/translation/wmt_en_de.tgz
wget https://
cdn-
datasets.huggingface.co/translation/wmt_en_de.tgz
tar
-xzvf
wmt_en_de.tgz
tar
-xzvf
wmt_en_de.tgz
export
DATA_DIR
=
${
PWD
}
/wmt_en_de
export
DATA_DIR
=
${
PWD
}
/wmt_en_de
```
```
...
...
examples/seq2seq/finetune_bart_tiny.sh
View file @
297233fa
# Script for verifying that run_bart_sum can be invoked from its directory
# Script for verifying that run_bart_sum can be invoked from its directory
# Get tiny dataset with cnn_dm format (4 examples for train, val, test)
# Get tiny dataset with cnn_dm format (4 examples for train, val, test)
wget https://
s3.amazonaws.com/
datasets.huggingface.co/summarization/cnn_tiny.tgz
wget https://
cdn-
datasets.huggingface.co/summarization/cnn_tiny.tgz
tar
-xzvf
cnn_tiny.tgz
tar
-xzvf
cnn_tiny.tgz
rm
cnn_tiny.tgz
rm
cnn_tiny.tgz
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment