Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
caf1d116
Commit
caf1d116
authored
Aug 29, 2019
by
LysandreJik
Browse files
Closing bracket in DistilBERT's token count.
parent
fe8fb10b
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
examples/distillation/scripts/token_counts.py
examples/distillation/scripts/token_counts.py
+1
-1
No files found.
examples/distillation/scripts/token_counts.py
View file @
caf1d116
...
@@ -24,7 +24,7 @@ from utils import logger
...
@@ -24,7 +24,7 @@ from utils import logger
if
__name__
==
'__main__'
:
if
__name__
==
'__main__'
:
parser
=
argparse
.
ArgumentParser
(
description
=
"Token Counts for smoothing the masking probabilities in MLM (cf XLM/word2vec)"
)
parser
=
argparse
.
ArgumentParser
(
description
=
"Token Counts for smoothing the masking probabilities in MLM (cf XLM/word2vec)"
)
parser
.
add_argument
(
"--data_file"
,
type
=
str
,
default
=
"data/dump.bert-base-uncased.pickle"
,
parser
.
add_argument
(
"--data_file"
,
type
=
str
,
default
=
"data/dump.bert-base-uncased.pickle"
,
help
=
"The binarized dataset."
help
=
"The binarized dataset."
)
parser
.
add_argument
(
"--token_counts_dump"
,
type
=
str
,
default
=
"data/token_counts.bert-base-uncased.pickle"
,
parser
.
add_argument
(
"--token_counts_dump"
,
type
=
str
,
default
=
"data/token_counts.bert-base-uncased.pickle"
,
help
=
"The dump file."
)
help
=
"The dump file."
)
parser
.
add_argument
(
"--vocab_size"
,
default
=
30522
,
type
=
int
)
parser
.
add_argument
(
"--vocab_size"
,
default
=
30522
,
type
=
int
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment