Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
dd2add9f
"...glm130b_fastertransformer.git" did not exist on "f8a481f890bd74375e57e5b4430e47696253ad96"
Commit
dd2add9f
authored
Dec 10, 2019
by
Pascal Voitot
Committed by
Lysandre Debut
Dec 13, 2019
Browse files
more tests
parent
df160af7
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
15 additions
and
1 deletion
+15
-1
transformers/tests/tokenization_bert_test.py
transformers/tests/tokenization_bert_test.py
+1
-1
transformers/tests/tokenization_gpt2_test.py
transformers/tests/tokenization_gpt2_test.py
+14
-0
No files found.
transformers/tests/tokenization_bert_test.py
View file @
dd2add9f
...
...
@@ -109,7 +109,7 @@ class BertTokenizationTest(CommonTestCases.CommonTokenizerTester):
decoded
=
tokenizer
.
decode
(
encoded
)
self
.
assertEqual
(
decoded
.
lower
(),
(
f
"[CLS]
{
input
.
lower
()
}
[SEP]"
).
lower
()
(
f
"[CLS]
{
input
}
[SEP]"
).
lower
()
)
...
...
transformers/tests/tokenization_gpt2_test.py
View file @
dd2add9f
...
...
@@ -67,6 +67,20 @@ class GPT2TokenizationTest(CommonTestCases.CommonTokenizerTester):
self
.
assertListEqual
(
tokenizer
.
convert_tokens_to_ids
(
input_tokens
),
input_bpe_tokens
)
def
test_encode_decode_with_spaces
(
self
):
tokenizer
=
self
.
get_tokenizer
()
new_toks
=
[
'[ABC]'
,
'[DEF]'
,
'GHI IHG'
]
tokenizer
.
add_tokens
(
new_toks
)
input
=
"lower newer [ABC] [DEF] newer lower [ABC] GHI IHG newer lower[DEF]"
encoded
=
tokenizer
.
encode
(
input
)
decoded
=
tokenizer
.
decode
(
encoded
)
self
.
assertEqual
(
decoded
.
lower
(),
input
.
lower
()
)
if
__name__
==
'__main__'
:
unittest
.
main
()
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment