Commit eb074af7 (Unverified), authored Sep 19, 2020 by Stas Bekman, committed by GitHub Sep 19, 2020
fsmt tiny model card + script (#7244)
parent 1d90d0f3
Showing 2 changed files with 51 additions and 0 deletions
model_cards/stas/tiny-wmt19-en-de/README.md +18 -0
scripts/fsmt/fsmt-make-tiny-model.py +33 -0
model_cards/stas/tiny-wmt19-en-de/README.md (new file, mode 0 → 100644)
---
language:
- en
- de
thumbnail:
tags:
- wmt19
- testing
license: apache-2.0
datasets:
- wmt19
metrics:
- bleu
---
# Tiny FSMT
This is a tiny model that is used in the `transformers` test suite. It doesn't do anything useful, other than testing that `FSMT` works.
scripts/fsmt/fsmt-make-tiny-model.py (new file, mode 0 → 100755)
#!/usr/bin/env python
# coding: utf-8
# this script creates a tiny model that is useful inside tests, when we just want to test that the machinery works,
# without needing to check the quality of the outcomes.
# it will then be used as "stas/tiny-wmt19-en-de"

from transformers import FSMTTokenizer, FSMTConfig, FSMTForConditionalGeneration

mname = "facebook/wmt19-en-de"

tokenizer = FSMTTokenizer.from_pretrained(mname)

# get the correct vocab sizes, etc. from the master model
config = FSMTConfig.from_pretrained(mname)
config.update(dict(
    d_model=4,
    encoder_layers=1, decoder_layers=1,
    encoder_ffn_dim=4, decoder_ffn_dim=4,
    encoder_attention_heads=1, decoder_attention_heads=1))

tiny_model = FSMTForConditionalGeneration(config)
print(f"num of params {tiny_model.num_parameters()}")

# Test it
batch = tokenizer.prepare_seq2seq_batch(["Making tiny model"])
outputs = tiny_model(**batch, return_dict=True)

print(len(outputs.logits[0]))

# Save
mname_tiny = "tiny-wmt19-en-de"
tiny_model.half()  # makes it smaller
tiny_model.save_pretrained(mname_tiny)
tokenizer.save_pretrained(mname_tiny)

# Upload
# transformers-cli upload tiny-wmt19-en-de
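
The script calls `tiny_model.half()` with the comment "makes it smaller". As a rough illustration of why that holds, the standard-library sketch below compares the per-value storage of single and half precision floats. The parameter count is a hypothetical placeholder, not the number the script prints, and the estimate ignores any file-format overhead in the saved checkpoint.

```python
import struct

# Hypothetical parameter count, for illustration only; the real value is
# whatever tiny_model.num_parameters() reports when the script runs.
num_params = 1_000

# Storage per value in standard (non-native) struct layout.
bytes_fp32 = struct.calcsize("<f")  # IEEE 754 single precision
bytes_fp16 = struct.calcsize("<e")  # IEEE 754 half precision

size_fp32 = num_params * bytes_fp32
size_fp16 = num_params * bytes_fp16

# The smaller format also stores values less precisely:
# round-trip one number through each format and compare the error.
value = 0.1
as_fp32 = struct.unpack("<f", struct.pack("<f", value))[0]
as_fp16 = struct.unpack("<e", struct.pack("<e", value))[0]

print(f"fp32: {size_fp32} bytes, fp16: {size_fp16} bytes")
print(f"round-trip error fp32: {abs(as_fp32 - value):.2e}, fp16: {abs(as_fp16 - value):.2e}")
```

For a model that exists only to exercise the machinery in tests, the precision loss is presumably acceptable, since the quality of the outputs is not being checked.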