Commit 20c3b8ca, authored Apr 27, 2020 by Manuel Romero, committed by Julien Chaumond on Apr 27, 2020

Create model card

parent b3f272ff
Showing 1 changed file with 53 additions and 0 deletions

model_cards/mrm8488/distilbert-base-multi-cased-finetuned-typo-detection/README.md (new file, mode 100644)
---
language: multilingual
thumbnail:
---

# DISTILBERT 🌎 + Typo Detection ✍❌✍✔

[distilbert-base-multilingual-cased](https://huggingface.co/distilbert-base-multilingual-cased) fine-tuned on [GitHub Typo Corpus](https://github.com/mhagiwara/github-typo-corpus) for **typo detection** (using *NER* style)

## Details of the downstream task (Typo detection as NER)

- Dataset: [GitHub Typo Corpus](https://github.com/mhagiwara/github-typo-corpus) 📚 for 15 languages

- [Fine-tune script on NER dataset provided by Huggingface](https://github.com/huggingface/transformers/blob/master/examples/run_ner.py) 🏋️‍♂️

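
To make the "typo detection as NER" framing concrete, here is a minimal sketch of how a misspelled sentence and its correction could be turned into one label per word. The `ok` and `typo` label names come from the model output shown in this card. The `label_words` and `to_conll` helpers, the whitespace tokenization, and the requirement that both sentences have the same number of words are illustrative assumptions, not the preprocessing actually used for this model.

```python
def label_words(typo_sentence, fixed_sentence):
    """Pair each word of the misspelled sentence with 'ok' or 'typo'.

    Hypothetical helper: assumes both sentences split into the same
    number of whitespace-separated words.
    """
    typo_words = typo_sentence.split()
    fixed_words = fixed_sentence.split()
    if len(typo_words) != len(fixed_words):
        raise ValueError("sentences must have the same number of words")
    return [
        (word, "ok" if word == fixed else "typo")
        for word, fixed in zip(typo_words, fixed_words)
    ]


def to_conll(labeled_words):
    """Render (word, label) pairs one per line, CoNLL style."""
    return "\n".join(f"{word} {label}" for word, label in labeled_words)


pairs = label_words("Adddd validation midelware", "Add validation middleware")
print(to_conll(pairs))
```

Each word then carries a tag, so a token-classification model can be trained on the result in the same way as on a named-entity dataset.
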
## Metrics on test set 📋

| Metric | Score |
| :-------: | :-------: |
| F1 | **93.51** |
| Precision | **96.08** |
| Recall | **91.06** |

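
As a quick sanity check on the table, and assuming the usual definition of F1 as the harmonic mean of precision and recall, the reported values can be recombined as follows. The result is close to the reported 93.51; the small gap is within what rounding of the two inputs to two decimals would explain.

```python
precision = 96.08
recall = 91.06

# F1 is the harmonic mean of precision and recall.
f1 = 2 * precision * recall / (precision + recall)
print(f"{f1:.2f}")
```
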
## Model in action 🔨

Fast usage with **pipelines** 🧪

```python
from transformers import pipeline

# Load the fine-tuned model and its tokenizer into a token-classification pipeline.
typo_checker = pipeline(
    "ner",
    model="mrm8488/distilbert-base-multi-cased-finetuned-typo-detection",
    tokenizer="mrm8488/distilbert-base-multi-cased-finetuned-typo-detection"
)

result = typo_checker("Adddd validation midelware")
# Drop the first and last predictions (presumably the special [CLS] and [SEP] tokens).
result[1:-1]

# Output:
[{'entity': 'ok', 'score': 0.7128152847290039, 'word': 'add'},
 {'entity': 'typo', 'score': 0.5388424396514893, 'word': '##dd'},
 {'entity': 'ok', 'score': 0.94792640209198, 'word': 'validation'},
 {'entity': 'typo', 'score': 0.5839331746101379, 'word': 'mid'},
 {'entity': 'ok', 'score': 0.5195121765136719, 'word': '##el'},
 {'entity': 'ok', 'score': 0.7222476601600647, 'word': '##ware'}]
```

It works 🎉! The model flags the two words we misspelled, which should read `Add` and `middleware`.

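
The pipeline reports one prediction per WordPiece, so a word such as `midelware` is spread over several entries. Here is a minimal sketch of merging the pieces back into words, using the output shown above as input. The `flag_typo_words` helper is hypothetical, and the rule that a word counts as a typo if any of its pieces is labeled `typo` is an assumption made for illustration.

```python
def flag_typo_words(predictions):
    """Merge WordPiece pieces into words and flag words containing a typo.

    A word is flagged if any of its pieces is labeled 'typo'
    (an assumed aggregation rule, not part of the model).
    """
    words = []
    for pred in predictions:
        piece = pred["word"]
        is_typo = pred["entity"] == "typo"
        if piece.startswith("##") and words:
            # Continuation piece: append it to the previous word.
            text, flagged = words[-1]
            words[-1] = (text + piece[2:], flagged or is_typo)
        else:
            words.append((piece, is_typo))
    return words


# Predictions copied from the pipeline output above.
predictions = [
    {"entity": "ok", "score": 0.7128152847290039, "word": "add"},
    {"entity": "typo", "score": 0.5388424396514893, "word": "##dd"},
    {"entity": "ok", "score": 0.94792640209198, "word": "validation"},
    {"entity": "typo", "score": 0.5839331746101379, "word": "mid"},
    {"entity": "ok", "score": 0.5195121765136719, "word": "##el"},
    {"entity": "ok", "score": 0.7222476601600647, "word": "##ware"},
]

merged = flag_typo_words(predictions)
print(merged)
```

With this rule, `adddd` and `midelware` are flagged and `validation` is not, which matches the two misspellings in the input.
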
> Created by [Manuel Romero/@mrm8488](https://twitter.com/mrm8488)
> Made with <span style="color: #e25555;">♥</span> in Spain