Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
chenpangpang
transformers
Commits
29ab4b7f
"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "0e56b37e805279ecb61670159fa8c71487214e0a"
Commit
29ab4b7f
authored
Feb 17, 2020
by
Manuel Romero
Committed by
Julien Chaumond
Feb 17, 2020
Browse files
Create README.md
parent
c88ed74c
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
58 additions
and
0 deletions
+58
-0
model_cards/mrm8488/bert-spanish-cased-finetuned-ner/README.md
..._cards/mrm8488/bert-spanish-cased-finetuned-ner/README.md
+58
-0
No files found.
model_cards/mrm8488/bert-spanish-cased-finetuned-ner/README.md
0 → 100644
View file @
29ab4b7f
---
language
:
spanish
thumbnail
:
https://i.imgur.com/jgBdimh.png
---
# Spanish BERT (BETO) + NER
This model is a fine-tuned on
[
NER-C
](
https://www.kaggle.com/nltkdata/conll-corpora
)
of the Spanish BERT cased
[
(BETO)
](
https://github.com/dccuchile/beto
)
for
**NER**
downstream task.
## Details of the downstream task (NER) - Dataset
-
[
Dataset: CONLL Corpora ES
](
https://www.kaggle.com/nltkdata/conll-corpora
)
I preprocessed the dataset and splitted it as train / dev (80/20)
| Dataset | # Examples |
| ---------------------- | ----- |
| Train | 8.7 K |
| Dev | 2.2 K |
-
[
Fine-tune on NER script
](
https://github.com/huggingface/transformers/blob/master/examples/run_ner.py
)
```
bash
!
export
NER_DIR
=
'/content/ner_dataset'
!
python /content/transformers/examples/run_ner.py
\
--model_type
bert
\
--model_name_or_path
dccuchile/bert-base-spanish-wwm-cased
\
--do_train
\
--do_eval
\
--data_dir
'/content/ner_dataset'
\
--num_train_epochs
15.0
\
--max_seq_length
384
\
--output_dir
/content/model_output
\
--save_steps
5000
\
```
## Comparison:
| Model | # score |
| :--------------------------------------------------------------------------------------------------------------: | :-------: |
| bert-base-spanish-wwm-cased (BETO) | 88.43 |
|
[
bert-spanish-cased-finetuned-ner (this one)
](
https://huggingface.co/mrm8488/bert-spanish-cased-finetuned-ner
)
|
**89.65**
|
| Best Multilingual BERT | 87.38 |
```
***** All metrics on Eval results *****
f1 = 0.8965040489828165
loss = 0.11504213575173258
precision = 0.893679858239811
recall = 0.8993461462254805
```
> Created by [Manuel Romero/@mrm8488](https://twitter.com/mrm8488)
> Made with <span style="color: #e25555;">♥</span> in Spain
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment