Update README.md

- Fix path of tokenizer - Clarify that the model is not trained on the evaluation set

Update README.md
- Fix path of tokenizer - Clarify that the model is not trained on the evaluation set
6a13448a · Manuel Romero · Julien Chaumond · e57533cc · 6a13448a
Commit 6a13448a authored Mar 10, 2020 by Manuel Romero Committed by Julien Chaumond Mar 10, 2020
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

model_cards/mrm8488/bert-multi-uncased-finetuned-xquadv1/README.md ...ds/mrm8488/bert-multi-uncased-finetuned-xquadv1/README.md +2 -2

No files found.
--- a/model_cards/mrm8488/bert-multi-uncased-finetuned-xquadv1/README.md
+++ b/model_cards/mrm8488/bert-multi-uncased-finetuned-xquadv1/README.md
@@ -65,7 +65,7 @@ Citation:

 </details>

-I used `Data augmentation techniques` to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got:
+As **XQuAD** is just an evaluation dataset, I used `Data augmentation techniques` (scraping, neural machine translation, etc) to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got:

 | Dataset     | # samples |
 | ----------- | --------- |
@@ -101,7 +101,7 @@ from transformers import pipeline
 qa_pipeline = pipeline(
    "question-answering",
    model="mrm8488/bert-multi-uncased-finetuned-xquadv1",
-    tokenizer="bert-multi-uncased-finetuned-xquadv1"
+    tokenizer="mrm8488/bert-multi-uncased-finetuned-xquadv1"
 )