"example/vscode:/vscode.git/clone" did not exist on "c50c0fb43181e66095f0e6bb06de402ac2a800a3"
Commit 6a13448a authored by Manuel Romero's avatar Manuel Romero Committed by Julien Chaumond
Browse files

Update README.md

- Fix path of tokenizer
- Clarify that the model is not trained on the evaluation set
parent e57533cc
...@@ -65,7 +65,7 @@ Citation: ...@@ -65,7 +65,7 @@ Citation:
</details> </details>
I used `Data augmentation techniques` to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got: As **XQuAD** is just an evaluation dataset, I used `Data augmentation techniques` (scraping, neural machine translation, etc) to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got:
| Dataset | # samples | | Dataset | # samples |
| ----------- | --------- | | ----------- | --------- |
...@@ -101,7 +101,7 @@ from transformers import pipeline ...@@ -101,7 +101,7 @@ from transformers import pipeline
qa_pipeline = pipeline( qa_pipeline = pipeline(
"question-answering", "question-answering",
model="mrm8488/bert-multi-uncased-finetuned-xquadv1", model="mrm8488/bert-multi-uncased-finetuned-xquadv1",
tokenizer="bert-multi-uncased-finetuned-xquadv1" tokenizer="mrm8488/bert-multi-uncased-finetuned-xquadv1"
) )
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment