Add reference to NLP (package) dataset (#5029)

* Add reference to NLP (package) dataset * Update README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>

Add reference to NLP (package) dataset (#5029)
* Add reference to NLP (package) dataset * Update README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com>
0946d120 · Manuel Romero · GitHub · edcb3ac5 · 0946d120
Unverified Commit 0946d120 authored Jun 16, 2020 by Manuel Romero Committed by GitHub Jun 16, 2020
Show whitespace changes
Inline Side-by-side

Showing with 12 additions and 5 deletions

model_cards/mrm8488/longformer-base-4096-finetuned-squadv2/README.md .../mrm8488/longformer-base-4096-finetuned-squadv2/README.md +12 -5

No files found.
--- a/model_cards/mrm8488/longformer-base-4096-finetuned-squadv2/README.md
+++ b/model_cards/mrm8488/longformer-base-4096-finetuned-squadv2/README.md
 ---
 language: english
-thumbnail:
+datasets:
+- squad_v2
 ---

 # Longformer-base-4096 fine-tuned on SQuAD v2
@@ -17,13 +18,19 @@ Longformer uses a combination of a sliding window (local) attention and global a

 ## Details of the downstream task (Q&A) - Dataset 📚 🧐 ❓

-[SQuAD v2](https://rajpurkar.github.io/SQuAD-explorer/) combines the 100,000 questions in SQuAD1.1 with over 50,000 unanswerable questions written adversarially by crowdworkers to look similar to answerable ones. To do well on SQuAD2.0, systems must not only answer questions when possible, but also determine when no answer is supported by the paragraph and abstain from answering.
-
+Dataset ID: ```squad_v2``` from  [HugginFace/NLP](https://github.com/huggingface/nlp)
 | Dataset  | Split | # samples |
 | -------- | ----- | --------- |
-| SQuAD2.0 | train | 130k      |
-| SQuAD2.0 | eval  | 12.3k     |
+| squad_v2 | train | 130319      |
+| squad_v2 | valid  | 11873     |
+
+How to load it from [nlp](https://github.com/huggingface/nlp)

+```python
+train_dataset  = nlp.load_dataset('squad_v2', split=nlp.Split.TRAIN)
+valid_dataset = nlp.load_dataset('squad_v2', split=nlp.Split.VALIDATION)
+```
+Check out more about this dataset and others in [NLP Viewer](https://huggingface.co/nlp/viewer/)


 ## Model fine-tuning 🏋️‍