Update README.md

375346e6 · Lintang Sutawika · GitHub · decf7d87 · 375346e6
Commit 375346e6 (unverified), authored Aug 09, 2023 by Lintang Sutawika; committed by GitHub Aug 09, 2023

Showing 1 changed file with 27 additions and 24 deletions

lm_eval/tasks/anli/README.md +27 -24

--- a/lm_eval/tasks/anli/README.md
+++ b/lm_eval/tasks/anli/README.md
-# [Adversarial NLI: A New Benchmark for Natural Language Understanding](https://arxiv.org/abs/1910.14599)
+# ANLI
-## Abstract
+### Paper
+Title: `Adversarial NLI: A New Benchmark for Natural Language Understanding`
+Paper Link: https://arxiv.org/abs/1910.14599
 Adversarial NLI (ANLI) is a dataset collected via an iterative, adversarial
 human-and-model-in-the-loop procedure. It consists of three rounds that progressively
 increase in difficulty and complexity, and each question-answer includes annotator-
 provided explanations.
-## Subtasks
+Homepage: https://github.com/facebookresearch/anli
-List or describe tasks defined in this folder, and their names here:
-* `anli_r1`: The data collected adversarially in the first round.
-* `anli_r2`: The data collected adversarially in the second round, after training on the previous round's data.
-* `anli_r3`: The data collected adversarially in the third round, after training on the previous multiple rounds of data.
-## Checklist
-For adding novel benchmarks/datasets to the library:
-  * [x] Is the task an existing benchmark in the literature?
-  * [x] Have you referenced the original paper that introduced the task?
-  * [ ] If yes, does the original paper provide a reference implementation? If so, have you checked against the reference implementation and documented how to run such a test?
-If other tasks on this dataset are already supported:
-* [ ] Is the "Main" variant of this task clearly denoted?
-* [x] Have you provided a short sentence in a README on what each new variant adds / evaluates?
-* [ ] Have you noted which, if any, published evaluation setups are matched by this variant?
-## Citation
+### Citation
 ```
 @inproceedings{nie-etal-2020-adversarial,
@@ -44,6 +30,23 @@ If other tasks on this dataset are already supported:
 }
 ```
-## Links
+### Subtasks
+List or describe tasks defined in this folder, and their names here:
+* `anli_r1`: The data collected adversarially in the first round.
+* `anli_r2`: The data collected adversarially in the second round, after training on the previous round's data.
+* `anli_r3`: The data collected adversarially in the third round, after training on the previous multiple rounds of data.
+### Checklist
+For adding novel benchmarks/datasets to the library:
+  * [x] Is the task an existing benchmark in the literature?
+  * [x] Have you referenced the original paper that introduced the task?
+  * [ ] If yes, does the original paper provide a reference implementation? If so, have you checked against the reference implementation and documented how to run such a test?
+If other tasks on this dataset are already supported:
+* [ ] Is the "Main" variant of this task clearly denoted?
+* [x] Have you provided a short sentence in a README on what each new variant adds / evaluates?
+* [ ] Have you noted which, if any, published evaluation setups are matched by this variant?
-Homepage: https://github.com/facebookresearch/anli