Commit 0456d543 authored by lintangsutawika

edit anli

parent ae55476a
@@ -30,19 +30,17 @@ Homepage: https://github.com/facebookresearch/anli
 }
 ```
-### Subtasks
-List or describe tasks defined in this folder, and their names here:
+### Groups and Tasks
+#### Groups
+* `anli`: Evaluates `anli_r1`, `anli_r2`, and `anli_r3`
+#### Tasks
 * `anli_r1`: The data collected adversarially in the first round.
 * `anli_r2`: The data collected adversarially in the second round, after training on the previous round's data.
 * `anli_r3`: The data collected adversarially in the third round, after training on the previous multiple rounds of data.
-### Groups
-- `multiple_choice`
-- `natural_language_inference`
-- `nli`
-- `adverserial`
 ### Checklist
......
 group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+  - anli
 task: anli_r1
 dataset_path: anli
 dataset_name: null
......
-group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+include: anli_r1.yaml
 task: anli_r2
-dataset_path: anli
-dataset_name: null
-output_type: multiple_choice
 training_split: train_r2
 validation_split: dev_r2
 test_split: test_r2
-doc_to_text: "{{premise}}\nQuestion: {{hypothesis}} True, False, or Neither?\nAnswer:"
-# True = entailment
-# False = contradiction
-# Neither = neutral
-doc_to_target: "{{['True', 'Neither', 'False'][label]}}"
-doc_to_choice:
-  - "True"
-  - "Neither"
-  - "False"
-should_decontaminate: true
-doc_to_decontamination_query: premise
-metric_list:
-  - metric: acc
-    aggregation: mean
-    higher_is_better: true
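The prompt fields in this config (`doc_to_text`, `doc_to_target`, `doc_to_choice`) can be read as follows. This is a minimal Python sketch of the mapping they describe, not the harness's own code: it uses plain string formatting instead of the template syntax shown in the config, and `render_prompt`, `render_target`, and the sample document are hypothetical names and data introduced only for illustration.

```python
# Answer strings in label order, mirroring doc_to_choice in the config.
# Per the config comments: True = entailment, Neither = neutral,
# False = contradiction, so labels 0, 1, 2 index into this list.
CHOICES = ["True", "Neither", "False"]


def render_prompt(doc: dict) -> str:
    """Build the prompt text the way the doc_to_text template describes."""
    return (
        f"{doc['premise']}\n"
        f"Question: {doc['hypothesis']} True, False, or Neither?\n"
        "Answer:"
    )


def render_target(doc: dict) -> str:
    """Map the integer label to its answer string, as doc_to_target does."""
    return CHOICES[doc["label"]]


# Hypothetical document, made up for illustration (not taken from ANLI).
example = {
    "premise": "The cat sat on the mat.",
    "hypothesis": "An animal is on the mat.",
    "label": 0,
}

if __name__ == "__main__":
    print(render_prompt(example))
    print(render_target(example))
```

Because these fields are identical across rounds, only the split names differ between `anli_r1`, `anli_r2`, and `anli_r3`.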
-group:
-  - multiple_choice
-  - natural_language_inference
-  - nli
-  - adverserial
+include: anli_r1.yaml
 task: anli_r3
-dataset_path: anli
-dataset_name: null
-output_type: multiple_choice
 training_split: train_r3
 validation_split: dev_r3
 test_split: test_r3
-doc_to_text: "{{premise}}\nQuestion: {{hypothesis}} True, False, or Neither?\nAnswer:"
-# True = entailment
-# False = contradiction
-# Neither = neutral
-doc_to_target: "{{['True', 'Neither', 'False'][label]}}"
-doc_to_choice:
-  - "True"
-  - "Neither"
-  - "False"
-should_decontaminate: true
-doc_to_decontamination_query: premise
-metric_list:
-  - metric: acc
-    aggregation: mean
-    higher_is_better: true
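The `include: anli_r1.yaml` lines let `anli_r2` and `anli_r3` reuse the round-one config and override only the task name and splits. The sketch below shows one way such an include could be resolved, assuming a shallow merge in which the including file's keys win. That merge rule is an assumption, not something this commit confirms, and `resolve_include` and the abbreviated `CONFIGS` dictionary are hypothetical stand-ins for the real YAML files.

```python
def resolve_include(child: dict, configs: dict) -> dict:
    """Return child merged over the config it includes (child keys win)."""
    child = dict(child)  # copy so the caller's dict is left untouched
    parent_name = child.pop("include", None)
    if parent_name is None:
        return child
    # Resolve the parent first so chained includes would also work.
    base = resolve_include(configs[parent_name], configs)
    base.update(child)
    return base


# Abbreviated stand-ins for the YAML files touched by this commit.
CONFIGS = {
    "anli_r1.yaml": {
        "group": ["anli"],
        "task": "anli_r1",
        "dataset_path": "anli",
        "dataset_name": None,
    },
    "anli_r2.yaml": {
        "include": "anli_r1.yaml",
        "task": "anli_r2",
        "training_split": "train_r2",
        "validation_split": "dev_r2",
        "test_split": "test_r2",
    },
}

if __name__ == "__main__":
    merged = resolve_include(CONFIGS["anli_r2.yaml"], CONFIGS)
    print(merged["task"], merged["dataset_path"], merged["group"])
```

Under this assumption, the merged `anli_r2` config keeps `group` and `dataset_path` from round one while its own `task` and split names take precedence.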