Commit 97e3c9fe authored by lintangsutawika's avatar lintangsutawika
Browse files

added lama primed and negated dataset

parent 7852985b
# LAMA Primed Negated
### Paper
Title:
Abstract:
Homepage:
### Citation
```
```
### Groups and Tasks
#### Groups
#### Tasks
### Checklist
For adding novel benchmarks/datasets to the library:
* [ ] Is the task an existing benchmark in the literature?
* [ ] Have you referenced the original paper that introduced the task?
* [ ] If yes, does the original paper provide a reference implementation? If so, have you checked against the reference implementation and documented how to run such a test?
If other tasks on this dataset are already supported:
* [ ] Is the "Main" variant of this task clearly denoted?
* [ ] Have you provided a short sentence in a README on what each new variant adds / evaluates?
* [ ] Have you noted which, if any, published evaluation setups are matched by this variant?
group: lama_misprimed_correct_incorrect
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Correct or Incorrect. "
doc_to_text: "Question: {{masked_misprimed[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
doc_to_target: "Correct"
doc_to_choice: ["Correct", "Incorrect"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_misprimed[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Correct
- metric: acc_norm
aggregation: mean
higher_is_better: Correct
metadata:
version: 1.0
group: lama_negated_correct_incorrect
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Correct or Incorrect. "
doc_to_text: "Question: {{masked_negations[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
doc_to_target: "Incorrect"
doc_to_choice: ["Correct", "Incorrect"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_negations[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Correct
- metric: acc_norm
aggregation: mean
higher_is_better: Correct
metadata:
version: 1.0
group: lama_normal_correct_incorrect
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Correct or Incorrect. "
doc_to_text: "Question: {{masked_sentences[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
doc_to_target: "Correct"
doc_to_choice: ["Correct", "Incorrect"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_sentences[0]|replace('[MASK]', obj_label)}} Correct or Incorrect?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Correct
- metric: acc_norm
aggregation: mean
higher_is_better: Correct
metadata:
version: 1.0
include: _misprimed_correct_incorrect_template_yaml
task: lama_misprimed_conceptnet_correct_incorrect
dataset_name: ConceptNet
test_split: high_ranked
include: _misprimed_correct_incorrect_template_yaml
task: lama_misprimed_google_re_correct_incorrect
dataset_name: GoogleRE
test_split: high_ranked
include: _misprimed_correct_incorrect_template_yaml
task: lama_misprimed_squad_correct_incorrect
dataset_name: SQUAD
test_split: high_ranked
include: _misprimed_correct_incorrect_template_yaml
task: lama_misprimed_trex_correct_incorrect
dataset_name: TREx
test_split: high_ranked
include: _negated_correct_incorrect_template_yaml
task: lama_negated_conceptnet_correct_incorrect
dataset_name: ConceptNet
test_split: high_ranked
include: _negated_correct_incorrect_template_yaml
task: lama_negated_google_re_correct_incorrect
dataset_name: GoogleRE
test_split: high_ranked
include: _negated_correct_incorrect_template_yaml
task: lama_negated_squad_correct_incorrect
dataset_name: SQUAD
test_split: high_ranked
include: _negated_correct_incorrect_template_yaml
task: lama_negated_trex_correct_incorrect
dataset_name: TREx
test_split: high_ranked
include: _normal_correct_incorrect_template_yaml
task: lama_normal_conceptnet_correct_incorrect
dataset_name: ConceptNet
test_split: high_ranked
include: _normal_correct_incorrect_template_yaml
task: lama_normal_google_re_correct_incorrect
dataset_name: GoogleRE
test_split: high_ranked
include: _normal_correct_incorrect_template_yaml
task: lama_normal_squad_correct_incorrect
dataset_name: SQUAD
test_split: high_ranked
include: _normal_correct_incorrect_template_yaml
task: lama_normal_trex_correct_incorrect
dataset_name: TREx
test_split: high_ranked
group: lama_misprimed_right_wrong
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Right or Wrong. "
doc_to_text: "Question: {{masked_misprimed[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
doc_to_target: "Right"
doc_to_choice: ["Right", "Wrong"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_misprimed[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Right
- metric: acc_norm
aggregation: mean
higher_is_better: Right
metadata:
version: 1.0
group: lama_negated_right_wrong
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Right or Wrong. "
doc_to_text: "Question: {{masked_negations[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
doc_to_target: "Wrong"
doc_to_choice: ["Right", "Wrong"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_negations[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Right
- metric: acc_norm
aggregation: mean
higher_is_better: Right
metadata:
version: 1.0
group: lama_normal_right_wrong
dataset_path: lintang/lama_primed_negated
output_type: multiple_choice
# description: "Answer the following question with either Right or Wrong. "
doc_to_text: "Question: {{masked_sentences[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
doc_to_target: "Right"
doc_to_choice: ["Right", "Wrong"]
should_decontaminate: true
doc_to_decontamination_query: "Question: {{masked_sentences[0]|replace('[MASK]', obj_label)}} Right or Wrong?\n\nAnswer:"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: Right
- metric: acc_norm
aggregation: mean
higher_is_better: Right
metadata:
version: 1.0
include: _misprimed_right_wrong_template_yaml
task: lama_misprimed_conceptnet_right_wrong
dataset_name: ConceptNet
test_split: high_ranked
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment