Commit 47a7d41c authored by ManuelFay's avatar ManuelFay
Browse files

match beleble paper prompts more closely

parent 2d6bc236
...@@ -32,7 +32,9 @@ Homepage: https://github.com/facebookresearch/belebele ...@@ -32,7 +32,9 @@ Homepage: https://github.com/facebookresearch/belebele
The following tasks evaluate languages in the Belebele dataset using loglikelihood-based multiple-choice scoring: The following tasks evaluate languages in the Belebele dataset using loglikelihood-based multiple-choice scoring:
- `cmmlu_{language}` - `belebele_{language}`
The variant evaluated here is the 0-shot or few-shot evaluation with English Instructions.
### Checklist ### Checklist
......
group: belebele group: belebele
dataset_path: facebook/belebele dataset_path: facebook/belebele
description: "Choose the best answer to the question.\n"
test_split: test test_split: test
fewshot_split: test fewshot_split: test
fewshot_config: fewshot_config:
...@@ -7,7 +8,7 @@ fewshot_config: ...@@ -7,7 +8,7 @@ fewshot_config:
output_type: multiple_choice output_type: multiple_choice
should_decontaminate: true should_decontaminate: true
doc_to_decontamination_query: "{{question}}" doc_to_decontamination_query: "{{question}}"
doc_to_text: "{{question.strip()}}\nA. {{mc_answer1}}\nB. {{mc_answer2}}\nC. {{mc_answer3}}\nD. {{mc_answer4}}\nAnswer:" doc_to_text: "P: {{flores_passage}}\nQ: {{question.strip()}}\nA. {{mc_answer1}}\nB. {{mc_answer2}}\nC. {{mc_answer3}}\nD. {{mc_answer4}}\nAnswer:"
doc_to_choice: ["A", "B", "C", "D"] doc_to_choice: ["A", "B", "C", "D"]
doc_to_target: "{{['1', '2', '3', '4'].index(correct_answer_num)}}" doc_to_target: "{{['1', '2', '3', '4'].index(correct_answer_num)}}"
metric_list: metric_list:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment