Unverified Commit a2af2101 authored by Yen-Ting Lin's avatar Yen-Ting Lin Committed by GitHub
Browse files

Merge branch 'EleutherAI:main' into main

parents 82cb25c1 d5f39bf8
"dataset_name": "world_religions" dataset_name: world_religions
"description": "The following are multiple choice questions (with answers) about world\ description: The following are multiple choice questions (with answers) about world
\ religions.\n\nQ: How can the Upanishads be characterized?\n(A) Ritual texts (B)\ religions.
\ Philosophical texts (C) Hymns (D) Origin stories\nA: Let's think step by step.\ fewshot_config:
\ We refer to Wikipedia articles on world religions for help. The Upanishads are\ sampler: first_n
\ the most recent part of Vedas (the oldest scriptures in Hinduism) and supplied\ samples:
\ the basis of later Hindu philosophy. So they are philosophical texts. The answer\ - question: 'How can the Upanishads be characterized?
\ is (B).\n\nQ: What is the Second Gem in Buddhism?\n(A) The Dharma (B) The Sangha\
\ (C) The Buddha (D) The Bodhisattva\nA: Let's think step by step. We refer to Wikipedia\ (A) Ritual texts (B) Philosophical texts (C) Hymns (D) Origin stories'
\ articles on world religions for help. The Second Gem in Buddhism is The Dharma.\ target: Let's think step by step. We refer to Wikipedia articles on world religions
\ The answer is (A).\n\nQ: Which Japanese government promoted a kind of national\ for help. The Upanishads are the most recent part of Vedas (the oldest scriptures
\ cult based on the emperor and his associations with kami?\n(A) Honen (B) Tanaka\ in Hinduism) and supplied the basis of later Hindu philosophy. So they are philosophical
\ (C) Tokugawa (D) Meiji\nA: Let's think step by step. We refer to Wikipedia articles\ texts. The answer is (B).
\ on world religions for help. The promotion of a national cult based on the emperor\ - question: 'What is the Second Gem in Buddhism?
\ and his associations with Kami happened during the reign of Emperor Meiji (1852-1912).\
\ The answer is (D).\n\nQ: In which dynasty was the \"Mandate of Heaven\" developed\ (A) The Dharma (B) The Sangha (C) The Buddha (D) The Bodhisattva'
\ to legitimatize the new rulers?\n(A) Shang (B) Zhou (C) Han (D) Xia\nA: Let's\ target: Let's think step by step. We refer to Wikipedia articles on world religions
\ think step by step. We refer to Wikipedia articles on world religions for help.\ for help. The Second Gem in Buddhism is The Dharma. The answer is (A).
\ The \"Mandate of Heaven\" was developed as an ancient Chinese philosophical concept\ - question: 'Which Japanese government promoted a kind of national cult based on the
\ during the Zhou Dynasty (1046-256 BCE). The answer is (B).\n\nQ: What is the sign\ emperor and his associations with kami?
\ of the covenant for Jewish males?\n(A) The rainbow (B) Circumcision (C) A son\
\ (D) Bar mitzvah\nA: Let's think step by step. We refer to Wikipedia articles on\ (A) Honen (B) Tanaka (C) Tokugawa (D) Meiji'
\ world religions for help. In Judaism, the most distinctive sign of the covenant\ target: Let's think step by step. We refer to Wikipedia articles on world religions
\ is circumcision (brit milah). The answer is (B).\n\n" for help. The promotion of a national cult based on the emperor and his associations
"group": "mmlu_flan_cot_fewshot_humanities" with Kami happened during the reign of Emperor Meiji (1852-1912). The answer
"include": "_mmlu_flan_cot_fewshot_template_yaml" is (D).
"task": "mmlu_flan_cot_fewshot_world_religions" - question: 'In which dynasty was the "Mandate of Heaven" developed to legitimatize
the new rulers?
(A) Shang (B) Zhou (C) Han (D) Xia'
target: Let's think step by step. We refer to Wikipedia articles on world religions
for help. The "Mandate of Heaven" was developed as an ancient Chinese philosophical
concept during the Zhou Dynasty (1046-256 BCE). The answer is (B).
- question: 'What is the sign of the covenant for Jewish males?
(A) The rainbow (B) Circumcision (C) A son (D) Bar mitzvah'
target: 'Let''s think step by step. We refer to Wikipedia articles on world religions
for help. In Judaism, the most distinctive sign of the covenant is circumcision
(brit milah). The answer is (B).'
tag: mmlu_flan_cot_fewshot_humanities
include: _mmlu_flan_cot_fewshot_template_yaml
task: mmlu_flan_cot_fewshot_world_religions
group: mmlu_flan_cot_zeroshot group: mmlu_flan_cot_zeroshot
group_alias: mmlu (flan style, zeroshot cot)
task: task:
- mmlu_flan_cot_zeroshot_stem - group: stem
- mmlu_flan_cot_zeroshot_other task:
- mmlu_flan_cot_zeroshot_social_sciences - mmlu_flan_cot_zeroshot_stem
- mmlu_flan_cot_zeroshot_humanities aggregate_metric_list:
- metric: acc
weight_by_size: True
- group: other
task:
- mmlu_flan_cot_zeroshot_other
aggregate_metric_list:
- metric: acc
weight_by_size: True
- group: social sciences
task:
- mmlu_flan_cot_zeroshot_social_sciences
aggregate_metric_list:
- metric: acc
weight_by_size: True
- group: humanities
task:
- mmlu_flan_cot_zeroshot_humanities
aggregate_metric_list:
- metric: acc
weight_by_size: True
aggregate_metric_list:
- metric: acc
weight_by_size: True
metadata:
version: 1
...@@ -8,7 +8,7 @@ filter_list: ...@@ -8,7 +8,7 @@ filter_list:
- name: "strict-match" - name: "strict-match"
filter: filter:
- function: "regex" - function: "regex"
regex_pattern: "((?<=The answer is )(.*)(?=.)|(?<=the answer is )(.*)(?=.)|(?<=The answer: )(.*)(?=.)|(?<=The final answer: )(.*)(?=.))" regex_pattern: "((?<=The answer is )(.*)(?=.)|(?<=answer is )(.*)(?=.)|(?<=The answer: )(.*)(?=.)|(?<=The final answer: )(.*)(?=.))"
- function: "take_first" - function: "take_first"
- name: "flexible-extract" - name: "flexible-extract"
filter: filter:
...@@ -33,4 +33,6 @@ metric_list: ...@@ -33,4 +33,6 @@ metric_list:
ignore_case: true ignore_case: true
ignore_punctuation: true ignore_punctuation: true
metadata: metadata:
version: 1.0 version: 2.0
dataset_kwargs:
trust_remote_code: true
"dataset_name": "abstract_algebra" "dataset_name": "abstract_algebra"
"description": "The following are multiple choice questions (with answers) about abstract\ "description": "The following are multiple choice questions (with answers) about abstract\
\ algebra.\n\n" \ algebra.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_abstract_algebra" "task": "mmlu_flan_cot_zeroshot_abstract_algebra"
"dataset_name": "anatomy" "dataset_name": "anatomy"
"description": "The following are multiple choice questions (with answers) about anatomy.\n\ "description": "The following are multiple choice questions (with answers) about anatomy.\n\
\n" \n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_anatomy" "task": "mmlu_flan_cot_zeroshot_anatomy"
"dataset_name": "astronomy" "dataset_name": "astronomy"
"description": "The following are multiple choice questions (with answers) about astronomy.\n\ "description": "The following are multiple choice questions (with answers) about astronomy.\n\
\n" \n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_astronomy" "task": "mmlu_flan_cot_zeroshot_astronomy"
"dataset_name": "business_ethics" "dataset_name": "business_ethics"
"description": "The following are multiple choice questions (with answers) about business\ "description": "The following are multiple choice questions (with answers) about business\
\ ethics.\n\n" \ ethics.\n\n"
"group": "mmlu_flan_cot_zeroshot_other" "tag": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_business_ethics" "task": "mmlu_flan_cot_zeroshot_business_ethics"
"dataset_name": "clinical_knowledge" "dataset_name": "clinical_knowledge"
"description": "The following are multiple choice questions (with answers) about clinical\ "description": "The following are multiple choice questions (with answers) about clinical\
\ knowledge.\n\n" \ knowledge.\n\n"
"group": "mmlu_flan_cot_zeroshot_other" "tag": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_clinical_knowledge" "task": "mmlu_flan_cot_zeroshot_clinical_knowledge"
"dataset_name": "college_biology" "dataset_name": "college_biology"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ biology.\n\n" \ biology.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_biology" "task": "mmlu_flan_cot_zeroshot_college_biology"
"dataset_name": "college_chemistry" "dataset_name": "college_chemistry"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ chemistry.\n\n" \ chemistry.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_chemistry" "task": "mmlu_flan_cot_zeroshot_college_chemistry"
"dataset_name": "college_computer_science" "dataset_name": "college_computer_science"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ computer science.\n\n" \ computer science.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_computer_science" "task": "mmlu_flan_cot_zeroshot_college_computer_science"
"dataset_name": "college_mathematics" "dataset_name": "college_mathematics"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ mathematics.\n\n" \ mathematics.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_mathematics" "task": "mmlu_flan_cot_zeroshot_college_mathematics"
"dataset_name": "college_medicine" "dataset_name": "college_medicine"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ medicine.\n\n" \ medicine.\n\n"
"group": "mmlu_flan_cot_zeroshot_other" "tag": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_medicine" "task": "mmlu_flan_cot_zeroshot_college_medicine"
"dataset_name": "college_physics" "dataset_name": "college_physics"
"description": "The following are multiple choice questions (with answers) about college\ "description": "The following are multiple choice questions (with answers) about college\
\ physics.\n\n" \ physics.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_college_physics" "task": "mmlu_flan_cot_zeroshot_college_physics"
"dataset_name": "computer_security" "dataset_name": "computer_security"
"description": "The following are multiple choice questions (with answers) about computer\ "description": "The following are multiple choice questions (with answers) about computer\
\ security.\n\n" \ security.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_computer_security" "task": "mmlu_flan_cot_zeroshot_computer_security"
"dataset_name": "conceptual_physics" "dataset_name": "conceptual_physics"
"description": "The following are multiple choice questions (with answers) about conceptual\ "description": "The following are multiple choice questions (with answers) about conceptual\
\ physics.\n\n" \ physics.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_conceptual_physics" "task": "mmlu_flan_cot_zeroshot_conceptual_physics"
"dataset_name": "econometrics" "dataset_name": "econometrics"
"description": "The following are multiple choice questions (with answers) about econometrics.\n\ "description": "The following are multiple choice questions (with answers) about econometrics.\n\
\n" \n"
"group": "mmlu_flan_cot_zeroshot_social_sciences" "tag": "mmlu_flan_cot_zeroshot_social_sciences"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_econometrics" "task": "mmlu_flan_cot_zeroshot_econometrics"
"dataset_name": "electrical_engineering" "dataset_name": "electrical_engineering"
"description": "The following are multiple choice questions (with answers) about electrical\ "description": "The following are multiple choice questions (with answers) about electrical\
\ engineering.\n\n" \ engineering.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_electrical_engineering" "task": "mmlu_flan_cot_zeroshot_electrical_engineering"
"dataset_name": "elementary_mathematics" "dataset_name": "elementary_mathematics"
"description": "The following are multiple choice questions (with answers) about elementary\ "description": "The following are multiple choice questions (with answers) about elementary\
\ mathematics.\n\n" \ mathematics.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem" "tag": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_elementary_mathematics" "task": "mmlu_flan_cot_zeroshot_elementary_mathematics"
"dataset_name": "formal_logic" "dataset_name": "formal_logic"
"description": "The following are multiple choice questions (with answers) about formal\ "description": "The following are multiple choice questions (with answers) about formal\
\ logic.\n\n" \ logic.\n\n"
"group": "mmlu_flan_cot_zeroshot_humanities" "tag": "mmlu_flan_cot_zeroshot_humanities"
"include": "_mmlu_flan_cot_zeroshot_template_yaml" "include": "_mmlu_flan_cot_zeroshot_template_yaml"
"task": "mmlu_flan_cot_zeroshot_formal_logic" "task": "mmlu_flan_cot_zeroshot_formal_logic"
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment