Unverified Commit 815f59e6 authored by Lintang Sutawika's avatar Lintang Sutawika Committed by GitHub
Browse files

Merge pull request #922 from EleutherAI/mmlu_subgroups

[Refactor] Mmlu subgroups and weight avg
parents 3533e4b9 44124d95
dataset_name: logical_fallacies "dataset_name": "logical_fallacies"
description: 'The following are multiple choice questions (with answers) about logical "description": "The following are multiple choice questions (with answers) about logical\
fallacies. \ fallacies.\n\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_logical_fallacies"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_logical_fallacies
dataset_name: machine_learning "dataset_name": "machine_learning"
description: 'The following are multiple choice questions (with answers) about machine "description": "The following are multiple choice questions (with answers) about machine\
learning. \ learning.\n\n"
"group": "mmlu_flan_cot_zeroshot_stem"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_machine_learning"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_machine_learning
dataset_name: management "dataset_name": "management"
description: 'The following are multiple choice questions (with answers) about management. "description": "The following are multiple choice questions (with answers) about management.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_other"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_management"
task: mmlu_flan_cot_zeroshot_management
dataset_name: marketing "dataset_name": "marketing"
description: 'The following are multiple choice questions (with answers) about marketing. "description": "The following are multiple choice questions (with answers) about marketing.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_other"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_marketing"
task: mmlu_flan_cot_zeroshot_marketing
dataset_name: medical_genetics "dataset_name": "medical_genetics"
description: 'The following are multiple choice questions (with answers) about medical "description": "The following are multiple choice questions (with answers) about medical\
genetics. \ genetics.\n\n"
"group": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_medical_genetics"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_medical_genetics
dataset_name: miscellaneous "dataset_name": "miscellaneous"
description: 'The following are multiple choice questions (with answers) about miscellaneous. "description": "The following are multiple choice questions (with answers) about miscellaneous.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_other"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_miscellaneous"
task: mmlu_flan_cot_zeroshot_miscellaneous
dataset_name: moral_disputes "dataset_name": "moral_disputes"
description: 'The following are multiple choice questions (with answers) about moral "description": "The following are multiple choice questions (with answers) about moral\
disputes. \ disputes.\n\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_moral_disputes"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_moral_disputes
dataset_name: moral_scenarios "dataset_name": "moral_scenarios"
description: 'The following are multiple choice questions (with answers) about moral "description": "The following are multiple choice questions (with answers) about moral\
scenarios. \ scenarios.\n\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_moral_scenarios"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_moral_scenarios
dataset_name: nutrition "dataset_name": "nutrition"
description: 'The following are multiple choice questions (with answers) about nutrition. "description": "The following are multiple choice questions (with answers) about nutrition.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_other"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_nutrition"
task: mmlu_flan_cot_zeroshot_nutrition
dataset_name: philosophy "dataset_name": "philosophy"
description: 'The following are multiple choice questions (with answers) about philosophy. "description": "The following are multiple choice questions (with answers) about philosophy.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_philosophy"
task: mmlu_flan_cot_zeroshot_philosophy
dataset_name: prehistory "dataset_name": "prehistory"
description: 'The following are multiple choice questions (with answers) about prehistory. "description": "The following are multiple choice questions (with answers) about prehistory.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_prehistory"
task: mmlu_flan_cot_zeroshot_prehistory
dataset_name: professional_accounting "dataset_name": "professional_accounting"
description: 'The following are multiple choice questions (with answers) about professional "description": "The following are multiple choice questions (with answers) about professional\
accounting. \ accounting.\n\n"
"group": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_professional_accounting"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_professional_accounting
dataset_name: professional_law "dataset_name": "professional_law"
description: 'The following are multiple choice questions (with answers) about professional "description": "The following are multiple choice questions (with answers) about professional\
law. \ law.\n\n"
"group": "mmlu_flan_cot_zeroshot_humanities"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_professional_law"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_professional_law
dataset_name: professional_medicine "dataset_name": "professional_medicine"
description: 'The following are multiple choice questions (with answers) about professional "description": "The following are multiple choice questions (with answers) about professional\
medicine. \ medicine.\n\n"
"group": "mmlu_flan_cot_zeroshot_other"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_professional_medicine"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_professional_medicine
dataset_name: professional_psychology "dataset_name": "professional_psychology"
description: 'The following are multiple choice questions (with answers) about professional "description": "The following are multiple choice questions (with answers) about professional\
psychology. \ psychology.\n\n"
"group": "mmlu_flan_cot_zeroshot_social_sciences"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_professional_psychology"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_professional_psychology
dataset_name: public_relations "dataset_name": "public_relations"
description: 'The following are multiple choice questions (with answers) about public "description": "The following are multiple choice questions (with answers) about public\
relations. \ relations.\n\n"
"group": "mmlu_flan_cot_zeroshot_social_sciences"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_public_relations"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_public_relations
dataset_name: security_studies "dataset_name": "security_studies"
description: 'The following are multiple choice questions (with answers) about security "description": "The following are multiple choice questions (with answers) about security\
studies. \ studies.\n\n"
"group": "mmlu_flan_cot_zeroshot_social_sciences"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_security_studies"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_security_studies
dataset_name: sociology "dataset_name": "sociology"
description: 'The following are multiple choice questions (with answers) about sociology. "description": "The following are multiple choice questions (with answers) about sociology.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_social_sciences"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_sociology"
task: mmlu_flan_cot_zeroshot_sociology
dataset_name: us_foreign_policy "dataset_name": "us_foreign_policy"
description: 'The following are multiple choice questions (with answers) about us "description": "The following are multiple choice questions (with answers) about us\
foreign policy. \ foreign policy.\n\n"
"group": "mmlu_flan_cot_zeroshot_social_sciences"
"include": "_mmlu_flan_cot_zeroshot_template_yaml"
' "task": "mmlu_flan_cot_zeroshot_us_foreign_policy"
include: _mmlu_flan_generative_template_yaml
task: mmlu_flan_cot_zeroshot_us_foreign_policy
dataset_name: virology "dataset_name": "virology"
description: 'The following are multiple choice questions (with answers) about virology. "description": "The following are multiple choice questions (with answers) about virology.\n\
\n"
"group": "mmlu_flan_cot_zeroshot_other"
' "include": "_mmlu_flan_cot_zeroshot_template_yaml"
include: _mmlu_flan_generative_template_yaml "task": "mmlu_flan_cot_zeroshot_virology"
task: mmlu_flan_cot_zeroshot_virology
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment