Commit fec9dde7 authored by Luis Cosio, committed by GitHub

feat: Add mmlu-redux and its Spanish translation as generative task definitions (#2705)



* Added benchmark

* Added more testing

* Added task definition for mmlu_redux and mmlu_redux_spanish

* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs

* Add remaining MMLU Redux YAMLs and updated tasks README

* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs

* Add MMLU Redux changes from pr-2705

* Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names

* Enhance retry logic to prevent 429 error when using Hugging Face API for tests, apply pre-commit fixes

* Revert Python test changes and comment out one task group to avoid Hugging Face rate limit and task failure

---------
Co-authored-by: CT-6282 <ricardo.godric@hotmail.com>
parent 368275f3
"dataset_name": "logical_fallacies"
"description":
"The following are multiple choice questions (with answers) about logical\
\ fallacies.\n\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_logical_fallacies_generative_spanish"
"task_alias": "logical_fallacies_spanish"
"dataset_name": "machine_learning"
"description":
"The following are multiple choice questions (with answers) about machine\
\ learning.\n\n"
"tag": "mmlu_stem_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_machine_learning_generative_spanish"
"task_alias": "machine_learning_spanish"
"dataset_name": "management"
"description":
"The following are multiple choice questions (with answers) about management.\n\
\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_management_generative_spanish"
"task_alias": "management_spanish"
"dataset_name": "marketing"
"description":
"The following are multiple choice questions (with answers) about marketing.\n\
\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_marketing_generative_spanish"
"task_alias": "marketing_spanish"
"dataset_name": "medical_genetics"
"description":
"The following are multiple choice questions (with answers) about medical\
\ genetics.\n\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_medical_genetics_generative_spanish"
"task_alias": "medical_genetics_spanish"
"dataset_name": "miscellaneous"
"description":
"The following are multiple choice questions (with answers) about miscellaneous.\n\
\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_miscellaneous_generative_spanish"
"task_alias": "miscellaneous_spanish"
"dataset_name": "moral_disputes"
"description":
"The following are multiple choice questions (with answers) about moral\
\ disputes.\n\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_moral_disputes_generative_spanish"
"task_alias": "moral_disputes_spanish"
"dataset_name": "moral_scenarios"
"description":
"The following are multiple choice questions (with answers) about moral\
\ scenarios.\n\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_moral_scenarios_generative_spanish"
"task_alias": "moral_scenarios_spanish"
"dataset_name": "nutrition"
"description":
"The following are multiple choice questions (with answers) about nutrition.\n\
\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_nutrition_generative_spanish"
"task_alias": "nutrition_spanish"
"dataset_name": "philosophy"
"description":
"The following are multiple choice questions (with answers) about philosophy.\n\
\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_philosophy_generative_spanish"
"task_alias": "philosophy_spanish"
"dataset_name": "prehistory"
"description":
"The following are multiple choice questions (with answers) about prehistory.\n\
\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_prehistory_generative_spanish"
"task_alias": "prehistory_spanish"
"dataset_name": "professional_accounting"
"description":
"The following are multiple choice questions (with answers) about professional\
\ accounting.\n\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_professional_accounting_generative_spanish"
"task_alias": "professional_accounting_spanish"
"dataset_name": "professional_law"
"description":
"The following are multiple choice questions (with answers) about professional\
\ law.\n\n"
"tag": "mmlu_humanities_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_professional_law_generative_spanish"
"task_alias": "professional_law_spanish"
"dataset_name": "professional_medicine"
"description":
"The following are multiple choice questions (with answers) about professional\
\ medicine.\n\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_professional_medicine_generative_spanish"
"task_alias": "professional_medicine_spanish"
"dataset_name": "professional_psychology"
"description":
"The following are multiple choice questions (with answers) about professional\
\ psychology.\n\n"
"tag": "mmlu_social_sciences_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_professional_psychology_generative_spanish"
"task_alias": "professional_psychology_spanish"
"dataset_name": "public_relations"
"description":
"The following are multiple choice questions (with answers) about public\
\ relations.\n\n"
"tag": "mmlu_social_sciences_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_public_relations_generative_spanish"
"task_alias": "public_relations_spanish"
"dataset_name": "security_studies"
"description":
"The following are multiple choice questions (with answers) about security\
\ studies.\n\n"
"tag": "mmlu_social_sciences_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_security_studies_generative_spanish"
"task_alias": "security_studies_spanish"
"dataset_name": "sociology"
"description":
"The following are multiple choice questions (with answers) about sociology.\n\
\n"
"tag": "mmlu_social_sciences_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_sociology_generative_spanish"
"task_alias": "sociology_spanish"
"dataset_name": "us_foreign_policy"
"description":
"The following are multiple choice questions (with answers) about us\
\ foreign policy.\n\n"
"tag": "mmlu_social_sciences_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_us_foreign_policy_generative_spanish"
"task_alias": "us_foreign_policy_spanish"
"dataset_name": "virology"
"description":
"The following are multiple choice questions (with answers) about virology.\n\
\n"
"tag": "mmlu_other_generative_spanish"
"include": "_default_template_spanish_yaml"
"task": "mmlu_virology_generative_spanish"
"task_alias": "virology_spanish"
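
The per-subject files above all follow one pattern: only the subject name and its category change. As a rough sketch of how such entries could be produced programmatically, the Python below rebuilds the six keys for a given subject. The helper names (`SUBJECT_CATEGORIES`, `build_config`, `render`) are hypothetical and are not taken from this commit, and the rendering relies on JSON string quoting, which YAML parsers generally accept for double-quoted scalars, so the output is not wrapped across lines the way the committed files are.

```python
import json

# Hypothetical subset of the subject -> category mapping, copied from the
# "tag" values visible in the configs above.
SUBJECT_CATEGORIES = {
    "logical_fallacies": "humanities",
    "machine_learning": "stem",
    "management": "other",
    "sociology": "social_sciences",
}


def build_config(subject: str, category: str) -> dict:
    """Assemble the six keys that each per-subject file above contains."""
    readable = subject.replace("_", " ")
    return {
        "dataset_name": subject,
        "description": (
            "The following are multiple choice questions (with answers) "
            f"about {readable}.\n\n"
        ),
        "tag": f"mmlu_{category}_generative_spanish",
        "include": "_default_template_spanish_yaml",
        "task": f"mmlu_{subject}_generative_spanish",
        "task_alias": f"{subject}_spanish",
    }


def render(config: dict) -> str:
    """Render one config as '"key": "value"' lines, one key per line."""
    return "\n".join(
        f"{json.dumps(key)}: {json.dumps(value)}" for key, value in config.items()
    )


if __name__ == "__main__":
    for subject, category in SUBJECT_CATEGORIES.items():
        print(render(build_config(subject, category)))
        print()
```

Keeping the mapping in one place would make it harder for a subject to end up with a mismatched `tag` and `task` name, which is the kind of overlap the pre-commit and pytest fixes in the commit message refer to.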