Unverified Commit fec9dde7 authored by Luis Cosio's avatar Luis Cosio Committed by GitHub
Browse files

feat: Add mmlu-redux and it's spanish transaltion as generative task definitions (#2705)



* Added benchmark

* Added more testing

* Added task definition for mmlu_redux and mmlu_redux_spanish

* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs

* Add remaining MMLU Redux YAMLs and updated tasks README

* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs

* Add MMLU Redux changes from pr-2705

* Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names

* Enhance retry logic to prevent 429 error when using Hugging Face API for tests, apply pre-commit fixes

* Revert python test changes and comments one task group to avoid Hugging Face rate limit and task failure

---------
Co-authored-by: default avatarCT-6282 <ricardo.godric@hotmail.com>
parent 368275f3
"dataset_name": "virology"
"description": "The following are multiple choice questions (with answers) about virology.\n\
\n"
"tag": "mmlu_other_generative"
"include": "_default_template_yaml"
"task": "mmlu_virology_generative"
"task_alias": "virology"
"dataset_name": "world_religions"
"description": "The following are multiple choice questions (with answers) about world\
\ religions.\n\n"
"tag": "mmlu_humanities_generative"
"include": "_default_template_yaml"
"task": "mmlu_world_religions_generative"
"task_alias": "world_religions"
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment