feat: Add mmlu-redux and it's spanish transaltion as generative task definitions (#2705)
* Added benchmark
* Added more testing
* Added task definition for mmlu_redux and mmlu_redux_spanish
* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
* Add remaining MMLU Redux YAMLs and updated tasks README
* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
* Add MMLU Redux changes from pr-2705
* Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names
* Enhance retry logic to prevent 429 error when using Hugging Face API for tests, apply pre-commit fixes
* Revert python test changes and comments one task group to avoid Hugging Face rate limit and task failure
---------
Co-authored-by:
CT-6282 <ricardo.godric@hotmail.com>
Showing
This diff is collapsed.
Please register or sign in to comment