feat: Add mmlu-redux and its Spanish translation as generative task definitions (#2705)
* Added benchmark
* Added more testing
* Added task definition for mmlu_redux and mmlu_redux_spanish
* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
* Add remaining MMLU Redux YAMLs and updated tasks README
* Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
* Add MMLU Redux changes from pr-2705
* Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names
* Enhance retry logic to prevent 429 errors when using the Hugging Face API for tests; apply pre-commit fixes
* Revert Python test changes and comment out one task group to avoid Hugging Face rate limits and task failures
---------
Co-authored-by: CT-6282 <ricardo.godric@hotmail.com>