• Luis Cosio's avatar
    feat: Add mmlu-redux and it's spanish transaltion as generative task definitions (#2705) · fec9dde7
    Luis Cosio authored
    
    
    * Added benchmark
    
    * Added more testing
    
    * Added task definition for mmlu_redux and mmlu_redux_spanish
    
    * Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
    
    * Add remaining MMLU Redux YAMLs and updated tasks README
    
    * Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
    
    * Add MMLU Redux changes from pr-2705
    
    * Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names
    
    * Enhance retry logic to prevent 429 error when using Hugging Face API for tests, apply pre-commit fixes
    
    * Revert python test changes and comments one task group to avoid Hugging Face rate limit and task failure
    
    ---------
    Co-authored-by: default avatarCT-6282 <ricardo.godric@hotmail.com>
    fec9dde7
README.md 94.5 KB