1. 21 Sep, 2025 1 commit
    • Luis Cosio's avatar
      feat: Add mmlu-redux and it's spanish transaltion as generative task definitions (#2705) · fec9dde7
      Luis Cosio authored
      
      
      * Added benchmark
      
      * Added more testing
      
      * Added task definition for mmlu_redux and mmlu_redux_spanish
      
      * Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
      
      * Add remaining MMLU Redux YAMLs and updated tasks README
      
      * Add MMLU Redux English and Spanish tasks with YAML fixes and READMEs
      
      * Add MMLU Redux changes from pr-2705
      
      * Resolve pre-commit hook and pytest overlapping group issues by adding mmlu_redux_spanish task entries and unique subgroup names
      
      * Enhance retry logic to prevent 429 error when using Hugging Face API for tests, apply pre-commit fixes
      
      * Revert python test changes and comments one task group to avoid Hugging Face rate limit and task failure
      
      ---------
      Co-authored-by: default avatarCT-6282 <ricardo.godric@hotmail.com>
      fec9dde7