1. 24 Sep, 2025 2 commits
  2. 22 Sep, 2025 1 commit
    • priverabsc's avatar
      Add eqbench tasks in Spanish and Catalan (#3168) · de496b80
      priverabsc authored
      * Add eqbench tasks in Spanish and Catalan
      
      * Incremented catalan_bench and spanish_bench versions. Added 'multilingual' folder inside 'eq_bench' and moved the eqbench_ca and eqbench_es .yaml to that folder. Updated the tasks README with eqbench_es and eqbench_ca, expliciting inside each description both the Hugging Face link and the translation method.
      
      * Fixed tasks table.
      
      * remove test_task.sh and results folder
      
      * Add utils.py to multilingual folder
      de496b80
  3. 21 Sep, 2025 6 commits
  4. 12 Sep, 2025 1 commit
  5. 08 Sep, 2025 5 commits
  6. 02 Sep, 2025 4 commits
  7. 27 Aug, 2025 3 commits
  8. 26 Aug, 2025 1 commit
    • Janna's avatar
      Support for AIME dataset (#3248) · 5ac7cdf8
      Janna authored
      * add AIME tasks
      
      * standardize the repeats
      
      * fix task naming
      
      * aime25 only has test set
      
      * edit readme
      
      * add utils
      
      * standardize
      
      * fix case sensitivity
      
      * repeat once
      
      * lint
      
      * more linting
      
      * lint huggingface.py
      5ac7cdf8
  9. 25 Aug, 2025 5 commits
  10. 23 Aug, 2025 1 commit
  11. 22 Aug, 2025 1 commit
  12. 21 Aug, 2025 9 commits
  13. 13 Aug, 2025 1 commit