    Support for AIME dataset (#3248) · 5ac7cdf8
    Janna authored
    * add AIME tasks
    
    * standardize the repeats
    
    * fix task naming
    
    * aime25 only has test set
    
    * edit readme
    
    * add utils
    
    * standardize
    
    * fix case sensitivity
    
    * repeat once
    
    * lint
    
    * more linting
    
    * lint huggingface.py
utils.py 6.35 KB