• Rima Shahbazyan's avatar
    Score tasks (#2452) · 0ef7548d
    Rima Shahbazyan authored
    * score readme added
    
    * generate until task's "until" parameter's default value fixed.
    
    * score mmlu-pro and agieval added
    
    * changed macro accuracy to micro for agieval
    
    * Always E removed from agi eval
    
    * redundancies removed
    
    * MATH added
    
    * minor cosmetic changes for math
    
    * Licenses added Readme updated
    
    * changes for flake8 + license header on math
    
    * Score added to readme and precommit was run.
    
    * Score added to readme and precommit was run.
    
    * Import error fixed
    
    * math task bugfix
    postprocess minor fix
    
    * CR for math added
    
    * math CR
    
    * math task bugfix
    postprocess minor fix
    
    CR for math added
    
    * Math cr fixed
    
    * reverting the default "until" parameter change and adjusting  score task configs
    0ef7548d
utils.py 8.77 KB