Adding new subtask to SCORE tasks: non greedy robustness (#2558)
* score readme added * generate until task's "until" parameter's default value fixed. * score mmlu-pro and agieval added * changed macro accuracy to micro for agieval * Always E removed from agi eval * redundancies removed * MATH added * minor cosmetic changes for math * Licenses added Readme updated * changes for flake8 + license header on math * Score added to readme and precommit was run. * Score added to readme and precommit was run. * Import error fixed * math task bugfix postprocess minor fix * CR for math added * math CR * math task bugfix postprocess minor fix CR for math added * Math cr fixed * mmlu_pro non_greedy task added * non greedy summarizer added * Non greedy for all score tasks * Bugfixes for non-greedy * fixing the until argument * undoing the change to "until" arguments default behaviour * minor fix in summarizer * log naming changes for better readability * math subtasks naming fix * agieval subtask naming fix * logging added for debugging * path issue fixed * minor fix * path fix * path fix * non_greedy_math minor fix * final changes * changed readme for non-greedy added Nvidia header added wxample script for non_greedy changed prompts to match that fo trt runs * non greedy summarizer bugfix * non_greedy summarizer fixed
Showing
Please register or sign in to comment