- 06 May, 2025 1 commit
-
-
Anna Fontana authored
* Fix import error for eval_logger in score utils * pacify pre-commit --------- Co-authored-by:Baber <baber@hey.com>
-
- 16 Dec, 2024 1 commit
-
-
Rima Shahbazyan authored
* score readme added * generate until task's "until" parameter's default value fixed. * score mmlu-pro and agieval added * changed macro accuracy to micro for agieval * Always E removed from agi eval * redundancies removed * MATH added * minor cosmetic changes for math * Licenses added Readme updated * changes for flake8 + license header on math * Score added to readme and precommit was run. * Score added to readme and precommit was run. * Import error fixed * math task bugfix postprocess minor fix * CR for math added * math CR * math task bugfix postprocess minor fix CR for math added * Math cr fixed * mmlu_pro non_greedy task added * non greedy summarizer added * Non greedy for all score tasks * Bugfixes for non-greedy * fixing the until argument * undoing the change to "until" arguments default behaviour * minor fix in summarizer * log naming changes for better readability * math subtasks naming fix * agieval subtask naming fix * logging added for debugging * path issue fixed * minor fix * path fix * path fix * non_greedy_math minor fix * final changes * changed readme for non-greedy added Nvidia header added wxample script for non_greedy changed prompts to match that fo trt runs * non greedy summarizer bugfix * non_greedy summarizer fixed
-
- 26 Nov, 2024 1 commit
-
-
Rima Shahbazyan authored
* score readme added * generate until task's "until" parameter's default value fixed. * score mmlu-pro and agieval added * changed macro accuracy to micro for agieval * Always E removed from agi eval * redundancies removed * MATH added * minor cosmetic changes for math * Licenses added Readme updated * changes for flake8 + license header on math * Score added to readme and precommit was run. * Score added to readme and precommit was run. * Import error fixed * math task bugfix postprocess minor fix * CR for math added * math CR * math task bugfix postprocess minor fix CR for math added * Math cr fixed * reverting the default "until" parameter change and adjusting score task configs
-