Log which subtasks were called with which groups (#1456)

* log group membership * no stray prints * Update evaluator.py

Log which subtasks were called with which groups (#1456)
* log group membership * no stray prints * Update evaluator.py
00dc9960 · Hailey Schoelkopf · GitHub · ba5cdf0f · 00dc9960
Unverified Commit 00dc9960 authored Feb 22, 2024 by Hailey Schoelkopf Committed by GitHub Feb 22, 2024
Show whitespace changes
Inline Side-by-side

Showing with 1 addition and 0 deletions

lm_eval/evaluator.py lm_eval/evaluator.py +1 -0

No files found.
--- a/lm_eval/evaluator.py
+++ b/lm_eval/evaluator.py
@@ -636,6 +636,7 @@ def evaluate(
        results_dict = {
            "results": dict(results_agg.items()),
            **({"groups": dict(groups_agg.items())} if bool(groups_agg) else {}),
+            "group_subtasks": {k: v for k, v in reversed(task_hierarchy.items())},
            "configs": dict(sorted(configs.items())),
            "versions": dict(sorted(versions.items())),
            "n-shot": dict(sorted(num_fewshot.items())),