Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
221c7d7d
Unverified
Commit
221c7d7d
authored
Aug 20, 2024
by
Nathan Habib
Committed by
GitHub
Aug 20, 2024
Browse files
fix the leaderboard doc to reflect the tasks (#2219)
parent
97327e43
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
4 deletions
+1
-4
lm_eval/tasks/leaderboard/README.md
lm_eval/tasks/leaderboard/README.md
+1
-4
No files found.
lm_eval/tasks/leaderboard/README.md
View file @
221c7d7d
...
...
@@ -51,7 +51,6 @@ In this work, we focus on a suite of 23 challenging BIG-Bench tasks which we cal
-
`leaderboard_bbh_causal_judgement`
-
`leaderboard_bbh_date_understanding`
-
`leaderboard_bbh_disambiguation_qa`
-
`leaderboard_bbh_dyck_languages`
-
`leaderboard_bbh_formal_fallacies`
-
`leaderboard_bbh_geometric_shapes`
-
`leaderboard_bbh_hyperbaton`
...
...
@@ -59,7 +58,6 @@ In this work, we focus on a suite of 23 challenging BIG-Bench tasks which we cal
-
`leaderboard_bbh_logical_deduction_seven_objects`
-
`leaderboard_bbh_logical_deduction_three_objects`
-
`leaderboard_bbh_movie_recommendation`
-
`leaderboard_bbh_multistep_arithmetic_two`
-
`leaderboard_bbh_navigate`
-
`leaderboard_bbh_object_counting`
-
`leaderboard_bbh_penguins_in_a_table`
...
...
@@ -73,7 +71,6 @@ In this work, we focus on a suite of 23 challenging BIG-Bench tasks which we cal
-
`leaderboard_bbh_tracking_shuffled_objects_seven_objects`
-
`leaderboard_bbh_tracking_shuffled_objects_three_objects`
-
`leaderboard_bbh_web_of_lies`
-
`leaderboard_bbh_word_sorting`
## GPQA
...
...
@@ -215,7 +212,7 @@ Eprint = {arXiv:2206.14858},
-
`leaderboard_math_intermediate_algebra_hard`
-
`leaderboard_math_num_theory_hard`
-
`leaderboard_math_prealgebra_hard`
-
`leaderboard_math_precalc_hard`
-
`leaderboard_math_precalc
ulus
_hard`
## MMLU-Pro
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment