gaoqiong / lm-evaluation-harness
Commit 2106fbeb, authored Jan 15, 2025 by Baber
Merge branch 'main' into mathvista
# Conflicts:
#   lm_eval/models/openai_completions.py
Parents: 4354fe46, 703fbffd
Changes: 574
Showing 20 changed files with 100 additions and 0 deletions
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_biology.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_chemistry.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_computer_science.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_european_history.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_geography.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_government_and_politics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_macroeconomics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_mathematics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_microeconomics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_physics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_psychology.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_statistics.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_us_history.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_world_history.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_human_aging.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_human_sexuality.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_international_law.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_jurisprudence.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_logical_fallacies.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_machine_learning.yaml (+5 -0)
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_biology.yaml
0 → 100644
"dataset_name": "high_school_biology"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_biology"
"task_alias": "high school biology"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_chemistry.yaml
0 → 100644
"dataset_name": "high_school_chemistry"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_chemistry"
"task_alias": "high school chemistry"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_computer_science.yaml
0 → 100644
"dataset_name": "high_school_computer_science"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_computer_science"
"task_alias": "high school computer science"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_european_history.yaml
0 → 100644
"dataset_name": "high_school_european_history"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_high_school_european_history"
"task_alias": "high school european history"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_geography.yaml
0 → 100644
"dataset_name": "high_school_geography"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_high_school_geography"
"task_alias": "high school geography"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_government_and_politics.yaml
0 → 100644
"dataset_name": "high_school_government_and_politics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_high_school_government_and_politics"
"task_alias": "high school government and politics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_macroeconomics.yaml
0 → 100644
"dataset_name": "high_school_macroeconomics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_high_school_macroeconomics"
"task_alias": "high school macroeconomics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_mathematics.yaml
0 → 100644
"dataset_name": "high_school_mathematics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_mathematics"
"task_alias": "high school mathematics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_microeconomics.yaml
0 → 100644
"dataset_name": "high_school_microeconomics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_high_school_microeconomics"
"task_alias": "high school microeconomics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_physics.yaml
0 → 100644
"dataset_name": "high_school_physics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_physics"
"task_alias": "high school physics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_psychology.yaml
0 → 100644
"dataset_name": "high_school_psychology"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_high_school_psychology"
"task_alias": "high school psychology"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_statistics.yaml
0 → 100644
"dataset_name": "high_school_statistics"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_high_school_statistics"
"task_alias": "high school statistics"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_us_history.yaml
0 → 100644
"dataset_name": "high_school_us_history"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_high_school_us_history"
"task_alias": "high school us history"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_high_school_world_history.yaml
0 → 100644
"dataset_name": "high_school_world_history"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_high_school_world_history"
"task_alias": "high school world history"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_human_aging.yaml
0 → 100644
"dataset_name": "human_aging"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_other_tasks"
"task": "mmlu_llama_human_aging"
"task_alias": "human aging"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_human_sexuality.yaml
0 → 100644
"dataset_name": "human_sexuality"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_social_sciences_tasks"
"task": "mmlu_llama_human_sexuality"
"task_alias": "human sexuality"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_international_law.yaml
0 → 100644
"dataset_name": "international_law"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_international_law"
"task_alias": "international law"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_jurisprudence.yaml
0 → 100644
"dataset_name": "jurisprudence"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_jurisprudence"
"task_alias": "jurisprudence"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_logical_fallacies.yaml
0 → 100644
"dataset_name": "logical_fallacies"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_humanities_tasks"
"task": "mmlu_llama_logical_fallacies"
"task_alias": "logical fallacies"
lm_eval/tasks/llama3/instruct/mmlu/mmlu_machine_learning.yaml
0 → 100644
"dataset_name": "machine_learning"
"include": "_continuation_template_yaml"
"tag": "mmlu_llama_stem_tasks"
"task": "mmlu_llama_machine_learning"
"task_alias": "machine learning"
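The per-subject files above all share one five-key shape, and only the subject name and category tag vary. As a rough illustration of that pattern (not the script the authors used), the sketch below renders one such config as text. The helper name `render_config` and its `category` argument are hypothetical. The field values are inferred from the files in this commit.

```python
import json


def render_config(subject: str, category: str) -> str:
    """Render a per-subject config in the five-key shape seen in this commit.

    Hypothetical helper: the name and signature are assumptions made for
    illustration, not part of lm-evaluation-harness.
    """
    fields = {
        "dataset_name": subject,
        "include": "_continuation_template_yaml",
        "tag": f"mmlu_llama_{category}_tasks",
        "task": f"mmlu_llama_{subject}",
        # The alias is the subject name with underscores replaced by spaces.
        "task_alias": subject.replace("_", " "),
    }
    # json.dumps emits double-quoted strings, which are also valid YAML scalars,
    # so no third-party YAML library is needed for this simple flat mapping.
    lines = [f"{json.dumps(key)}: {json.dumps(value)}" for key, value in fields.items()]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_config("high_school_biology", "stem"))
```

The category has to be passed in explicitly because the subject name does not determine it: in this commit, high_school_geography is tagged social_sciences while high_school_us_history is tagged humanities.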