gaoqiong / lm-evaluation-harness
Commit 75dfac43
Authored May 07, 2024 by lintangsutawika
Parent: 03982e03

readd files

Showing 4 changed files with 50 additions and 50 deletions (+50 -50)
lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml        +11 -11
lm_eval/tasks/mmlu/default/_mmlu_other.yaml             +11 -11
lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml   +11 -11
lm_eval/tasks/mmlu/default/_mmlu_stem.yaml              +17 -17

--- a/lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml
+++ b/lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml
@@ -3,17 +3,17 @@ group_alias: humanities
 task:
   - mmlu_formal_logic
   - mmlu_high_school_european_history
-  # - mmlu_high_school_us_history
-  # - mmlu_high_school_world_history
-  # - mmlu_international_law
-  # - mmlu_jurisprudence
-  # - mmlu_logical_fallacies
-  # - mmlu_moral_disputes
-  # - mmlu_moral_scenarios
-  # - mmlu_philosophy
-  # - mmlu_prehistory
-  # - mmlu_professional_law
-  # - mmlu_world_religions
+  - mmlu_high_school_us_history
+  - mmlu_high_school_world_history
+  - mmlu_international_law
+  - mmlu_jurisprudence
+  - mmlu_logical_fallacies
+  - mmlu_moral_disputes
+  - mmlu_moral_scenarios
+  - mmlu_philosophy
+  - mmlu_prehistory
+  - mmlu_professional_law
+  - mmlu_world_religions
 aggregate_metric: True
 weight_by_size: True
 version: 1

--- a/lm_eval/tasks/mmlu/default/_mmlu_other.yaml
+++ b/lm_eval/tasks/mmlu/default/_mmlu_other.yaml
@@ -3,17 +3,17 @@ group_alias: other
 task:
   - mmlu_business_ethics
   - mmlu_clinical_knowledge
-  # - mmlu_college_medicine
-  # - mmlu_global_facts
-  # - mmlu_human_aging
-  # - mmlu_management
-  # - mmlu_marketing
-  # - mmlu_medical_genetics
-  # - mmlu_miscellaneous
-  # - mmlu_nutrition
-  # - mmlu_professional_accounting
-  # - mmlu_professional_medicine
-  # - mmlu_virology
+  - mmlu_college_medicine
+  - mmlu_global_facts
+  - mmlu_human_aging
+  - mmlu_management
+  - mmlu_marketing
+  - mmlu_medical_genetics
+  - mmlu_miscellaneous
+  - mmlu_nutrition
+  - mmlu_professional_accounting
+  - mmlu_professional_medicine
+  - mmlu_virology
 aggregate_metric: True
 weight_by_size: True
 version: 1

--- a/lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml
+++ b/lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml
 group: mmlu_social_sciences
-group_alias: social_sciences
+group_alias: social sciences
 task:
   - mmlu_econometrics
   - mmlu_high_school_geography
-  # - mmlu_high_school_government_and_politics
-  # - mmlu_high_school_macroeconomics
-  # - mmlu_high_school_microeconomics
-  # - mmlu_high_school_psychology
-  # - mmlu_human_sexuality
-  # - mmlu_professional_psychology
-  # - mmlu_public_relations
-  # - mmlu_security_studies
-  # - mmlu_sociology
-  # - mmlu_us_foreign_policy
+  - mmlu_high_school_government_and_politics
+  - mmlu_high_school_macroeconomics
+  - mmlu_high_school_microeconomics
+  - mmlu_high_school_psychology
+  - mmlu_human_sexuality
+  - mmlu_professional_psychology
+  - mmlu_public_relations
+  - mmlu_security_studies
+  - mmlu_sociology
+  - mmlu_us_foreign_policy
 aggregate_metric: True
 weight_by_size: True
 version: 1

--- a/lm_eval/tasks/mmlu/default/_mmlu_stem.yaml
+++ b/lm_eval/tasks/mmlu/default/_mmlu_stem.yaml
@@ -3,23 +3,23 @@ group_alias: stem
 task:
   - mmlu_abstract_algebra
   - mmlu_anatomy
-  # - mmlu_astronomy
-  # - mmlu_college_biology
-  # - mmlu_college_chemistry
-  # - mmlu_college_computer_science
-  # - mmlu_college_mathematics
-  # - mmlu_college_physics
-  # - mmlu_computer_security
-  # - mmlu_conceptual_physics
-  # - mmlu_electrical_engineering
-  # - mmlu_elementary_mathematics
-  # - mmlu_high_school_biology
-  # - mmlu_high_school_chemistry
-  # - mmlu_high_school_computer_science
-  # - mmlu_high_school_mathematics
-  # - mmlu_high_school_physics
-  # - mmlu_high_school_statistics
-  # - mmlu_machine_learning
+  - mmlu_astronomy
+  - mmlu_college_biology
+  - mmlu_college_chemistry
+  - mmlu_college_computer_science
+  - mmlu_college_mathematics
+  - mmlu_college_physics
+  - mmlu_computer_security
+  - mmlu_conceptual_physics
+  - mmlu_electrical_engineering
+  - mmlu_elementary_mathematics
+  - mmlu_high_school_biology
+  - mmlu_high_school_chemistry
+  - mmlu_high_school_computer_science
+  - mmlu_high_school_mathematics
+  - mmlu_high_school_physics
+  - mmlu_high_school_statistics
+  - mmlu_machine_learning
 aggregate_metric: True
 weight_by_size: True
 version: 1
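
Note on the `aggregate_metric` / `weight_by_size` keys left unchanged in all four files: the diff does not show how the harness consumes them. The sketch below is only an illustration of one plausible reading, namely that a group score is the mean of subtask scores weighted by subtask size. The function name `aggregate_accuracy`, the result layout, and the numbers are all hypothetical and are not taken from lm-evaluation-harness. It may help show why re-adding the commented-out subtasks changes the group score.

```python
from typing import Dict, Tuple


def aggregate_accuracy(
    results: Dict[str, Tuple[float, int]], weight_by_size: bool = True
) -> float:
    """Combine per-subtask accuracies into one group score.

    `results` maps a subtask name to (accuracy, number of examples).
    Hypothetical helper: an assumed reading of `weight_by_size`,
    not the harness's actual implementation.
    """
    if not results:
        raise ValueError("no subtask results to aggregate")
    if weight_by_size:
        # Size-weighted mean: larger subtasks contribute proportionally more.
        total = sum(n for _, n in results.values())
        return sum(acc * n for acc, n in results.values()) / total
    # Unweighted mean: every subtask counts equally.
    return sum(acc for acc, _ in results.values()) / len(results)


# Made-up numbers, for illustration only.
results = {
    "mmlu_abstract_algebra": (0.25, 100),
    "mmlu_anatomy": (0.75, 300),
}
weighted = aggregate_accuracy(results, weight_by_size=True)
unweighted = aggregate_accuracy(results, weight_by_size=False)
print(weighted, unweighted)
```

Under this reading, a group whose task list is mostly commented out reports a score driven by the few remaining subtasks, so restoring the full list is what makes the group score cover the whole category.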