Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
cb085b02
Commit
cb085b02
authored
May 07, 2024
by
lintangsutawika
Browse files
update mmlu
parent
09bc7c68
Changes
5
Hide whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
75 additions
and
28 deletions
+75
-28
lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml
lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml
+18
-0
lm_eval/tasks/mmlu/default/_mmlu_other.yaml
lm_eval/tasks/mmlu/default/_mmlu_other.yaml
+18
-0
lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml
lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml
+17
-0
lm_eval/tasks/mmlu/default/_mmlu_stem.yaml
lm_eval/tasks/mmlu/default/_mmlu_stem.yaml
+22
-22
lm_eval/tasks/mmlu/default/mmlu.yaml
lm_eval/tasks/mmlu/default/mmlu.yaml
+0
-6
No files found.
lm_eval/tasks/mmlu/default/_mmlu_humanities.yaml
0 → 100644
View file @
cb085b02
group
:
mmlu_humanities
group_alias
:
humanities
task
:
-
formal_logic
-
high_school_european_history
-
high_school_us_history
-
high_school_world_history
-
international_law
-
jurisprudence
-
logical_fallacies
-
moral_disputes
-
moral_scenarios
-
philosophy
-
prehistory
-
professional_law
-
world_religions
aggregate_metric
:
True
weight_by_size
:
True
lm_eval/tasks/mmlu/default/_mmlu_other.yaml
0 → 100644
View file @
cb085b02
group
:
mmlu_other
group_alias
:
other
task
:
-
mmlu_business_ethics
-
mmlu_clinical_knowledge
-
mmlu_college_medicine
-
mmlu_global_facts
-
mmlu_human_aging
-
mmlu_management
-
mmlu_marketing
-
mmlu_medical_genetics
-
mmlu_miscellaneous
-
mmlu_nutrition
-
mmlu_professional_accounting
-
mmlu_professional_medicine
-
mmlu_virology
aggregate_metric
:
True
weight_by_size
:
True
lm_eval/tasks/mmlu/default/_mmlu_social_sciences.yaml
0 → 100644
View file @
cb085b02
group
:
mmlu_social_sciences
group_alias
:
social_sciences
task
:
-
econometrics
-
high_school_geography
-
high_school_government_and_politics
-
high_school_macroeconomics
-
high_school_microeconomics
-
high_school_psychology
-
human_sexuality
-
professional_psychology
-
public_relations
-
security_studies
-
sociology
-
us_foreign_policy
aggregate_metric
:
True
weight_by_size
:
True
lm_eval/tasks/mmlu/default/_mmlu_stem.yaml
View file @
cb085b02
group
:
mmlu_stem
group_alias
:
stem
task
:
-
abstract_algebra
-
anatomy
-
astronomy
-
college_biology
-
college_chemistry
-
college_computer_science
-
college_mathematics
-
college_physics
-
computer_security
-
conceptual_physics
-
electrical_engineering
-
elementary_mathematics
-
high_school_biology
-
high_school_chemistry
-
high_school_computer_science
-
high_school_mathematics
-
high_school_physics
-
high_school_statistics
-
machine_learning
group_config
:
aggregate_metric
:
True
weight_by_size
:
True
-
mmlu_abstract_algebra
-
mmlu_anatomy
-
mmlu_astronomy
-
mmlu_college_biology
-
mmlu_college_chemistry
-
mmlu_college_computer_science
-
mmlu_college_mathematics
-
mmlu_college_physics
-
mmlu_computer_security
-
mmlu_conceptual_physics
-
mmlu_electrical_engineering
-
mmlu_elementary_mathematics
-
mmlu_high_school_biology
-
mmlu_high_school_chemistry
-
mmlu_high_school_computer_science
-
mmlu_high_school_mathematics
-
mmlu_high_school_physics
-
mmlu_high_school_statistics
-
mmlu_machine_learning
aggregate_metric
:
True
weight_by_size
:
True
lm_eval/tasks/mmlu/default/mmlu.yaml
deleted
100644 → 0
View file @
09bc7c68
group
:
mmlu
task
:
-
mmlu_stem
-
mmlu_other
-
mmlu_social_sciences
-
mmlu_humanities
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment