Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
f23ae748
Commit
f23ae748
authored
Sep 03, 2023
by
lintangsutawika
Browse files
add mmlu variants
parent
191458b8
Changes
235
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
161 additions
and
0 deletions
+161
-0
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
...s/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
+12
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_abstract_algebra.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_abstract_algebra.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_anatomy.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_anatomy.yaml
+7
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_astronomy.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_astronomy.yaml
+7
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_business_ethics.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_business_ethics.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_clinical_knowledge.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_clinical_knowledge.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_biology.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_biology.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_chemistry.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_chemistry.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_computer_science.yaml
...tasks/mmlu/flan_n_shot/mmlu_college_computer_science.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_mathematics.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_mathematics.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_medicine.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_medicine.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_physics.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_physics.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_computer_security.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_computer_security.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_conceptual_physics.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_conceptual_physics.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_econometrics.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_econometrics.yaml
+7
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_electrical_engineering.yaml
...l/tasks/mmlu/flan_n_shot/mmlu_electrical_engineering.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_elementary_mathematics.yaml
...l/tasks/mmlu/flan_n_shot/mmlu_elementary_mathematics.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_formal_logic.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_formal_logic.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_global_facts.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_global_facts.yaml
+8
-0
lm_eval/tasks/mmlu/flan_n_shot/mmlu_high_school_biology.yaml
lm_eval/tasks/mmlu/flan_n_shot/mmlu_high_school_biology.yaml
+8
-0
No files found.
lm_eval/tasks/mmlu/flan_n_shot/_mmlu_flan_loglikelihood_template_yaml
0 → 100644
View file @
f23ae748
group: mmlu_flan_loglikelihood
dataset_path: cais/mmlu
validation_split: validation
fewshot_split: dev
doc_to_text: "Q: {{question.strip()}}\n(A) {{choices[0]}} (B) {{choices[1]}} (C) {{choices[2]}} (D) {{choices[3]}}\nA:"
output_type: multiple_choice
doc_to_choice: ['(A)', '(B)', '(C)', '(D)']
doc_to_target: answer
metric_list:
- metric: acc
aggregation: mean
higher_is_better: true
lm_eval/tasks/mmlu/flan_n_shot/mmlu_abstract_algebra.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
abstract_algebra
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
abstract
algebra.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_abstract_algebra
lm_eval/tasks/mmlu/flan_n_shot/mmlu_anatomy.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
anatomy
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
anatomy.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_anatomy
lm_eval/tasks/mmlu/flan_n_shot/mmlu_astronomy.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
astronomy
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
astronomy.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_astronomy
lm_eval/tasks/mmlu/flan_n_shot/mmlu_business_ethics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
business_ethics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
business
ethics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_business_ethics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_clinical_knowledge.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
clinical_knowledge
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
clinical
knowledge.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_clinical_knowledge
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_biology.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_biology
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
biology.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_biology
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_chemistry.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_chemistry
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
chemistry.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_chemistry
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_computer_science.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_computer_science
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
computer
science.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_computer_science
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_mathematics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_mathematics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
mathematics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_mathematics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_medicine.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_medicine
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
medicine.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_medicine
lm_eval/tasks/mmlu/flan_n_shot/mmlu_college_physics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
college_physics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
college
physics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_college_physics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_computer_security.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
computer_security
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
computer
security.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_computer_security
lm_eval/tasks/mmlu/flan_n_shot/mmlu_conceptual_physics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
conceptual_physics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
conceptual
physics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_conceptual_physics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_econometrics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
econometrics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
econometrics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_econometrics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_electrical_engineering.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
electrical_engineering
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
electrical
engineering.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_electrical_engineering
lm_eval/tasks/mmlu/flan_n_shot/mmlu_elementary_mathematics.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
elementary_mathematics
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
elementary
mathematics.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_elementary_mathematics
lm_eval/tasks/mmlu/flan_n_shot/mmlu_formal_logic.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
formal_logic
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
formal
logic.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_formal_logic
lm_eval/tasks/mmlu/flan_n_shot/mmlu_global_facts.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
global_facts
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
global
facts.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_global_facts
lm_eval/tasks/mmlu/flan_n_shot/mmlu_high_school_biology.yaml
0 → 100644
View file @
f23ae748
dataset_name
:
high_school_biology
description
:
'
The
following
are
multiple
choice
questions
(with
answers)
about
high
school
biology.
'
include
:
_mmlu_flan_generative_template_yaml
task
:
mmlu_flan_n_shot_high_school_biology
Prev
1
…
3
4
5
6
7
8
9
10
11
12
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment