Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
e4db76cb
Commit
e4db76cb
authored
Jul 09, 2024
by
haileyschoelkopf
Browse files
Merge branch 'main' into multimodal-prototyping
parents
6cc6e9cd
ad80f555
Changes
871
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
49 additions
and
23 deletions
+49
-23
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_us_foreign_policy.yaml
...l/tasks/mmlu/flan_cot_fewshot/mmlu_us_foreign_policy.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_virology.yaml
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_virology.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_world_religions.yaml
...val/tasks/mmlu/flan_cot_fewshot/mmlu_world_religions.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/_mmlu.yaml
lm_eval/tasks/mmlu/flan_cot_zeroshot/_mmlu.yaml
+30
-4
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_abstract_algebra.yaml
...l/tasks/mmlu/flan_cot_zeroshot/mmlu_abstract_algebra.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_anatomy.yaml
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_anatomy.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_astronomy.yaml
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_astronomy.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_business_ethics.yaml
...al/tasks/mmlu/flan_cot_zeroshot/mmlu_business_ethics.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_clinical_knowledge.yaml
...tasks/mmlu/flan_cot_zeroshot/mmlu_clinical_knowledge.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_biology.yaml
...al/tasks/mmlu/flan_cot_zeroshot/mmlu_college_biology.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_chemistry.yaml
.../tasks/mmlu/flan_cot_zeroshot/mmlu_college_chemistry.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_computer_science.yaml
...mmlu/flan_cot_zeroshot/mmlu_college_computer_science.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_mathematics.yaml
...asks/mmlu/flan_cot_zeroshot/mmlu_college_mathematics.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_medicine.yaml
...l/tasks/mmlu/flan_cot_zeroshot/mmlu_college_medicine.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_physics.yaml
...al/tasks/mmlu/flan_cot_zeroshot/mmlu_college_physics.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_computer_security.yaml
.../tasks/mmlu/flan_cot_zeroshot/mmlu_computer_security.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_conceptual_physics.yaml
...tasks/mmlu/flan_cot_zeroshot/mmlu_conceptual_physics.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_econometrics.yaml
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_econometrics.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_electrical_engineering.yaml
...s/mmlu/flan_cot_zeroshot/mmlu_electrical_engineering.yaml
+1
-1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_elementary_mathematics.yaml
...s/mmlu/flan_cot_zeroshot/mmlu_elementary_mathematics.yaml
+1
-1
No files found.
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_us_foreign_policy.yaml
View file @
e4db76cb
...
@@ -51,6 +51,6 @@ fewshot_config:
...
@@ -51,6 +51,6 @@ fewshot_config:
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
us
foreign
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
us
foreign
policy
for
help.
The
2008
financial
crisis
damanged
the
international
reputation
policy
for
help.
The
2008
financial
crisis
damanged
the
international
reputation
of
the
American
model
of
political
economy
and
capitalism.
The
answer
is
(A).'
of
the
American
model
of
political
economy
and
capitalism.
The
answer
is
(A).'
group
:
mmlu_flan_cot_fewshot_social_sciences
tag
:
mmlu_flan_cot_fewshot_social_sciences
include
:
_mmlu_flan_cot_fewshot_template_yaml
include
:
_mmlu_flan_cot_fewshot_template_yaml
task
:
mmlu_flan_cot_fewshot_us_foreign_policy
task
:
mmlu_flan_cot_fewshot_us_foreign_policy
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_virology.yaml
View file @
e4db76cb
...
@@ -40,6 +40,6 @@ fewshot_config:
...
@@ -40,6 +40,6 @@ fewshot_config:
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
virology
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
virology
for
help.
Paroviruses
are
highly
impactful
because
they
do
not
have
nucleic
for
help.
Paroviruses
are
highly
impactful
because
they
do
not
have
nucleic
acid.
The
answer
is
(A).'
acid.
The
answer
is
(A).'
group
:
mmlu_flan_cot_fewshot_other
tag
:
mmlu_flan_cot_fewshot_other
include
:
_mmlu_flan_cot_fewshot_template_yaml
include
:
_mmlu_flan_cot_fewshot_template_yaml
task
:
mmlu_flan_cot_fewshot_virology
task
:
mmlu_flan_cot_fewshot_virology
lm_eval/tasks/mmlu/flan_cot_fewshot/mmlu_world_religions.yaml
View file @
e4db76cb
...
@@ -37,6 +37,6 @@ fewshot_config:
...
@@ -37,6 +37,6 @@ fewshot_config:
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
world
religions
target
:
'
Let'
'
s
think
step
by
step.
We
refer
to
Wikipedia
articles
on
world
religions
for
help.
In
Judaism,
the
most
distinctive
sign
of
the
covenant
is
circumcision
for
help.
In
Judaism,
the
most
distinctive
sign
of
the
covenant
is
circumcision
(brit
milah).
The
answer
is
(B).'
(brit
milah).
The
answer
is
(B).'
group
:
mmlu_flan_cot_fewshot_humanities
tag
:
mmlu_flan_cot_fewshot_humanities
include
:
_mmlu_flan_cot_fewshot_template_yaml
include
:
_mmlu_flan_cot_fewshot_template_yaml
task
:
mmlu_flan_cot_fewshot_world_religions
task
:
mmlu_flan_cot_fewshot_world_religions
lm_eval/tasks/mmlu/flan_cot_zeroshot/_mmlu.yaml
View file @
e4db76cb
group
:
mmlu_flan_cot_zeroshot
group
:
mmlu_flan_cot_zeroshot
group_alias
:
mmlu (flan style, zeroshot cot)
task
:
task
:
-
mmlu_flan_cot_zeroshot_stem
-
group
:
stem
-
mmlu_flan_cot_zeroshot_other
task
:
-
mmlu_flan_cot_zeroshot_social_sciences
-
mmlu_flan_cot_zeroshot_stem
-
mmlu_flan_cot_zeroshot_humanities
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
-
group
:
other
task
:
-
mmlu_flan_cot_zeroshot_other
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
-
group
:
social sciences
task
:
-
mmlu_flan_cot_zeroshot_social_sciences
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
-
group
:
humanities
task
:
-
mmlu_flan_cot_zeroshot_humanities
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
aggregate_metric_list
:
-
metric
:
acc
weight_by_size
:
True
metadata
:
version
:
1
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_abstract_algebra.yaml
View file @
e4db76cb
"
dataset_name"
:
"
abstract_algebra"
"
dataset_name"
:
"
abstract_algebra"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
abstract
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
abstract
\
\
algebra.
\n\n
"
\
algebra.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_abstract_algebra"
"
task"
:
"
mmlu_flan_cot_zeroshot_abstract_algebra"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_anatomy.yaml
View file @
e4db76cb
"
dataset_name"
:
"
anatomy"
"
dataset_name"
:
"
anatomy"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
anatomy.
\n\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
anatomy.
\n\
\n
"
\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_anatomy"
"
task"
:
"
mmlu_flan_cot_zeroshot_anatomy"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_astronomy.yaml
View file @
e4db76cb
"
dataset_name"
:
"
astronomy"
"
dataset_name"
:
"
astronomy"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
astronomy.
\n\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
astronomy.
\n\
\n
"
\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_astronomy"
"
task"
:
"
mmlu_flan_cot_zeroshot_astronomy"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_business_ethics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
business_ethics"
"
dataset_name"
:
"
business_ethics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
business
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
business
\
\
ethics.
\n\n
"
\
ethics.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_other"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_other"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_business_ethics"
"
task"
:
"
mmlu_flan_cot_zeroshot_business_ethics"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_clinical_knowledge.yaml
View file @
e4db76cb
"
dataset_name"
:
"
clinical_knowledge"
"
dataset_name"
:
"
clinical_knowledge"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
clinical
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
clinical
\
\
knowledge.
\n\n
"
\
knowledge.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_other"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_other"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_clinical_knowledge"
"
task"
:
"
mmlu_flan_cot_zeroshot_clinical_knowledge"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_biology.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_biology"
"
dataset_name"
:
"
college_biology"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
biology.
\n\n
"
\
biology.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_biology"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_biology"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_chemistry.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_chemistry"
"
dataset_name"
:
"
college_chemistry"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
chemistry.
\n\n
"
\
chemistry.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_chemistry"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_chemistry"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_computer_science.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_computer_science"
"
dataset_name"
:
"
college_computer_science"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
computer
science.
\n\n
"
\
computer
science.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_computer_science"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_computer_science"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_mathematics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_mathematics"
"
dataset_name"
:
"
college_mathematics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
mathematics.
\n\n
"
\
mathematics.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_mathematics"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_mathematics"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_medicine.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_medicine"
"
dataset_name"
:
"
college_medicine"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
medicine.
\n\n
"
\
medicine.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_other"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_other"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_medicine"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_medicine"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_college_physics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
college_physics"
"
dataset_name"
:
"
college_physics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
college
\
\
physics.
\n\n
"
\
physics.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_physics"
"
task"
:
"
mmlu_flan_cot_zeroshot_college_physics"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_computer_security.yaml
View file @
e4db76cb
"
dataset_name"
:
"
computer_security"
"
dataset_name"
:
"
computer_security"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
computer
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
computer
\
\
security.
\n\n
"
\
security.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_computer_security"
"
task"
:
"
mmlu_flan_cot_zeroshot_computer_security"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_conceptual_physics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
conceptual_physics"
"
dataset_name"
:
"
conceptual_physics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
conceptual
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
conceptual
\
\
physics.
\n\n
"
\
physics.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_conceptual_physics"
"
task"
:
"
mmlu_flan_cot_zeroshot_conceptual_physics"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_econometrics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
econometrics"
"
dataset_name"
:
"
econometrics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
econometrics.
\n\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
econometrics.
\n\
\n
"
\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_social_sciences"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_social_sciences"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_econometrics"
"
task"
:
"
mmlu_flan_cot_zeroshot_econometrics"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_electrical_engineering.yaml
View file @
e4db76cb
"
dataset_name"
:
"
electrical_engineering"
"
dataset_name"
:
"
electrical_engineering"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
electrical
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
electrical
\
\
engineering.
\n\n
"
\
engineering.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_electrical_engineering"
"
task"
:
"
mmlu_flan_cot_zeroshot_electrical_engineering"
lm_eval/tasks/mmlu/flan_cot_zeroshot/mmlu_elementary_mathematics.yaml
View file @
e4db76cb
"
dataset_name"
:
"
elementary_mathematics"
"
dataset_name"
:
"
elementary_mathematics"
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
elementary
\
"
description"
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
elementary
\
\
mathematics.
\n\n
"
\
mathematics.
\n\n
"
"
group
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
tag
"
:
"
mmlu_flan_cot_zeroshot_stem"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
include"
:
"
_mmlu_flan_cot_zeroshot_template_yaml"
"
task"
:
"
mmlu_flan_cot_zeroshot_elementary_mathematics"
"
task"
:
"
mmlu_flan_cot_zeroshot_elementary_mathematics"
Prev
1
…
23
24
25
26
27
28
29
30
31
…
44
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment