Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
9bee4b4f
Commit
9bee4b4f
authored
Aug 05, 2024
by
lintangsutawika
Browse files
add process for each subtask
parent
578f5d48
Changes
14
Hide whitespace changes
Inline
Side-by-side
Showing
14 changed files
with
16 additions
and
15 deletions
+16
-15
lm_eval/tasks/mmlu_pro/_default_template_yaml
lm_eval/tasks/mmlu_pro/_default_template_yaml
+1
-0
lm_eval/tasks/mmlu_pro/mmlu_pro_business.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_business.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_chemistry.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_chemistry.yaml
+3
-3
lm_eval/tasks/mmlu_pro/mmlu_pro_computer_science.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_computer_science.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_economics.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_economics.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_engineering.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_engineering.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_health.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_health.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_history.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_history.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_law.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_law.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_math.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_math.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_other.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_other.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_philosophy.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_philosophy.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_physics.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_physics.yaml
+1
-1
lm_eval/tasks/mmlu_pro/mmlu_pro_psychology.yaml
lm_eval/tasks/mmlu_pro/mmlu_pro_psychology.yaml
+1
-1
No files found.
lm_eval/tasks/mmlu_pro/_default_template_yaml
View file @
9bee4b4f
...
...
@@ -4,6 +4,7 @@ fewshot_split: validation
fewshot_config:
sampler: first_n
doc_to_text: !function utils.fewshot_to_text
doc_to_target: ""
output_type: generate_until
doc_to_text: !function utils.doc_to_text
doc_to_target: answer
...
...
lm_eval/tasks/mmlu_pro/mmlu_pro_business.yaml
View file @
9bee4b4f
dataset_name
:
"
business"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
business.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_business"
task_alias
:
"
business"
process_docs
:
!function
utils.process_business
lm_eval/tasks/mmlu_pro/mmlu_pro_chemistry.yaml
View file @
9bee4b4f
dataset_name
:
"
math"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
chemistry.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_math"
task_alias
:
"
math"
task
:
"
mmlu_pro_chemistry"
task_alias
:
"
chemistry"
process_docs
:
!function
utils.process_chemistry
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_computer_science.yaml
View file @
9bee4b4f
dataset_name
:
"
computer_science"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
computer
science.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_computer_science"
task_alias
:
"
computer_science"
process_docs
:
!function
utils.process_computer_science
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_economics.yaml
View file @
9bee4b4f
dataset_name
:
"
economics"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
economics.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_economics"
task_alias
:
"
economics"
process_docs
:
!function
utils.process_economics
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_engineering.yaml
View file @
9bee4b4f
dataset_name
:
"
engineering"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
engineering.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_engineering"
task_alias
:
"
engineering"
process_docs
:
!function
utils.process_engineering
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_health.yaml
View file @
9bee4b4f
dataset_name
:
"
health"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
health.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_health"
task_alias
:
"
health"
process_docs
:
!function
utils.process_health
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_history.yaml
View file @
9bee4b4f
dataset_name
:
"
history"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
history.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_history"
task_alias
:
"
history"
process_docs
:
!function
utils.process_history
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_law.yaml
View file @
9bee4b4f
dataset_name
:
"
law"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
law.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_law"
task_alias
:
"
law"
process_docs
:
!function
utils.process_law
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_math.yaml
View file @
9bee4b4f
dataset_name
:
"
math"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
math.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_math"
task_alias
:
"
math"
process_docs
:
!function
utils.process_math
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_other.yaml
View file @
9bee4b4f
dataset_name
:
"
other"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
other.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_other"
task_alias
:
"
other"
process_docs
:
!function
utils.process_other
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_philosophy.yaml
View file @
9bee4b4f
dataset_name
:
"
philosophy"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
philosophy.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_philosophy"
task_alias
:
"
philosophy"
process_docs
:
!function
utils.process_philosophy
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_physics.yaml
View file @
9bee4b4f
dataset_name
:
"
physics"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
physics.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_physics"
task_alias
:
"
physics"
process_docs
:
!function
utils.process_physics
\ No newline at end of file
lm_eval/tasks/mmlu_pro/mmlu_pro_psychology.yaml
View file @
9bee4b4f
dataset_name
:
"
psychology"
description
:
"
The
following
are
multiple
choice
questions
(with
answers)
about
psychology.
Think
step
by
step
and
then
finish
your
answer
with
\"
the
answer
is
(X)
\"
where
X
is
the
correct
letter
choice."
include
:
"
_default_template_yaml"
task
:
"
mmlu_pro_psychology"
task_alias
:
"
psychology"
process_docs
:
!function
utils.process_psychology
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment