Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
ace4393e
Unverified
Commit
ace4393e
authored
Jan 15, 2024
by
Hailey Schoelkopf
Committed by
GitHub
Jan 15, 2024
Browse files
fix whitespace in target + prompt for CoT gsm8k (#1275)
parent
89618bf8
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
12 additions
and
14 deletions
+12
-14
lm_eval/tasks/gsm8k/gsm8k-cot-self-consistency.yaml
lm_eval/tasks/gsm8k/gsm8k-cot-self-consistency.yaml
+1
-1
lm_eval/tasks/gsm8k/gsm8k-cot.yaml
lm_eval/tasks/gsm8k/gsm8k-cot.yaml
+11
-12
lm_eval/tasks/gsm8k/gsm8k.yaml
lm_eval/tasks/gsm8k/gsm8k.yaml
+0
-1
No files found.
lm_eval/tasks/gsm8k/gsm8k-cot-self-consistency.yaml
View file @
ace4393e
...
@@ -31,4 +31,4 @@ filter_list:
...
@@ -31,4 +31,4 @@ filter_list:
-
function
:
"
majority_vote"
-
function
:
"
majority_vote"
-
function
:
"
take_first"
-
function
:
"
take_first"
metadata
:
metadata
:
version
:
1
.0
version
:
2
.0
lm_eval/tasks/gsm8k/gsm8k-cot.yaml
View file @
ace4393e
...
@@ -5,16 +5,16 @@ dataset_path: gsm8k
...
@@ -5,16 +5,16 @@ dataset_path: gsm8k
dataset_name
:
main
dataset_name
:
main
output_type
:
generate_until
output_type
:
generate_until
test_split
:
test
test_split
:
test
doc_to_text
:
"
Q:
There
are
15
trees
in
the
grove.
Grove
workers
will
plant
trees
in
the
grove
today.
After
they
are
done,
there
will
be
21
trees.
How
many
trees
did
the
grove
workers
plant
today?
\
n\
n
A:
There
are
15
trees
originally.
Then
there
were
21
trees
after
some
more
were
planted.
So
there
must
have
been
21
-
15
=
6.
The
answer
is
6.
\n\n\
doc_to_text
:
"
Q:
There
are
15
trees
in
the
grove.
Grove
workers
will
plant
trees
in
the
grove
today.
After
they
are
done,
there
will
be
21
trees.
How
many
trees
did
the
grove
workers
plant
today?
\n
A:
There
are
15
trees
originally.
Then
there
were
21
trees
after
some
more
were
planted.
So
there
must
have
been
21
-
15
=
6.
The
answer
is
6.
\n\n\
Q:
If
there
are
3
cars
in
the
parking
lot
and
2
more
cars
arrive,
how
many
cars
are
in
the
parking
lot?
\
n\
n
A:
There
are
originally
3
cars.
2
more
cars
arrive.
3
+
2
=
5.
The
answer
is
5.
\n\n\
Q:
If
there
are
3
cars
in
the
parking
lot
and
2
more
cars
arrive,
how
many
cars
are
in
the
parking
lot?
\n
A:
There
are
originally
3
cars.
2
more
cars
arrive.
3
+
2
=
5.
The
answer
is
5.
\n\n\
Q:
Leah
had
32
chocolates
and
her
sister
had
42.
If
they
ate
35,
how
many
pieces
do
they
have
left
in
total?
\
n\
n
A:
Originally,
Leah
had
32
chocolates.
Her
sister
had
42.
So
in
total
they
had
32
+
42
=
74.
After
eating
35,
they
had
74
-
35
=
39.
The
answer
is
39.
\n\n\
Q:
Leah
had
32
chocolates
and
her
sister
had
42.
If
they
ate
35,
how
many
pieces
do
they
have
left
in
total?
\n
A:
Originally,
Leah
had
32
chocolates.
Her
sister
had
42.
So
in
total
they
had
32
+
42
=
74.
After
eating
35,
they
had
74
-
35
=
39.
The
answer
is
39.
\n\n\
Q:
Jason
had
20
lollipops.
He
gave
Denny
some
lollipops.
Now
Jason
has
12
lollipops.
How
many
lollipops
did
Jason
give
to
Denny?
\
n\
n
A:
Jason
started
with
20
lollipops.
Then
he
had
12
after
giving
some
to
Denny.
So
he
gave
Denny
20
-
12
=
8.
The
answer
is
8.
\n\n\
Q:
Jason
had
20
lollipops.
He
gave
Denny
some
lollipops.
Now
Jason
has
12
lollipops.
How
many
lollipops
did
Jason
give
to
Denny?
\n
A:
Jason
started
with
20
lollipops.
Then
he
had
12
after
giving
some
to
Denny.
So
he
gave
Denny
20
-
12
=
8.
The
answer
is
8.
\n\n\
Q:
Shawn
has
five
toys.
For
Christmas,
he
got
two
toys
each
from
his
mom
and
dad.
How
many
toys
does
he
have
now?
\
n\
n
A:
Shawn
started
with
5
toys.
If
he
got
2
toys
each
from
his
mom
and
dad,
then
that
is
4
more
toys.
5
+
4
=
9.
The
answer
is
9.
\n\n\
Q:
Shawn
has
five
toys.
For
Christmas,
he
got
two
toys
each
from
his
mom
and
dad.
How
many
toys
does
he
have
now?
\n
A:
Shawn
started
with
5
toys.
If
he
got
2
toys
each
from
his
mom
and
dad,
then
that
is
4
more
toys.
5
+
4
=
9.
The
answer
is
9.
\n\n\
Q:
There
were
nine
computers
in
the
server
room.
Five
more
computers
were
installed
each
day,
from
monday
to
thursday.
How
many
computers
are
now
in
the
server
room?
\
n\
n
A:
There
were
originally
9
computers.
For
each
of
4
days,
5
more
computers
were
added.
So
5
*
4
=
20
computers
were
added.
9
+
20
is
29.
The
answer
is
29.
\n\n\
Q:
There
were
nine
computers
in
the
server
room.
Five
more
computers
were
installed
each
day,
from
monday
to
thursday.
How
many
computers
are
now
in
the
server
room?
\n
A:
There
were
originally
9
computers.
For
each
of
4
days,
5
more
computers
were
added.
So
5
*
4
=
20
computers
were
added.
9
+
20
is
29.
The
answer
is
29.
\n\n\
Q:
Michael
had
58
golf
balls.
On
tuesday,
he
lost
23
golf
balls.
On
wednesday,
he
lost
2
more.
How
many
golf
balls
did
he
have
at
the
end
of
wednesday?
\
n\
n
A:
Michael
started
with
58
golf
balls.
After
losing
23
on
tuesday,
he
had
58
-
23
=
35.
After
losing
2
more,
he
had
35
-
2
=
33
golf
balls.
The
answer
is
33.
\n\n\
Q:
Michael
had
58
golf
balls.
On
tuesday,
he
lost
23
golf
balls.
On
wednesday,
he
lost
2
more.
How
many
golf
balls
did
he
have
at
the
end
of
wednesday?
\n
A:
Michael
started
with
58
golf
balls.
After
losing
23
on
tuesday,
he
had
58
-
23
=
35.
After
losing
2
more,
he
had
35
-
2
=
33
golf
balls.
The
answer
is
33.
\n\n\
Q:
Olivia
has
$23.
She
bought
five
bagels
for
$3
each.
How
much
money
does
she
have
left?
\
n\
n
A:
Olivia
had
23
dollars.
5
bagels
for
3
dollars
each
will
be
5
x
3
=
15
dollars.
So
she
has
23
-
15
dollars
left.
23
-
15
is
8.
The
answer
is
8.
\n\n\
Q:
Olivia
has
$23.
She
bought
five
bagels
for
$3
each.
How
much
money
does
she
have
left?
\n
A:
Olivia
had
23
dollars.
5
bagels
for
3
dollars
each
will
be
5
x
3
=
15
dollars.
So
she
has
23
-
15
dollars
left.
23
-
15
is
8.
The
answer
is
8.
\n\n\
Q:
{{question}}
\
n\
n
A:"
Q:
{{question}}
\n
A:"
doc_to_target
:
"
{{answer.split('###
')[-1].
r
strip()}}"
doc_to_target
:
"
{{answer.split('###
#
')[-1].strip()}}"
metric_list
:
metric_list
:
-
metric
:
exact_match
-
metric
:
exact_match
aggregation
:
mean
aggregation
:
mean
...
@@ -31,7 +31,6 @@ generation_kwargs:
...
@@ -31,7 +31,6 @@ generation_kwargs:
-
"
Q:"
-
"
Q:"
-
"
\n\n
"
-
"
\n\n
"
do_sample
:
false
do_sample
:
false
temperature
:
0.0
repeats
:
1
repeats
:
1
num_fewshot
:
0
num_fewshot
:
0
filter_list
:
filter_list
:
...
@@ -41,4 +40,4 @@ filter_list:
...
@@ -41,4 +40,4 @@ filter_list:
regex_pattern
:
"
The
answer
is
(
\\
-?[0-9
\\
.
\\
,]+)."
regex_pattern
:
"
The
answer
is
(
\\
-?[0-9
\\
.
\\
,]+)."
-
function
:
"
take_first"
-
function
:
"
take_first"
metadata
:
metadata
:
version
:
1
.0
version
:
2
.0
lm_eval/tasks/gsm8k/gsm8k.yaml
View file @
ace4393e
...
@@ -24,7 +24,6 @@ generation_kwargs:
...
@@ -24,7 +24,6 @@ generation_kwargs:
-
"
\n\n
"
-
"
\n\n
"
-
"
Question:"
-
"
Question:"
do_sample
:
false
do_sample
:
false
temperature
:
0.0
repeats
:
1
repeats
:
1
num_fewshot
:
5
num_fewshot
:
5
filter_list
:
filter_list
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment