Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
b91cabae
Commit
b91cabae
authored
May 14, 2024
by
JessicaOjo
Browse files
fix exact match valueError
parent
348e304a
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
15 additions
and
8 deletions
+15
-8
lm_eval/api/task.py
lm_eval/api/task.py
+15
-8
No files found.
lm_eval/api/task.py
View file @
b91cabae
...
@@ -1367,16 +1367,23 @@ class ConfigurableTask(Task):
...
@@ -1367,16 +1367,23 @@ class ConfigurableTask(Task):
result_score
=
0.0
result_score
=
0.0
else
:
else
:
try
:
try
:
result_score
=
self
.
_metric_fn_list
[
metric
](
if
metric
==
"exact_match"
:
references
=
[
gold
],
result_score
=
self
.
_metric_fn_list
[
metric
](
predictions
=
[
result
],
references
=
[
str
(
gold
)],
**
self
.
_metric_fn_kwargs
[
metric
],
predictions
=
[
str
(
result
)],
)
**
self
.
_metric_fn_kwargs
[
metric
],
)
else
:
result_score
=
self
.
_metric_fn_list
[
metric
](
references
=
[
gold
],
predictions
=
[
result
],
**
self
.
_metric_fn_kwargs
[
metric
],
)
except
TypeError
as
error
:
# needed for now in order to use a different interface between our own metrics and HF Evaluate metrics
except
TypeError
as
error
:
# needed for now in order to use a different interface between our own metrics and HF Evaluate metrics
result_score
=
self
.
_metric_fn_list
[
metric
]([
gold
,
result
])
result_score
=
self
.
_metric_fn_list
[
metric
]([
gold
,
result
])
if
isinstance
(
result_score
,
dict
):
if
isinstance
(
result_score
,
dict
):
# TODO: this handles the case where HF evaluate returns a dict.
# TODO: this handles the case where HF evaluate returns a dict.
result_score
=
result_score
[
metric
]
result_score
=
result_score
[
metric
]
result_dict
[
metric
]
=
result_score
result_dict
[
metric
]
=
result_score
else
:
else
:
raise
ValueError
(
raise
ValueError
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment