gaoqiong / lm-evaluation-harness
Commit c013679d
Authored Oct 24, 2020 by Charles Foster
Fixes to natural questions.
Parent: dc3560d0
Showing 1 changed file with 7 additions and 5 deletions:
lm_eval/tasks/naturalqs.py (+7, -5)
@@ -20,20 +20,22 @@ class NaturalQs(HFTask):
     def training_docs(self):
         # Cache training for faster few-shot.
         # Data is too large to fit in memory.
         return self.data["train"]

     def doc_to_text(self, doc, include_target=True):
         question = doc['question']['text']
-        short_answer = doc['annotations']['short_answers'][0]['text']
-        long_answer_start = doc['annotations']['long_answer'][0]['start_token']
-        long_answer_end = doc['annotations']['long_answer'][0]['end_token']
-        long_answer = " ".join(doc['document']['tokens']['token'][long_answer_start:long_answer_end])
         text = 'Q: ' + question + '\n\n' + 'A: '
         if include_target:
             # There's a short answer and a long answer. Based on the paper, I'm using the long answer.
+            short_answer = doc['annotations']['short_answers'][0]['text']
+            long_answer_start = doc['annotations']['long_answer'][0]['start_token']
+            long_answer_end = doc['annotations']['long_answer'][0]['end_token']
+            long_answer_span = doc['document']['tokens']['token'][long_answer_start:long_answer_end]
+            long_answer_is_html = doc['document']['tokens']['is_html'][long_answer_start:long_answer_end]
+            long_answer_chars = [tok for (tok, is_html) in zip(long_answer_span, long_answer_is_html) if not is_html]
+            long_answer = " ".join(long_answer_chars)
             text += long_answer  # Replace with short_answer[0] for short answer
         return text
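
The change above does two things: it builds the long answer only when `include_target` is set, and it drops every token flagged `is_html` before joining. A minimal standalone sketch of that behaviour follows. It assumes a record shaped like the fields the diff reads; `toy_doc` and the free-function form (no `HFTask` class, no unused `short_answer` lookup) are illustrative assumptions, not part of the commit.

```python
def doc_to_text(doc, include_target=True):
    """Free-function sketch of the patched method (no HFTask dependency)."""
    question = doc["question"]["text"]
    text = "Q: " + question + "\n\n" + "A: "
    if include_target:
        # Locate the long-answer token span, as in the diff.
        start = doc["annotations"]["long_answer"][0]["start_token"]
        end = doc["annotations"]["long_answer"][0]["end_token"]
        tokens = doc["document"]["tokens"]["token"][start:end]
        is_html = doc["document"]["tokens"]["is_html"][start:end]
        # Keep only tokens that are not HTML markup.
        text += " ".join(tok for tok, html in zip(tokens, is_html) if not html)
    return text


# Hypothetical toy record; real Natural Questions documents are far larger.
toy_doc = {
    "question": {"text": "who wrote hamlet"},
    "annotations": {"long_answer": [{"start_token": 1, "end_token": 8}]},
    "document": {
        "tokens": {
            "token": ["<H1>", "<P>", "Hamlet", "was", "written", "by",
                      "Shakespeare", "</P>", "</H1>"],
            "is_html": [True, True, False, False, False, False,
                        False, True, True],
        }
    },
}

print(doc_to_text(toy_doc))
print(repr(doc_to_text(toy_doc, include_target=False)))
```

With `include_target=False` the function returns only the `Q: ... A: ` prompt and never touches the annotation fields, which is the practical effect of moving the lookups inside the `if` block.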