gaoqiong / lm-evaluation-harness
Commit 0256c682 (Unverified)
Authored Feb 11, 2024 by Uanu; committed by GitHub, Feb 11, 2024
Add multilingual ARC task (#1419)
Parent: 1ff84897
Changes: 34
Showing 14 changed files with 117 additions and 0 deletions
lm_eval/tasks/okapi/arc_multilingual/arc_ne.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_nl.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_pt.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_ro.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_ru.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_sk.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_sr.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_sv.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_ta.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_te.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_uk.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_vi.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_zh.yaml (+7 -0)
lm_eval/tasks/okapi/arc_multilingual/utils.py (+26 -0)
lm_eval/tasks/okapi/arc_multilingual/arc_ne.yaml
0 → 100644
include: _arc_yaml
task: arc_ne
dataset_path: alexandrainst/m_arc
dataset_name: ne
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_nl.yaml
0 → 100644
include: _arc_yaml
task: arc_nl
dataset_path: alexandrainst/m_arc
dataset_name: nl
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_pt.yaml
0 → 100644
include: _arc_yaml
task: arc_pt
dataset_path: alexandrainst/m_arc
dataset_name: pt
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_ro.yaml
0 → 100644
include: _arc_yaml
task: arc_ro
dataset_path: alexandrainst/m_arc
dataset_name: ro
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_ru.yaml
0 → 100644
include: _arc_yaml
task: arc_ru
dataset_path: alexandrainst/m_arc
dataset_name: ru
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_sk.yaml
0 → 100644
include: _arc_yaml
task: arc_sk
dataset_path: alexandrainst/m_arc
dataset_name: sk
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_sr.yaml
0 → 100644
include: _arc_yaml
task: arc_sr
dataset_path: alexandrainst/m_arc
dataset_name: sr
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_sv.yaml
0 → 100644
include: _arc_yaml
task: arc_sv
dataset_path: alexandrainst/m_arc
dataset_name: sv
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_ta.yaml
0 → 100644
include: _arc_yaml
task: arc_ta
dataset_path: alexandrainst/m_arc
dataset_name: ta
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_te.yaml
0 → 100644
include: _arc_yaml
task: arc_te
dataset_path: alexandrainst/m_arc
dataset_name: te
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_uk.yaml
0 → 100644
include: _arc_yaml
task: arc_uk
dataset_path: alexandrainst/m_arc
dataset_name: uk
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_vi.yaml
0 → 100644
include: _arc_yaml
task: arc_vi
dataset_path: alexandrainst/m_arc
dataset_name: vi
training_split: train
validation_split: validation
test_split: test
lm_eval/tasks/okapi/arc_multilingual/arc_zh.yaml
0 → 100644
include: _arc_yaml
task: arc_zh
dataset_path: alexandrainst/m_arc
dataset_name: zh
training_split: train
validation_split: validation
test_split: test
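The per-language configs above differ only in the language code that appears in `task` and `dataset_name`. As a rough illustration of that pattern, the sketch below renders the same seven lines from a template. `TEMPLATE`, `LANGS`, `render_config`, and `render_all` are hypothetical names introduced here for illustration; the commit itself adds the files individually and does not include a generator.

```python
# Hypothetical helper, not part of the commit: renders the text shared by
# the arc_*.yaml files shown in this diff.
TEMPLATE = """\
include: _arc_yaml
task: arc_{lang}
dataset_path: alexandrainst/m_arc
dataset_name: {lang}
training_split: train
validation_split: validation
test_split: test
"""

# Language codes of the YAML files listed on this page of the diff.
LANGS = ["ne", "nl", "pt", "ro", "ru", "sk", "sr", "sv", "ta", "te", "uk", "vi", "zh"]


def render_config(lang: str) -> str:
    """Return the YAML text for one language code."""
    return TEMPLATE.format(lang=lang)


def render_all(langs=LANGS) -> dict:
    """Map each file name (e.g. 'arc_ne.yaml') to its rendered YAML text."""
    return {f"arc_{lang}.yaml": render_config(lang) for lang in langs}
```

Calling `render_all()` returns a dictionary keyed by file name, which could be written to disk or compared against the committed files.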
lm_eval/tasks/okapi/arc_multilingual/utils.py
0 → 100644
import datasets
import re


def preprocess(text):
    if text is None:
        return " "
    text = text.strip()
    text = text.replace(" [title]", ". ")
    text = re.sub("\\[.*?\\]", "", text)
    text = text.replace("  ", " ")
    return text


def process_docs(dataset: datasets.Dataset) -> datasets.Dataset:
    def _process_doc(doc):
        # breakpoint()
        out_doc = {
            "id": doc["id"],
            "query": "Question: " + preprocess(doc["instruction"]) + "\nAnswer:",
            "choices": [
                preprocess(doc['option_a']),
                preprocess(doc['option_b']),
                preprocess(doc['option_c']),
                preprocess(doc['option_d']),
                preprocess(doc['option_e']),
            ],
            "gold": ["A", "B", "C", "D", "E"].index(doc["answer"]),
        }
        return out_doc

    return dataset.map(_process_doc)
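To make the effect of `preprocess` and `_process_doc` concrete, here is a minimal sketch that applies the same per-document transformation to a plain dictionary, so it runs without the `datasets` package. The sample record and the names `process_doc`, `OPTION_KEYS`, and `LABELS` are made up for illustration. It also assumes the final `replace` collapses a double space into a single one, which appears to be the intent; the two string literals look identical in the rendered diff.

```python
import re

OPTION_KEYS = ["option_a", "option_b", "option_c", "option_d", "option_e"]
LABELS = ["A", "B", "C", "D", "E"]


def preprocess(text):
    # Missing options become a single space, as in the commit.
    if text is None:
        return " "
    text = text.strip()
    text = text.replace(" [title]", ". ")
    # Drop any remaining bracketed markers such as "[note]".
    text = re.sub(r"\[.*?\]", "", text)
    # Assumption: collapse the double space left behind by the removal above.
    text = text.replace("  ", " ")
    return text


def process_doc(doc):
    # Same fields that _process_doc builds, for a single plain dict.
    return {
        "id": doc["id"],
        "query": "Question: " + preprocess(doc["instruction"]) + "\nAnswer:",
        "choices": [preprocess(doc[key]) for key in OPTION_KEYS],
        "gold": LABELS.index(doc["answer"]),
    }


# Made-up record using the column names that the commit reads.
sample = {
    "id": "demo-1",
    "instruction": "  Which gas do plants absorb [note] from the air?  ",
    "option_a": "Oxygen",
    "option_b": "Carbon dioxide",
    "option_c": "Nitrogen",
    "option_d": "Helium",
    "option_e": None,
    "answer": "B",
}

result = process_doc(sample)
print(result["query"])
print(result["choices"], result["gold"])
```

Here `gold` is the zero-based position of the answer letter, so an answer of "B" yields 1, and a missing fifth option still produces a placeholder choice rather than an error.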