Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
90ad5db7
Commit
90ad5db7
authored
Mar 01, 2024
by
lintangsutawika
Browse files
merged main
parents
f692caa9
b177c82c
Changes
484
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
29 additions
and
26 deletions
+29
-26
lm_eval/tasks/okapi/hellaswag_multilingual/utils.py
lm_eval/tasks/okapi/hellaswag_multilingual/utils.py
+2
-1
lm_eval/tasks/okapi/mmlu_multilingual/_generate_configs.py
lm_eval/tasks/okapi/mmlu_multilingual/_generate_configs.py
+3
-9
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_hy.yaml
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_hy.yaml
+4
-0
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_sk.yaml
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_sk.yaml
+4
-0
lm_eval/tasks/okapi/truthfulqa_multilingual/_truthfulqa_mc1_yaml
.../tasks/okapi/truthfulqa_multilingual/_truthfulqa_mc1_yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_da_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_da_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_da_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_da_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_de_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_de_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_de_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_de_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_es_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_es_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_es_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_es_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc1.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc2.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc2.yaml
+1
-1
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_fr_mc1.yaml
...asks/okapi/truthfulqa_multilingual/truthfulqa_fr_mc1.yaml
+1
-1
No files found.
lm_eval/tasks/okapi/hellaswag_multilingual/utils.py
View file @
90ad5db7
import
datasets
import
re
import
re
import
datasets
def
preprocess
(
text
):
def
preprocess
(
text
):
text
=
text
.
strip
()
text
=
text
.
strip
()
...
...
lm_eval/tasks/okapi/mmlu_multilingual/_generate_configs.py
View file @
90ad5db7
import
yaml
import
datasets
import
datasets
import
yaml
from
tqdm
import
tqdm
from
tqdm
import
tqdm
def
main
()
->
None
:
def
main
()
->
None
:
dataset_path
=
"alexandrainst/m_mmlu"
dataset_path
=
"alexandrainst/m_mmlu"
# Removed hy and sk subdataset because the original dataset is broken
for
task
in
tqdm
(
datasets
.
get_dataset_infos
(
dataset_path
).
keys
()):
# I created this PR https://huggingface.co/datasets/alexandrainst/m_mmlu/discussions/3
# on the dataset for the authors, in case it will be accepeted the filter can be removed
keys_without_hy_sk
=
list
(
filter
(
lambda
k
:
(
'hy'
not
in
k
and
'sk'
not
in
k
),
datasets
.
get_dataset_infos
(
dataset_path
).
keys
()))
for
task
in
tqdm
():
file_name
=
f
"m_mmlu_
{
task
}
.yaml"
file_name
=
f
"m_mmlu_
{
task
}
.yaml"
try
:
try
:
with
open
(
f
"
{
file_name
}
"
,
"w"
)
as
f
:
with
open
(
f
"
{
file_name
}
"
,
"w"
)
as
f
:
...
@@ -29,5 +22,6 @@ def main() -> None:
...
@@ -29,5 +22,6 @@ def main() -> None:
except
FileExistsError
:
except
FileExistsError
:
pass
pass
if
__name__
==
"__main__"
:
if
__name__
==
"__main__"
:
main
()
main
()
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_hy.yaml
0 → 100644
View file @
90ad5db7
# Generated by _generate_configs.py
dataset_name
:
hy
include
:
_default_yaml
task
:
m_mmlu_hy
lm_eval/tasks/okapi/mmlu_multilingual/m_mmlu_sk.yaml
0 → 100644
View file @
90ad5db7
# Generated by _generate_configs.py
dataset_name
:
sk
include
:
_default_yaml
task
:
m_mmlu_sk
lm_eval/tasks/okapi/truthfulqa_multilingual/_truthfulqa_mc1_yaml
View file @
90ad5db7
...
@@ -4,7 +4,7 @@ dataset_path: null
...
@@ -4,7 +4,7 @@ dataset_path: null
dataset_name: null
dataset_name: null
output_type: multiple_choice
output_type: multiple_choice
training_split: null
training_split: null
validation_split: val
idation
validation_split: val
test_split: null
test_split: null
process_docs: !function utils.process_docs
process_docs: !function utils.process_docs
doc_to_text: "query"
doc_to_text: "query"
...
...
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_ar_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_ar_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
ar
dataset_name
:
ar
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ar_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_ar_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_ar_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
ar
dataset_name
:
ar
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_bn_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_bn_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
bn
dataset_name
:
bn
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_bn_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_bn_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_bn_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
bn
dataset_name
:
bn
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_ca_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_ca_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
ca
dataset_name
:
ca
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_ca_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_ca_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_ca_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
ca
dataset_name
:
ca
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_da_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_da_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_da_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
da
dataset_name
:
da
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_da_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_da_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_da_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
da
dataset_name
:
da
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_de_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_de_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_de_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
de
dataset_name
:
de
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_de_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_de_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_de_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
de
dataset_name
:
de
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_es_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_es_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_es_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
es
dataset_name
:
es
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_es_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_es_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_es_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
es
dataset_name
:
es
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_eu_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_eu_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
eu
dataset_name
:
eu
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_eu_mc2.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_eu_mc2
...
@@ -3,5 +3,5 @@ task: truthfulqa_eu_mc2
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
eu
dataset_name
:
eu
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
lm_eval/tasks/okapi/truthfulqa_multilingual/truthfulqa_fr_mc1.yaml
View file @
90ad5db7
...
@@ -3,5 +3,5 @@ task: truthfulqa_fr_mc1
...
@@ -3,5 +3,5 @@ task: truthfulqa_fr_mc1
dataset_path
:
alexandrainst/m_truthfulqa
dataset_path
:
alexandrainst/m_truthfulqa
dataset_name
:
fr
dataset_name
:
fr
training_split
:
null
training_split
:
null
validation_split
:
val
idation
validation_split
:
val
test_split
:
null
test_split
:
null
Prev
1
…
17
18
19
20
21
22
23
24
25
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment