Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
e160a9f9
Commit
e160a9f9
authored
Oct 08, 2025
by
Baber
Browse files
add mc tasks
parent
c0fc7172
Changes
99
Show whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
85 additions
and
0 deletions
+85
-0
lm_eval/tasks/mrl/mc/_generate_config.py
lm_eval/tasks/mrl/mc/_generate_config.py
+19
-0
lm_eval/tasks/mrl/mc/_template_mc
lm_eval/tasks/mrl/mc/_template_mc
+12
-0
lm_eval/tasks/mrl/mc/acq_arab.yaml
lm_eval/tasks/mrl/mc/acq_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/aeb_arab.yaml
lm_eval/tasks/mrl/mc/aeb_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/afb_arab.yaml
lm_eval/tasks/mrl/mc/afb_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/als_latn.yaml
lm_eval/tasks/mrl/mc/als_latn.yaml
+3
-0
lm_eval/tasks/mrl/mc/amh_ethi.yaml
lm_eval/tasks/mrl/mc/amh_ethi.yaml
+3
-0
lm_eval/tasks/mrl/mc/apc_arab_jord.yaml
lm_eval/tasks/mrl/mc/apc_arab_jord.yaml
+3
-0
lm_eval/tasks/mrl/mc/apc_arab_leba.yaml
lm_eval/tasks/mrl/mc/apc_arab_leba.yaml
+3
-0
lm_eval/tasks/mrl/mc/apc_arab_pale.yaml
lm_eval/tasks/mrl/mc/apc_arab_pale.yaml
+3
-0
lm_eval/tasks/mrl/mc/apc_arab_syri.yaml
lm_eval/tasks/mrl/mc/apc_arab_syri.yaml
+3
-0
lm_eval/tasks/mrl/mc/arb_arab.yaml
lm_eval/tasks/mrl/mc/arb_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/arq_arab.yaml
lm_eval/tasks/mrl/mc/arq_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/ars_arab.yaml
lm_eval/tasks/mrl/mc/ars_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/ary_arab.yaml
lm_eval/tasks/mrl/mc/ary_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/arz_arab.yaml
lm_eval/tasks/mrl/mc/arz_arab.yaml
+3
-0
lm_eval/tasks/mrl/mc/azj_latn.yaml
lm_eval/tasks/mrl/mc/azj_latn.yaml
+3
-0
lm_eval/tasks/mrl/mc/bam_latn.yaml
lm_eval/tasks/mrl/mc/bam_latn.yaml
+3
-0
lm_eval/tasks/mrl/mc/bel_cyrl.yaml
lm_eval/tasks/mrl/mc/bel_cyrl.yaml
+3
-0
lm_eval/tasks/mrl/mc/ben_latn.yaml
lm_eval/tasks/mrl/mc/ben_latn.yaml
+3
-0
No files found.
lm_eval/tasks/mrl/mc/_generate_config.py
0 → 100644
View file @
e160a9f9
from
pathlib
import
Path
import
datasets
if
__name__
==
"__main__"
:
subsets
=
[
x
for
x
in
datasets
.
get_dataset_config_names
(
"mrlbenchmarks/global-piqa-nonparallel"
)
if
not
x
.
startswith
(
"dev"
)
]
PARENT
=
Path
(
__file__
).
parent
for
s
in
subsets
:
with
open
(
PARENT
/
f
"
{
s
}
.yaml"
,
"w"
)
as
f
:
f
.
write
(
"include: '_template_mc'
\n
"
)
f
.
write
(
f
"task: mrl_
{
s
}
\n
"
)
f
.
write
(
f
"dataset_name:
{
s
}
\n
"
)
lm_eval/tasks/mrl/mc/_template_mc
0 → 100644
View file @
e160a9f9
dataset_path: mrlbenchmarks/global-piqa-nonparallel
output_type: multiple_choice
test_split: test
doc_to_text: prompt
doc_to_target: label
doc_to_choice: "{{[solution0, solution1]}}"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: true
metadata:
version: 1.0
lm_eval/tasks/mrl/mc/acq_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_acq_arab
dataset_name
:
acq_arab
lm_eval/tasks/mrl/mc/aeb_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_aeb_arab
dataset_name
:
aeb_arab
lm_eval/tasks/mrl/mc/afb_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_afb_arab
dataset_name
:
afb_arab
lm_eval/tasks/mrl/mc/als_latn.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_als_latn
dataset_name
:
als_latn
lm_eval/tasks/mrl/mc/amh_ethi.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_amh_ethi
dataset_name
:
amh_ethi
lm_eval/tasks/mrl/mc/apc_arab_jord.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_apc_arab_jord
dataset_name
:
apc_arab_jord
lm_eval/tasks/mrl/mc/apc_arab_leba.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_apc_arab_leba
dataset_name
:
apc_arab_leba
lm_eval/tasks/mrl/mc/apc_arab_pale.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_apc_arab_pale
dataset_name
:
apc_arab_pale
lm_eval/tasks/mrl/mc/apc_arab_syri.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_apc_arab_syri
dataset_name
:
apc_arab_syri
lm_eval/tasks/mrl/mc/arb_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_arb_arab
dataset_name
:
arb_arab
lm_eval/tasks/mrl/mc/arq_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_arq_arab
dataset_name
:
arq_arab
lm_eval/tasks/mrl/mc/ars_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_ars_arab
dataset_name
:
ars_arab
lm_eval/tasks/mrl/mc/ary_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_ary_arab
dataset_name
:
ary_arab
lm_eval/tasks/mrl/mc/arz_arab.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_arz_arab
dataset_name
:
arz_arab
lm_eval/tasks/mrl/mc/azj_latn.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_azj_latn
dataset_name
:
azj_latn
lm_eval/tasks/mrl/mc/bam_latn.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_bam_latn
dataset_name
:
bam_latn
lm_eval/tasks/mrl/mc/bel_cyrl.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_bel_cyrl
dataset_name
:
bel_cyrl
lm_eval/tasks/mrl/mc/ben_latn.yaml
0 → 100644
View file @
e160a9f9
include
:
'
_template_mc'
task
:
mrl_ben_latn
dataset_name
:
ben_latn
Prev
1
2
3
4
5
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment