Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
601be343
Commit
601be343
authored
Jun 23, 2025
by
Baber
Browse files
Merge branch 'main' into feature/eval_from_config
parents
d0884a96
68c3a811
Changes
1000
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
198 additions
and
0 deletions
+198
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_sna.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_sna.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_sot.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_sot.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_swa.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_swa.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_twi.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_twi.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_wol.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_wol.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_xho.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_xho.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_yaml
...tasks/afrixnli/translate/prompt_3/afrixnli_translate_yaml
+27
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_yor.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_yor.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_zul.yaml
...s/afrixnli/translate/prompt_3/afrixnli_translate_zul.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_3/utils.py
lm_eval/tasks/afrixnli/translate/prompt_3/utils.py
+21
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_amh.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_amh.yaml
+9
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_ewe.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_ewe.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_fra.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_fra.yaml
+9
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_hau.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_hau.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_ibo.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_ibo.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_kin.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_kin.yaml
+9
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_lin.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_lin.yaml
+9
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_lug.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_lug.yaml
+9
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_orm.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_orm.yaml
+8
-0
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_sna.yaml
...s/afrixnli/translate/prompt_4/afrixnli_translate_sna.yaml
+9
-0
No files found.
Too many changes to show.
To preserve performance only
1000 of 1000+
files are displayed.
Plain diff
Email patch
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_sna.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
sna
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
chiShona,
identify
if
\
\
the
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
\
\
respond
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_sna_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_sot.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
sot
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Sesotho,
identify
if
the
\
\
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
\
\
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_sot_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_swa.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
swa
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Swahili,
identify
if
the
\
\
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
\
\
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_swa_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_twi.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
twi
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Twi,
identify
if
the
premise
\
\
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
with
\
\
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n\
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_twi_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_wol.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
wol
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Wolof,
identify
if
the
\
\
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
\
\
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_wol_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_xho.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
xho
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
isiXhosa,
identify
if
\
\
the
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
\
\
respond
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_xho_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_yaml
0 → 100644
View file @
601be343
tag: afrixnli_tt_tasks
dataset_path: masakhane/afrixnli-translate-test
dataset_name: null
output_type: multiple_choice
test_split: test
fewshot_split: test
doc_to_target: !function utils.doc_to_target
doc_to_choice:
- "entailment"
- "neutral"
- "contradiction"
should_decontaminate: true
doc_to_decontamination_query: premise
metric_list:
- metric: f1
aggregation: !function utils.weighted_f1_score
average: weighted
higher_is_better: True
ignore_case: true
ignore_punctuation: true
- metric: acc
aggregation: mean
higher_is_better: true
ignore_case: true
ignore_punctuation: true
metadata:
version: 1.0
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_yor.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
yor
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Yoruba,
identify
if
the
\
\
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
\
\
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_yor_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/afrixnli_translate_zul.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
zul
doc_to_text
:
"
Given
the
following
premise
and
hypothesis
in
Zulu,
identify
if
the
\
\
premise
entails,
contradicts,
or
is
neutral
towards
the
hypothesis.
Please
respond
\
\
with
exact
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_zul_prompt_3
lm_eval/tasks/afrixnli/translate/prompt_3/utils.py
0 → 100644
View file @
601be343
from
lm_eval.utils
import
weighted_f1_score
def
doc_to_text
(
doc
):
output
=
"""You are an NLP assistant whose purpose is to solve Natural Language Inference (NLI) problems
Please identify whether the premise entails or contradicts the hypothesis in the following premise
and hypothesis. The answer should be exact entailment, contradiction, or neutral.
Premise: {premise}
Hypothesis: {hypothesis}
Is it entailment, contradiction, or neutral?"""
text
=
output
.
format
(
premise
=
doc
[
"premise"
],
hypothesis
=
doc
[
"hypothesis"
])
return
text
def
doc_to_target
(
doc
):
replacements
=
{
0
:
"entailment"
,
1
:
"neutral"
,
2
:
"contradiction"
}
return
replacements
[
doc
[
"label"
]]
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_amh.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
amh
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Amharic
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Amharic,
and
\
\
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
\
\
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
\
\
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_amh_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_ewe.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
ewe
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Ewe
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Ewe,
and
determine
\
\
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
'entailment',
\
\
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_ewe_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_fra.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
fra
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
French
language.
\n
Analyze
the
premise
and
hypothesis
given
in
French,
and
\
\
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
\
\
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
\
\
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_fra_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_hau.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
hau
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Hausa
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Hausa,
and
determine
\
\
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
'entailment',
\
\
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_hau_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_ibo.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
ibo
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Igbo
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Igbo,
and
determine
\
\
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
'entailment',
\
\
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_ibo_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_kin.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
kin
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Kinyarwanda
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Kinyarwanda,
\
\
and
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
\
\
options:
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_kin_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_lin.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
lin
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Lingala
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Lingala,
and
\
\
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
\
\
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
\
\
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_lin_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_lug.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
lug
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Luganda
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Luganda,
and
\
\
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
\
\
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
\
\
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_lug_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_orm.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
orm
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
Oromo
language.
\n
Analyze
the
premise
and
hypothesis
given
in
Oromo,
and
determine
\
\
the
relationship
between
them.
\n
Respond
with
one
of
the
following
options:
'entailment',
\
\
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_orm_prompt_4
lm_eval/tasks/afrixnli/translate/prompt_4/afrixnli_translate_sna.yaml
0 → 100644
View file @
601be343
# Generated by utils.py
dataset_name
:
sna
doc_to_text
:
"
You
are
an
expert
in
Natural
Language
Inference
(NLI)
specializing
in
\
\
the
chiShona
language.
\n
Analyze
the
premise
and
hypothesis
given
in
chiShona,
\
\
and
determine
the
relationship
between
them.
\n
Respond
with
one
of
the
following
\
\
options:
'entailment',
'contradiction',
or
'neutral'.
\n\n
Premise:
{{premise}}
\
\ \n
Hypothesis:
{{hypothesis}}"
include
:
afrixnli_translate_yaml
task
:
afrixnli_translate_sna_prompt_4
Prev
1
…
41
42
43
44
45
46
47
48
49
50
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment