Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
f49b0377
Unverified
Commit
f49b0377
authored
Dec 03, 2024
by
Naiara Perez
Committed by
GitHub
Dec 03, 2024
Browse files
add Basque translation of PIQA (piqa_eu) to BasqueBench (#2531)
parent
1170ef9e
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
24 additions
and
0 deletions
+24
-0
lm_eval/tasks/basque_bench/README.md
lm_eval/tasks/basque_bench/README.md
+2
-0
lm_eval/tasks/basque_bench/basque_bench.yaml
lm_eval/tasks/basque_bench/basque_bench.yaml
+1
-0
lm_eval/tasks/basque_bench/piqa_eu.yaml
lm_eval/tasks/basque_bench/piqa_eu.yaml
+21
-0
No files found.
lm_eval/tasks/basque_bench/README.md
View file @
f49b0377
...
@@ -8,6 +8,7 @@ The new evaluation datasets included in BasqueBench are:
...
@@ -8,6 +8,7 @@ The new evaluation datasets included in BasqueBench are:
| Task | Category | Homepage |
| Task | Category | Homepage |
|:-------------:|:-----:|:-----:|
|:-------------:|:-----:|:-----:|
| MGSM_eu | Math | https://huggingface.co/datasets/HiTZ/MGSM-eu |
| MGSM_eu | Math | https://huggingface.co/datasets/HiTZ/MGSM-eu |
| PIQA_eu | Question Answering | https://huggingface.co/datasets/HiTZ/PIQA-eu |
| WNLI_eu | Natural Language Inference | https://huggingface.co/datasets/HiTZ/wnli-eu |
| WNLI_eu | Natural Language Inference | https://huggingface.co/datasets/HiTZ/wnli-eu |
| XCOPA_eu | Commonsense Reasoning | https://huggingface.co/datasets/HiTZ/XCOPA-eu |
| XCOPA_eu | Commonsense Reasoning | https://huggingface.co/datasets/HiTZ/XCOPA-eu |
...
@@ -63,6 +64,7 @@ The following tasks evaluate tasks on BasqueBench dataset using various scoring
...
@@ -63,6 +64,7 @@ The following tasks evaluate tasks on BasqueBench dataset using various scoring
-
`flores_pt-eu`
-
`flores_pt-eu`
-
`mgsm_direct_eu`
-
`mgsm_direct_eu`
-
`mgsm_native_cot_eu`
-
`mgsm_native_cot_eu`
-
`piqa_eu`
-
`qnlieu`
-
`qnlieu`
-
`wnli_eu`
-
`wnli_eu`
-
`xcopa_eu`
-
`xcopa_eu`
...
...
lm_eval/tasks/basque_bench/basque_bench.yaml
View file @
f49b0377
...
@@ -14,5 +14,6 @@ task:
...
@@ -14,5 +14,6 @@ task:
-
xcopa_eu
-
xcopa_eu
-
mgsm_direct_eu
-
mgsm_direct_eu
-
mgsm_native_cot_eu
-
mgsm_native_cot_eu
-
piqa_eu
metadata
:
metadata
:
version
:
1.0
version
:
1.0
lm_eval/tasks/basque_bench/piqa_eu.yaml
0 → 100755
View file @
f49b0377
task
:
piqa_eu
dataset_path
:
HiTZ/PIQA-eu
dataset_name
:
null
output_type
:
multiple_choice
training_split
:
null
validation_split
:
validation
test_split
:
null
doc_to_text
:
"
Galdera:
{{goal}}
\n
Erantzuna:"
doc_to_target
:
label
doc_to_choice
:
"
{{[sol1,
sol2]}}"
should_decontaminate
:
true
doc_to_decontamination_query
:
goal
metric_list
:
-
metric
:
acc
aggregation
:
mean
higher_is_better
:
true
-
metric
:
acc_norm
aggregation
:
mean
higher_is_better
:
true
metadata
:
version
:
1.0
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment