gaoqiong / lm-evaluation-harness
Commit c2848879 (unverified)
Authored Sep 21, 2023 by Hailey Schoelkopf; committed by GitHub on Sep 21, 2023
Merge pull request #870 from EleutherAI/lintangsutawika-patch-4
Create cot_yaml
Parents: d8233365, 4a967f26
Showing 12 changed files with 40 additions and 11 deletions (+40, -11)
lm_eval/tasks/mgsm/native_cot/cot_yaml (+29, -0)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_bn.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_de.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_en.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_es.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_fr.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ja.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ru.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_sw.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_te.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_th.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_zh.yaml (+1, -1)
lm_eval/tasks/mgsm/native_cot/cot_yaml (new file, mode 0 → 100644)
# This file will be included in the generated language-specific task configs.
# It doesn't have a yaml file extension as it is not meant to be imported directly
# by the harness.
group: mgsm_cot_native
dataset_path: juletxara/mgsm
dataset_name: null  # Overridden by language-specific config.
output_type: greedy_until
training_split: train
test_split: test
target_delimiter: ""
generation_kwargs:
  until:
    - "\n\n"
    - "\n"
  do_sample: false
  temperature: 0.0
target_delimiter: " "
metric_list:
  - metric: exact_match
    aggregation: mean
    higher_is_better: true
    ignore_case: true
    ignore_punctuation: true
filter_list:
  - name: "get-answer"
    filter:
      - function: "regex"
        regex_pattern: "The answer is (\\-?[0-9\\.\\,]+)"
      - function: "take_first"
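The `get-answer` filter above chains a regex step with a `take_first` step. The following is a minimal Python sketch of that idea, not the harness's own implementation: the helper names `extract_answers` and `take_first` and the `"[invalid]"` fallback are assumptions made for illustration. Only the pattern itself comes from `cot_yaml`.

```python
import re
from typing import List

# Pattern taken from `regex_pattern` in cot_yaml, with the YAML escaping undone.
ANSWER_PATTERN = re.compile(r"The answer is (\-?[0-9\.\,]+)")


def extract_answers(response: str) -> List[str]:
    """Return every captured number-like string, in order (hypothetical helper)."""
    return ANSWER_PATTERN.findall(response)


def take_first(candidates: List[str], fallback: str = "[invalid]") -> str:
    """Keep the first candidate; the fallback value is an assumption."""
    return candidates[0] if candidates else fallback


response = "Roger has 5 balls and buys 6 more. 5 + 6 = 11. The answer is 11."
print(repr(take_first(extract_answers(response))))
```

Because the character class includes `.` and `,`, a sentence-final period is captured along with the digits, so a step such as `ignore_punctuation: true` in the metric is still needed to compare `11.` with `11`.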
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_bn.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[16+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nধাপে ধাপে উত্তর:"}}{% else %}{{"প্রশ্ন: "+question+"\nধাপে ধাপে উত্তর:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_bn_direct
+task: mgsm_bn_native_cot
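Each language file pulls in the shared settings through `include: cot_yaml` and then sets its own `task` name. The sketch below illustrates one plausible way such an include could be resolved. It assumes a shallow merge in which keys from the including file win, which this diff does not show. The function `merge_included` and the `dataset_name` value `"bn"` are hypothetical.

```python
def merge_included(base: dict, override: dict) -> dict:
    """Shallow merge: start from the included config, then apply the
    language-specific keys on top (assumed precedence, not confirmed here)."""
    merged = dict(base)
    merged.update(override)
    return merged


# A few keys from cot_yaml, written as a Python dict.
base_config = {
    "group": "mgsm_cot_native",
    "dataset_path": "juletxara/mgsm",
    "dataset_name": None,  # "Overridden by language-specific config."
    "output_type": "greedy_until",
}

# Keys a language-specific file might set; "bn" is an illustrative assumption.
bn_config = {
    "dataset_name": "bn",
    "task": "mgsm_bn_native_cot",
}

resolved = merge_included(base_config, bn_config)
print(resolved["task"], resolved["dataset_path"], resolved["dataset_name"])
```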
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_de.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[28+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nSchritt-für-Schritt-Antwort:"}}{% else %}{{"Frage: "+question+"\nSchritt-für-Schritt-Antwort:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_de_direct
+task: mgsm_de_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_en.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[20+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nStep-by-Step Answer:"}}{% else %}{{"Question: "+question+"\nStep-by-Step Answer:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_en_direct
+task: mgsm_en_native_cot
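The `doc_to_text` template in these files branches on whether `answer` is present. As a rough illustration, the English template can be mirrored in plain Python as below. The function name `build_prompt` is hypothetical, and the sketch reproduces only the branching visible in the template string, not the harness's Jinja rendering.

```python
from typing import Optional

ANSWER_CUE = "\nStep-by-Step Answer:"


def build_prompt(question: str, answer: Optional[str]) -> str:
    """Mirror the English doc_to_text template (hypothetical helper)."""
    if answer is not None:
        # Documents that carry a worked answer use the question text as-is.
        return question + ANSWER_CUE
    # Otherwise the template adds the "Question: " prefix itself.
    return "Question: " + question + ANSWER_CUE


print(build_prompt("How many apples are left?", None))
```

The other language files follow the same shape with translated prefix and cue strings.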
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_es.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[22+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nRespuesta paso a paso:"}}{% else %}{{"Pregunta: "+question+"\nRespuesta paso a paso:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_es_direct
+task: mgsm_es_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_fr.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[25+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nRéponse étape par étape :"}}{% else %}{{"Question : "+question+"\nRéponse étape par étape :"}}{% endif %}'
 include: cot_yaml
-task: mgsm_fr_direct
+task: mgsm_fr_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ja.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[10+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nステップごとの答え:"}}{% else %}{{"問題: "+question+"\nステップごとの答え:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_ja_direct
+task: mgsm_ja_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ru.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[17+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nПошаговоерешение:"}}{% else %}{{"Задача: "+question+"\nПошаговоерешение:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_ru_direct
+task: mgsm_ru_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_sw.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[24+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nJibu la Hatua kwa Hatua:"}}{% else %}{{"Swali: "+question+"\nJibu la Hatua kwa Hatua:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_sw_direct
+task: mgsm_sw_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_te.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[18+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nదశలవారీగా సమాధానం:"}}{% else %}{{"ప్రశ్న: "+question+"\nదశలవారీగా సమాధానం:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_te_direct
+task: mgsm_te_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_th.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[17+1]}}{% else %}{{answer_nu
 doc_to_text: '{% if answer is not none %}{{question+"\nคำตอบทีละขั้นตอน:"}}{% else %}{{"โจทย์: "+question+"\nคำตอบทีละขั้นตอน:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_th_direct
+task: mgsm_th_native_cot
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_zh.yaml
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[5+1]}}{% else %}{{answer_num
 doc_to_text: '{% if answer is not none %}{{question+"\n逐步解答:"}}{% else %}{{"问题: "+question+"\n逐步解答:"}}{% endif %}'
 include: cot_yaml
-task: mgsm_zh_direct
+task: mgsm_zh_native_cot