Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
fd279089
Commit
fd279089
authored
Sep 06, 2023
by
lintangsutawika
Browse files
tidy up
parent
9f4682a3
Changes
25
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
38 additions
and
18 deletions
+38
-18
lm_eval/tasks/mgsm/README.md
lm_eval/tasks/mgsm/README.md
+26
-6
lm_eval/tasks/mgsm/direct/direct_yaml
lm_eval/tasks/mgsm/direct/direct_yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_bn.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_bn.yaml
+0
-0
lm_eval/tasks/mgsm/direct/mgsm_direct_de.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_de.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_en.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_en.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_es.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_es.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_fr.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_fr.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_ja.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_ja.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_ru.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_ru.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_sw.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_sw.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_te.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_te.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_th.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_th.yaml
+1
-1
lm_eval/tasks/mgsm/direct/mgsm_direct_zh.yaml
lm_eval/tasks/mgsm/direct/mgsm_direct_zh.yaml
+1
-1
lm_eval/tasks/mgsm/native_cot/cot_yaml
lm_eval/tasks/mgsm/native_cot/cot_yaml
+1
-1
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_bn.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_bn.yaml
+0
-0
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_de.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_de.yaml
+0
-0
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_en.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_en.yaml
+0
-0
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_es.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_es.yaml
+0
-0
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_fr.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_fr.yaml
+0
-0
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ja.yaml
lm_eval/tasks/mgsm/native_cot/mgsm_cot_native_ja.yaml
+0
-0
No files found.
lm_eval/tasks/mgsm/README.md
View file @
fd279089
...
@@ -53,12 +53,32 @@ Homepage: https://github.com/google-research/url-nlp/tree/main/mgsm
...
@@ -53,12 +53,32 @@ Homepage: https://github.com/google-research/url-nlp/tree/main/mgsm
#### Groups
#### Groups
*
`mgsm`
*
`mgsm_direct`
: Direct question
*
`mgsm_direct_bn`
: Bengali
#### Tasks
*
`mgsm_direct_de`
: German
*
`mgsm_direct_en`
: English
*
`task_name`
:
`1-sentence description of what this particular task does`
*
`mgsm_direct_es`
: Spanish
*
`task_name2`
: ...
*
`mgsm_direct_fr`
: French
*
`mgsm_direct_ja`
: Japanese
*
`mgsm_direct_ru`
: Russian
*
`mgsm_direct_sw`
: Swahili
*
`mgsm_direct_te`
: Telugu
*
`mgsm_direct_th`
: Thai
*
`mgsm_direct_zh`
: Chinese
*
`mgsm_cot_native`
: Question with Answer followed by CoT prompt in the same language as the dataset.
*
`mgsm_cot_native_bn`
: Bengali
*
`mgsm_cot_native_de`
: German
*
`mgsm_cot_native_en`
: English
*
`mgsm_cot_native_es`
: Spanish
*
`mgsm_cot_native_fr`
: French
*
`mgsm_cot_native_ja`
: Japanese
*
`mgsm_cot_native_ru`
: Russian
*
`mgsm_cot_native_sw`
: Swahili
*
`mgsm_cot_native_te`
: Telugu
*
`mgsm_cot_native_th`
: Thai
*
`mgsm_cot_native_zh`
: Chinese
Examplar Samples: https://github.com/google-research/url-nlp/blob/main/mgsm/exemplars.py
### Checklist
### Checklist
...
...
lm_eval/tasks/mgsm/direct_yaml
→
lm_eval/tasks/mgsm/direct
/direct
_yaml
View file @
fd279089
# This file will be included in the generated language-specific task configs.
# This file will be included in the generated language-specific task configs.
# It doesn't have a yaml file extension as it is not meant to be imported directly
# It doesn't have a yaml file extension as it is not meant to be imported directly
# by the harness.
# by the harness.
group: mgsm
group: mgsm
_direct
dataset_path: juletxara/mgsm
dataset_path: juletxara/mgsm
dataset_name: null # Overridden by language-specific config.
dataset_name: null # Overridden by language-specific config.
output_type: greedy_until
output_type: greedy_until
...
...
lm_eval/tasks/mgsm/mgsm_
bn_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_bn
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
de_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_de
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[7+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[7+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAntwort"}}{%
else
%}{{"Frage:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAntwort"}}{%
else
%}{{"Frage:
"+question+"\nAntwort"}}{%
endif
%}'
"+question+"\nAntwort"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
de_
direct
task
:
mgsm_direct
_de
lm_eval/tasks/mgsm/mgsm_
en_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_en
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Question:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Question:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
en_
direct
task
:
mgsm_direct
_en
lm_eval/tasks/mgsm/mgsm_
es_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_es
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Pregunta:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Pregunta:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
es_
direct
task
:
mgsm_direct
_es
lm_eval/tasks/mgsm/mgsm_
fr_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_fr
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Question
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Question
:
"+question+"\nAnswer"}}{%
endif
%}'
:
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
fr_
direct
task
:
mgsm_direct
_fr
lm_eval/tasks/mgsm/mgsm_
ja_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_ja
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"問題:
"+question+"\nAnswer"}}{%
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"問題:
"+question+"\nAnswer"}}{%
endif
%}'
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
ja_
direct
task
:
mgsm_direct
_ja
lm_eval/tasks/mgsm/mgsm_
ru_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_ru
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Задача:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Задача:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
ru_
direct
task
:
mgsm_direct
_ru
lm_eval/tasks/mgsm/mgsm_
sw_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_sw
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Swali:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"Swali:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
sw_
direct
task
:
mgsm_direct
_sw
lm_eval/tasks/mgsm/mgsm_
te_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_te
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"ప్రశ్న:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"ప్రశ్న:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
te_
direct
task
:
mgsm_direct
_te
lm_eval/tasks/mgsm/mgsm_
th_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_th
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"โจทย์:
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"โจทย์:
"+question+"\nAnswer"}}{%
endif
%}'
"+question+"\nAnswer"}}{%
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
th_
direct
task
:
mgsm_direct
_th
lm_eval/tasks/mgsm/mgsm_
zh_
direct.yaml
→
lm_eval/tasks/mgsm/
direct/
mgsm_direct
_zh
.yaml
View file @
fd279089
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
...
@@ -5,4 +5,4 @@ doc_to_target: '{% if answer is not none %}{{answer[6+1]}}{% else %}{{answer_num
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"问题:
"+question+"\nAnswer"}}{%
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer"}}{%
else
%}{{"问题:
"+question+"\nAnswer"}}{%
endif
%}'
endif
%}'
include
:
direct_yaml
include
:
direct_yaml
task
:
mgsm_
zh_
direct
task
:
mgsm_direct
_zh
lm_eval/tasks/mgsm/cot_yaml
→
lm_eval/tasks/mgsm/
native_cot/
cot_yaml
View file @
fd279089
# This file will be included in the generated language-specific task configs.
# This file will be included in the generated language-specific task configs.
# It doesn't have a yaml file extension as it is not meant to be imported directly
# It doesn't have a yaml file extension as it is not meant to be imported directly
# by the harness.
# by the harness.
group: mgsm
group: mgsm
_cot_native
dataset_path: juletxara/mgsm
dataset_path: juletxara/mgsm
dataset_name: null # Overridden by language-specific config.
dataset_name: null # Overridden by language-specific config.
output_type: greedy_until
output_type: greedy_until
...
...
lm_eval/tasks/mgsm/mgsm_
bn
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_bn
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
de
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_de
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
en
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_en
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
es
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_es
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
fr
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_fr
.yaml
View file @
fd279089
File moved
lm_eval/tasks/mgsm/mgsm_
ja
_native
-cot
.yaml
→
lm_eval/tasks/mgsm/
native_cot/
mgsm_
cot
_native
_ja
.yaml
View file @
fd279089
File moved
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment