Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
c74e2761
Commit
c74e2761
authored
Dec 06, 2023
by
lintangsutawika
Browse files
reformat
parent
cc572624
Changes
155
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
31 additions
and
24 deletions
+31
-24
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/_template_05_yaml
.../arithmetic/alternative_worlds/style_05/_template_05_yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
+1
-1
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
+9
-2
lm_eval/tasks/hellaswag/alternative_worlds/README.md
lm_eval/tasks/hellaswag/alternative_worlds/README.md
+2
-2
lm_eval/tasks/hellaswag/alternative_worlds/style_01/a.yaml
lm_eval/tasks/hellaswag/alternative_worlds/style_01/a.yaml
+2
-2
lm_eval/tasks/hellaswag/alternative_worlds/style_01/b.yaml
lm_eval/tasks/hellaswag/alternative_worlds/style_01/b.yaml
+2
-2
No files found.
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
View file @
c74e2761
include
:
_template_04_yaml
include
:
_template_04_yaml
task
:
arithmetic_3ds_alt_04
task
:
arithmetic_3ds_alt_04
dataset_name
:
arithmetic_3ds
dataset_name
:
arithmetic_3ds
task_alias
:
3ds
task_alias
:
3ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
View file @
c74e2761
include
:
_template_04_yaml
include
:
_template_04_yaml
task
:
arithmetic_4da_alt_04
task
:
arithmetic_4da_alt_04
dataset_name
:
arithmetic_4da
dataset_name
:
arithmetic_4da
task_alias
:
4da
task_alias
:
4da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
View file @
c74e2761
include
:
_template_04_yaml
include
:
_template_04_yaml
task
:
arithmetic_4ds_alt_04
task
:
arithmetic_4ds_alt_04
dataset_name
:
arithmetic_4ds
dataset_name
:
arithmetic_4ds
task_alias
:
4ds
task_alias
:
4ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
View file @
c74e2761
include
:
_template_04_yaml
include
:
_template_04_yaml
task
:
arithmetic_5da_alt_04
task
:
arithmetic_5da_alt_04
dataset_name
:
arithmetic_5da
dataset_name
:
arithmetic_5da
task_alias
:
5da
task_alias
:
5da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
View file @
c74e2761
include
:
_template_04_yaml
include
:
_template_04_yaml
task
:
arithmetic_5ds_alt_04
task
:
arithmetic_5ds_alt_04
dataset_name
:
arithmetic_5ds
dataset_name
:
arithmetic_5ds
task_alias
:
5ds
task_alias
:
5ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/_template_05_yaml
View file @
c74e2761
...
@@ -12,4 +12,4 @@ metric_list:
...
@@ -12,4 +12,4 @@ metric_list:
aggregation: mean
aggregation: mean
higher_is_better: true
higher_is_better: true
- metric: brier_score
- metric: brier_score
higher_is_better: false
higher_is_better: false
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_1dc_alt_05
task
:
arithmetic_1dc_alt_05
dataset_name
:
arithmetic_1dc
dataset_name
:
arithmetic_1dc
task_alias
:
1dc
task_alias
:
1dc
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_2da_alt_05
task
:
arithmetic_2da_alt_05
dataset_name
:
arithmetic_2da
dataset_name
:
arithmetic_2da
task_alias
:
2da
task_alias
:
2da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_2dm_alt_05
task
:
arithmetic_2dm_alt_05
dataset_name
:
arithmetic_2dm
dataset_name
:
arithmetic_2dm
task_alias
:
2dm
task_alias
:
2dm
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_2ds_alt_05
task
:
arithmetic_2ds_alt_05
dataset_name
:
arithmetic_2ds
dataset_name
:
arithmetic_2ds
task_alias
:
2ds
task_alias
:
2ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_3da_alt_05
task
:
arithmetic_3da_alt_05
dataset_name
:
arithmetic_3da
dataset_name
:
arithmetic_3da
task_alias
:
3da
task_alias
:
3da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_3ds_alt_05
task
:
arithmetic_3ds_alt_05
dataset_name
:
arithmetic_3ds
dataset_name
:
arithmetic_3ds
task_alias
:
3ds
task_alias
:
3ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_4da_alt_05
task
:
arithmetic_4da_alt_05
dataset_name
:
arithmetic_4da
dataset_name
:
arithmetic_4da
task_alias
:
4da
task_alias
:
4da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_4ds_alt_05
task
:
arithmetic_4ds_alt_05
dataset_name
:
arithmetic_4ds
dataset_name
:
arithmetic_4ds
task_alias
:
4ds
task_alias
:
4ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_5da_alt_05
task
:
arithmetic_5da_alt_05
dataset_name
:
arithmetic_5da
dataset_name
:
arithmetic_5da
task_alias
:
5da
task_alias
:
5da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
View file @
c74e2761
include
:
_template_05_yaml
include
:
_template_05_yaml
task
:
arithmetic_5ds_alt_05
task
:
arithmetic_5ds_alt_05
dataset_name
:
arithmetic_5ds
dataset_name
:
arithmetic_5ds
task_alias
:
5ds
task_alias
:
5ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
View file @
c74e2761
...
@@ -7,27 +7,34 @@ def style_00(docs):
...
@@ -7,27 +7,34 @@ def style_00(docs):
# What is (9 + 8) * 2?
# What is (9 + 8) * 2?
return
docs
[
"context"
]
return
docs
[
"context"
]
def
style_01
(
docs
):
def
style_01
(
docs
):
# What is (9 + 8) * 2?
# What is (9 + 8) * 2?
return
docs
[
"context"
].
replace
(
"Question: "
,
""
).
replace
(
" Answer:"
,
""
)
return
docs
[
"context"
].
replace
(
"Question: "
,
""
).
replace
(
" Answer:"
,
""
)
def
style_02
(
docs
):
def
style_02
(
docs
):
# Q: What is (9 + 8) * 2? A:
# Q: What is (9 + 8) * 2? A:
return
docs
[
"context"
].
replace
(
"Question: "
,
"Q: "
).
replace
(
" Answer:"
,
" A:"
)
return
docs
[
"context"
].
replace
(
"Question: "
,
"Q: "
).
replace
(
" Answer:"
,
" A:"
)
def
style_03
(
docs
):
def
style_03
(
docs
):
# Solve (9 + 8) * 2.
# Solve (9 + 8) * 2.
return
docs
[
"context"
].
replace
(
"Question: What is"
,
"Solve"
).
replace
(
" Answer:"
,
"."
)
return
(
docs
[
"context"
].
replace
(
"Question: What is"
,
"Solve"
).
replace
(
" Answer:"
,
"."
)
)
def
style_04
(
docs
):
def
style_04
(
docs
):
# (9 + 8) * 2 =
# (9 + 8) * 2 =
return
docs
[
"context"
].
replace
(
"Question: What is "
,
""
).
replace
(
" Answer:"
,
" ="
)
return
docs
[
"context"
].
replace
(
"Question: What is "
,
""
).
replace
(
" Answer:"
,
" ="
)
def
style_05
(
docs
):
def
style_05
(
docs
):
# What is (9 + 8) * 2? Answer:
# What is (9 + 8) * 2? Answer:
return
docs
[
"context"
].
replace
(
"Question: "
,
""
)
return
docs
[
"context"
].
replace
(
"Question: "
,
""
)
\ No newline at end of file
lm_eval/tasks/hellaswag/alternative_worlds/README.md
View file @
c74e2761
...
@@ -15,6 +15,6 @@ Answer types:
...
@@ -15,6 +15,6 @@ Answer types:
-
original option
-
original option
-
just letter
-
just letter
-
letters + continuation
-
letters + continuation
-
original option
-
original option
-
just letter
-
just letter
-
continuation
-
continuation
\ No newline at end of file
lm_eval/tasks/hellaswag/alternative_worlds/style_01/a.yaml
View file @
c74e2761
include
:
../_hellaswag_alt_yaml
include
:
../_hellaswag_alt_yaml
group
:
hellaswag_01
group
:
hellaswag_01
group_alias
:
style_01
group_alias
:
style_01
task
:
hellaswag_01a
task
:
hellaswag_01a
task_alias
:
a
task_alias
:
a
doc_to_text
:
!function
../styles.template_01
doc_to_text
:
!function
../styles.template_01
doc_to_choice
:
!function
../styles.choice_01a
doc_to_choice
:
!function
../styles.choice_01a
\ No newline at end of file
lm_eval/tasks/hellaswag/alternative_worlds/style_01/b.yaml
View file @
c74e2761
include
:
../_hellaswag_alt_yaml
include
:
../_hellaswag_alt_yaml
group
:
hellaswag_01
group
:
hellaswag_01
group_alias
:
style_01
group_alias
:
style_01
task
:
hellaswag_01b
task
:
hellaswag_01b
task_alias
:
b
task_alias
:
b
doc_to_text
:
!function
../styles.template_01
doc_to_text
:
!function
../styles.template_01
doc_to_choice
:
!function
../styles.choice_01b
doc_to_choice
:
!function
../styles.choice_01b
\ No newline at end of file
Prev
1
2
3
4
5
6
7
8
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment