Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
c25f6a31
Commit
c25f6a31
authored
Nov 14, 2023
by
lintangsutawika
Browse files
alternative prompts for arithmetic
parent
5e4f1799
Changes
58
Hide whitespace changes
Inline
Side-by-side
Showing
18 changed files
with
108 additions
and
0 deletions
+108
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3da.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_3da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
...rithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/_template_05_yaml
.../arithmetic/alternative_worlds/style_05/_template_05_yaml
+15
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
...rithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
+4
-0
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
+29
-0
No files found.
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_3da_alt_04
dataset_name
:
arithmetic_3da
task_alias
:
3da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_3ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_3ds_alt_04
dataset_name
:
arithmetic_3ds
task_alias
:
3ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_4da_alt_04
dataset_name
:
arithmetic_4da
task_alias
:
4da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_4ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_4ds_alt_04
dataset_name
:
arithmetic_4ds
task_alias
:
4ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_5da_alt_04
dataset_name
:
arithmetic_5da
task_alias
:
5da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_04/arithmetic_5ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_04_yaml
task
:
arithmetic_5ds_alt_04
dataset_name
:
arithmetic_5ds
task_alias
:
5ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/_template_05_yaml
0 → 100644
View file @
c25f6a31
include: ../_template_yaml
group: arithmetic_alt_05
group_alias: arithmetic (Style 05)
dataset_path: EleutherAI/arithmetic
output_type: loglikelihood
validation_split: validation
test_split: null
doc_to_text: !function ../utils.style_05
doc_to_target: "{{completion}}"
metric_list:
- metric: acc
aggregation: mean
higher_is_better: true
- metric: brier_score
higher_is_better: false
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_1dc.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_1dc_alt_05
dataset_name
:
arithmetic_1dc
task_alias
:
1dc
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_2da_alt_05
dataset_name
:
arithmetic_2da
task_alias
:
2da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2dm.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_2dm_alt_05
dataset_name
:
arithmetic_2dm
task_alias
:
2dm
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_2ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_2ds_alt_05
dataset_name
:
arithmetic_2ds
task_alias
:
2ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_3da_alt_05
dataset_name
:
arithmetic_3da
task_alias
:
3da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_3ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_3ds_alt_05
dataset_name
:
arithmetic_3ds
task_alias
:
3ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_4da_alt_05
dataset_name
:
arithmetic_4da
task_alias
:
4da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_4ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_4ds_alt_05
dataset_name
:
arithmetic_4ds
task_alias
:
4ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5da.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_5da_alt_05
dataset_name
:
arithmetic_5da
task_alias
:
5da
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/style_05/arithmetic_5ds.yaml
0 → 100644
View file @
c25f6a31
include
:
_template_05_yaml
task
:
arithmetic_5ds_alt_05
dataset_name
:
arithmetic_5ds
task_alias
:
5ds
\ No newline at end of file
lm_eval/tasks/arithmetic/alternative_worlds/utils.py
0 → 100644
View file @
c25f6a31
import
re
# Original Prompt
# Question: What is (9 + 8) * 2? Answer:
def
style_01
(
docs
):
# What is (9 + 8) * 2?
return
docs
[
"context"
].
replace
(
"Question: "
,
""
).
replace
(
" Answer:"
,
""
)
def
style_02
(
docs
):
# Q: What is (9 + 8) * 2? A:
return
docs
[
"context"
].
replace
(
"Question: "
,
"Q: "
).
replace
(
" Answer:"
,
" A:"
)
def
style_03
(
docs
):
# Solve (9 + 8) * 2.
return
docs
[
"context"
].
replace
(
"Question: What is"
,
"Solve"
).
replace
(
" Answer:"
,
"."
)
def
style_04
(
docs
):
# (9 + 8) * 2 =
return
docs
[
"context"
].
replace
(
"Question: What is "
,
""
).
replace
(
" Answer:"
,
" ="
)
def
style_05
(
docs
):
# What is (9 + 8) * 2? Answer:
return
docs
[
"context"
].
replace
(
"Question: "
,
""
)
\ No newline at end of file
Prev
1
2
3
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment