gaoqiong / lm-evaluation-harness
Commit 6f4f9e1c, authored Dec 13, 2023 by lintangsutawika
resolved merge conflict
Parents: 0d5748b7, aed90773
Changes: 145
Showing 20 changed files with 20 additions and 20 deletions (+20 -20)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_04/b.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_04/c.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/a.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/b.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/c.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/a.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/b.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/c.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/a.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/b.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/c.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/a.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/b.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/c.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/prompt_variation/style_02.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/prompt_variation/style_03.yaml (+1 -1)
lm_eval/tasks/bbh/alternative_worlds/README.md (+1 -1)
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/_zeroshot_template_yaml (+1 -1)
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/formal_fallacies.yaml (+1 -1)
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/movie_recommendation.yaml (+1 -1)
lm_eval/tasks/arc/alternative_worlds/output_variation/style_04/b.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_04
task: arc_easy_alt_ov_04b
doc_to_text: !function ../styles.template_04
doc_to_choice: !function ../styles.choice_04b
doc_to_decontamination_query: !function ../styles.template_04
\ No newline at end of file
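The `\ No newline at end of file` marker means that one side of this one-line change lacks a trailing newline; the page does not make clear which side. A minimal Python sketch for checking a file locally follows. The helper names `ends_with_newline` and `write_and_check` are hypothetical and are not part of lm-evaluation-harness.

```python
import os
import tempfile


def ends_with_newline(path: str) -> bool:
    """Return True if the file's last byte is a newline (False for empty files)."""
    with open(path, "rb") as handle:
        return handle.read().endswith(b"\n")


def write_and_check(content: bytes) -> bool:
    """Write content to a temporary file and report whether it ends with a newline.

    Illustrative helper only: it exists so the check can be tried without
    touching real task configs.
    """
    fd, path = tempfile.mkstemp(suffix=".yaml")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(content)
        return ends_with_newline(path)
    finally:
        os.remove(path)
```

Calling `ends_with_newline` on each changed YAML file before and after the commit would show which side carried the missing newline.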
lm_eval/tasks/arc/alternative_worlds/output_variation/style_04/c.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_04
task: arc_easy_alt_ov_04c
doc_to_text: !function ../styles.template_04
doc_to_choice: !function ../styles.choice_04c
doc_to_decontamination_query: !function ../styles.template_04
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/a.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_05
task: arc_easy_alt_ov_05a
doc_to_text: !function ../styles.template_05
doc_to_choice: !function ../styles.choice_05a
doc_to_decontamination_query: !function ../styles.template_05
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/b.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_05
task: arc_easy_alt_ov_05b
doc_to_text: !function ../styles.template_05
doc_to_choice: !function ../styles.choice_05b
doc_to_decontamination_query: !function ../styles.template_05
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_05/c.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_05
task: arc_easy_alt_ov_05c
doc_to_text: !function ../styles.template_05
doc_to_choice: !function ../styles.choice_05c
doc_to_decontamination_query: !function ../styles.template_05
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/a.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_06
task: arc_easy_alt_ov_06a
doc_to_text: !function ../styles.template_06
doc_to_choice: !function ../styles.choice_06a
doc_to_decontamination_query: !function ../styles.template_06
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/b.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_06
task: arc_easy_alt_ov_06b
doc_to_text: !function ../styles.template_06
doc_to_choice: !function ../styles.choice_06b
doc_to_decontamination_query: !function ../styles.template_06
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_06/c.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_06
task: arc_easy_alt_ov_06c
doc_to_text: !function ../styles.template_06
doc_to_choice: !function ../styles.choice_06c
doc_to_decontamination_query: !function ../styles.template_06
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/a.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_07
task: arc_easy_alt_ov_07a
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07a
doc_to_decontamination_query: !function ../styles.template_07
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/b.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_07
task: arc_easy_alt_ov_07b
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07b
doc_to_decontamination_query: !function ../styles.template_07
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_07/c.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_07
task: arc_easy_alt_ov_07c
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07c
doc_to_decontamination_query: !function ../styles.template_07
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/a.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_08
task: arc_easy_alt_ov_08a
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08a
doc_to_decontamination_query: !function ../styles.template_08
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/b.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_08
task: arc_easy_alt_ov_08b
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08b
doc_to_decontamination_query: !function ../styles.template_08
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/output_variation/style_08/c.yaml
@@ -3,4 +3,4 @@ group: arc_easy_alt_ov_08
task: arc_easy_alt_ov_08c
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08c
doc_to_decontamination_query: !function ../styles.template_08
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/prompt_variation/style_02.yaml
@@ -2,4 +2,4 @@ include: _arc_easy_alt_yaml
group: arc_easy_alt_pv
task: arc_easy_alt_pv_02
doc_to_text: "Q: {{question}}\nA:"
doc_to_decontamination_query: "Q: {{question}}\nA:"
\ No newline at end of file
lm_eval/tasks/arc/alternative_worlds/prompt_variation/style_03.yaml
@@ -2,4 +2,4 @@ include: _arc_easy_alt_yaml
group: arc_easy_alt_pv
task: arc_easy_alt_pv_03
doc_to_text: "Question: {{question}}\nAnswer:"
doc_to_decontamination_query: "Question: {{question}}\nAnswer:"
\ No newline at end of file
lm_eval/tasks/bbh/alternative_worlds/README.md
@@ -31,4 +31,4 @@
Notes:
- `web_of_lies` already starts with `Question: `
- Tasks with options are `Options: (A) ...` (multiple choice) or `Options: - ...` (binary choice)
\ No newline at end of file
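The two option layouts named in the README notes can be told apart mechanically. The following is a minimal Python sketch under the assumption that the text after `Options:` begins either with a parenthesized capital letter or with a dash; the function name `classify_options` and its return labels are hypothetical and do not come from the repository.

```python
import re


def classify_options(prompt: str) -> str:
    """Label a prompt by the layout of its 'Options:' block.

    Returns "multiple_choice" for 'Options: (A) ...', "binary_choice" for
    'Options: - ...', "none" if no 'Options:' marker is present, and
    "unknown" for any other layout.
    """
    match = re.search(r"Options:\s*(.*)", prompt, flags=re.DOTALL)
    if match is None:
        return "none"
    body = match.group(1)
    if re.match(r"\([A-Z]\)", body):
        return "multiple_choice"
    if body.startswith("-"):
        return "binary_choice"
    return "unknown"
```

For example, `classify_options("Options:\n- valid\n- invalid")` falls into the binary-choice branch because the first non-whitespace character after the marker is a dash.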
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/_zeroshot_template_yaml
@@ -9,4 +9,4 @@ num_fewshot: 0
metric_list:
- metric: acc
- metric: acc_norm
- metric: brier_score
\ No newline at end of file
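For orientation, the three metric names in this template can be illustrated on a single multiple-choice example. The sketch below is not the harness's implementation: it assumes per-choice log-likelihoods as input, assumes that `acc_norm` divides each log-likelihood by the character length of its choice, and assumes a Brier score summed over choices after a softmax. All function names are hypothetical.

```python
import math
from typing import Sequence


def softmax(log_likelihoods: Sequence[float]) -> list:
    """Turn log-likelihoods into probabilities that sum to 1."""
    peak = max(log_likelihoods)
    exps = [math.exp(value - peak) for value in log_likelihoods]
    total = sum(exps)
    return [value / total for value in exps]


def argmax(scores: Sequence[float]) -> int:
    """Index of the highest score (first one on ties)."""
    return max(range(len(scores)), key=lambda i: scores[i])


def accuracy(log_likelihoods: Sequence[float], gold: int) -> float:
    """1.0 if the most likely choice is the gold choice, else 0.0."""
    return 1.0 if argmax(log_likelihoods) == gold else 0.0


def length_normalized_accuracy(
    log_likelihoods: Sequence[float], choices: Sequence[str], gold: int
) -> float:
    """Accuracy after dividing each score by its choice's character length (assumption)."""
    scores = [ll / len(choice) for ll, choice in zip(log_likelihoods, choices)]
    return 1.0 if argmax(scores) == gold else 0.0


def brier_score(log_likelihoods: Sequence[float], gold: int) -> float:
    """Squared error between softmax probabilities and the one-hot gold label, summed over choices (assumption)."""
    probs = softmax(log_likelihoods)
    return sum(
        (prob - (1.0 if index == gold else 0.0)) ** 2
        for index, prob in enumerate(probs)
    )
```

With the choices `["valid", "invalid"]`, length normalization can flip the predicted answer when the longer choice has only a slightly lower log-likelihood, which is why `acc` and `acc_norm` are reported separately.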
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/formal_fallacies.yaml
@@ -3,4 +3,4 @@
"include": "_zeroshot_template_yaml"
"task": "bbh_alt_pv_01_zeroshot_formal_fallacies"
"doc_to_target": target
"doc_to_choice": ["valid", "invalid"]
\ No newline at end of file
lm_eval/tasks/bbh/alternative_worlds/prompt_variation/style_01/zeroshot/movie_recommendation.yaml
@@ -2,4 +2,4 @@
"description": "Recommend movies similar to the given list of movies.\n\n"
"include": "_zeroshot_template_yaml"
"task": "bbh_alt_pv_01_zeroshot_movie_recommendation"
"process_docs": !function ../../utils.fix_movie_recommendation
\ No newline at end of file