Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
835cc40e
Commit
835cc40e
authored
Dec 06, 2023
by
lintangsutawika
Browse files
merged latest and added altworld files
parents
8da401e0
c9bbec6e
Changes
430
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
23 additions
and
18 deletions
+23
-18
lm_eval/tasks/bigbench/multiple_choice_template_yaml
lm_eval/tasks/bigbench/multiple_choice_template_yaml
+2
-0
lm_eval/tasks/blimp/_template_yaml
lm_eval/tasks/blimp/_template_yaml
+3
-0
lm_eval/tasks/blimp/adjunct_island.yaml
lm_eval/tasks/blimp/adjunct_island.yaml
+1
-1
lm_eval/tasks/blimp/anaphor_gender_agreement.yaml
lm_eval/tasks/blimp/anaphor_gender_agreement.yaml
+1
-1
lm_eval/tasks/blimp/anaphor_number_agreement.yaml
lm_eval/tasks/blimp/anaphor_number_agreement.yaml
+1
-1
lm_eval/tasks/blimp/animate_subject_passive.yaml
lm_eval/tasks/blimp/animate_subject_passive.yaml
+1
-1
lm_eval/tasks/blimp/animate_subject_trans.yaml
lm_eval/tasks/blimp/animate_subject_trans.yaml
+1
-1
lm_eval/tasks/blimp/causative.yaml
lm_eval/tasks/blimp/causative.yaml
+1
-1
lm_eval/tasks/blimp/complex_NP_island.yaml
lm_eval/tasks/blimp/complex_NP_island.yaml
+1
-1
lm_eval/tasks/blimp/coordinate_structure_constraint_complex_left_branch.yaml
.../coordinate_structure_constraint_complex_left_branch.yaml
+1
-1
lm_eval/tasks/blimp/coordinate_structure_constraint_object_extraction.yaml
...mp/coordinate_structure_constraint_object_extraction.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_1.yaml
lm_eval/tasks/blimp/determiner_noun_agreement_1.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_2.yaml
lm_eval/tasks/blimp/determiner_noun_agreement_2.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_irregular_1.yaml
...al/tasks/blimp/determiner_noun_agreement_irregular_1.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_irregular_2.yaml
...al/tasks/blimp/determiner_noun_agreement_irregular_2.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_2.yaml
...val/tasks/blimp/determiner_noun_agreement_with_adj_2.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_irregular_1.yaml
...blimp/determiner_noun_agreement_with_adj_irregular_1.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_irregular_2.yaml
...blimp/determiner_noun_agreement_with_adj_irregular_2.yaml
+1
-1
lm_eval/tasks/blimp/determiner_noun_agreement_with_adjective_1.yaml
...sks/blimp/determiner_noun_agreement_with_adjective_1.yaml
+1
-1
lm_eval/tasks/blimp/distractor_agreement_relational_noun.yaml
...val/tasks/blimp/distractor_agreement_relational_noun.yaml
+1
-1
No files found.
lm_eval/tasks/bigbench/multiple_choice_template_yaml
View file @
835cc40e
...
...
@@ -11,3 +11,5 @@ doc_to_choice: "{{multiple_choice_targets}}"
metric_list:
- metric: acc
# TODO: brier score and other metrics
metadata:
- version: 0.0
lm_eval/tasks/blimp/template_yaml
→
lm_eval/tasks/blimp/
_
template_yaml
View file @
835cc40e
...
...
@@ -5,7 +5,10 @@ validation_split: train
doc_to_text: ""
doc_to_target: 0
doc_to_choice: "{{[sentence_good, sentence_bad]}}"
num_fewshot: 0
should_decontaminate: true
doc_to_decontamination_query: "{{sentence_good}} {{sentence_bad}}"
metric_list:
- metric: acc
metadata:
- version: 1.0
lm_eval/tasks/blimp/adjunct_island.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
adjunct_island
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_adjunct_island
lm_eval/tasks/blimp/anaphor_gender_agreement.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
anaphor_gender_agreement
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_anaphor_gender_agreement
lm_eval/tasks/blimp/anaphor_number_agreement.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
anaphor_number_agreement
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_anaphor_number_agreement
lm_eval/tasks/blimp/animate_subject_passive.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
animate_subject_passive
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_animate_subject_passive
lm_eval/tasks/blimp/animate_subject_trans.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
animate_subject_trans
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_animate_subject_trans
lm_eval/tasks/blimp/causative.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
causative
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_causative
lm_eval/tasks/blimp/complex_NP_island.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
complex_NP_island
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_complex_NP_island
lm_eval/tasks/blimp/coordinate_structure_constraint_complex_left_branch.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
coordinate_structure_constraint_complex_left_branch
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_coordinate_structure_constraint_complex_left_branch
lm_eval/tasks/blimp/coordinate_structure_constraint_object_extraction.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
coordinate_structure_constraint_object_extraction
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_coordinate_structure_constraint_object_extraction
lm_eval/tasks/blimp/determiner_noun_agreement_1.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_1
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_1
lm_eval/tasks/blimp/determiner_noun_agreement_2.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_2
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_2
lm_eval/tasks/blimp/determiner_noun_agreement_irregular_1.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_irregular_1
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_irregular_1
lm_eval/tasks/blimp/determiner_noun_agreement_irregular_2.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_irregular_2
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_irregular_2
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_2.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_with_adj_2
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_with_adj_2
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_irregular_1.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_with_adj_irregular_1
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_with_adj_irregular_1
lm_eval/tasks/blimp/determiner_noun_agreement_with_adj_irregular_2.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_with_adj_irregular_2
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_with_adj_irregular_2
lm_eval/tasks/blimp/determiner_noun_agreement_with_adjective_1.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
determiner_noun_agreement_with_adjective_1
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_determiner_noun_agreement_with_adjective_1
lm_eval/tasks/blimp/distractor_agreement_relational_noun.yaml
View file @
835cc40e
# Generated by utils.py
dataset_name
:
distractor_agreement_relational_noun
include
:
template_yaml
include
:
_
template_yaml
task
:
blimp_distractor_agreement_relational_noun
Prev
1
…
6
7
8
9
10
11
12
13
14
…
22
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment