gaoqiong / lm-evaluation-harness
Commit a5a9017b, authored Aug 10, 2023 by lintangsutawika
Merge branch 'big-refactor' of https://github.com/EleutherAI/lm-evaluation-harness into xstorycloze
Parents: a04a600a, 7634a6ec
Changes: 71
Showing 20 changed files with 87 additions and 0 deletions
lm_eval/tasks/blimp/npi_present_2.yaml +4 -0
lm_eval/tasks/blimp/only_npi_licensor_present.yaml +4 -0
lm_eval/tasks/blimp/only_npi_scope.yaml +4 -0
lm_eval/tasks/blimp/passive_1.yaml +4 -0
lm_eval/tasks/blimp/passive_2.yaml +4 -0
lm_eval/tasks/blimp/principle_A_c_command.yaml +4 -0
lm_eval/tasks/blimp/principle_A_case_1.yaml +4 -0
lm_eval/tasks/blimp/principle_A_case_2.yaml +4 -0
lm_eval/tasks/blimp/principle_A_domain_1.yaml +4 -0
lm_eval/tasks/blimp/principle_A_domain_2.yaml +4 -0
lm_eval/tasks/blimp/principle_A_domain_3.yaml +4 -0
lm_eval/tasks/blimp/principle_A_reconstruction.yaml +4 -0
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_1.yaml +4 -0
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_2.yaml +4 -0
lm_eval/tasks/blimp/sentential_negation_npi_licensor_present.yaml +4 -0
lm_eval/tasks/blimp/sentential_negation_npi_scope.yaml +4 -0
lm_eval/tasks/blimp/sentential_subject_island.yaml +4 -0
lm_eval/tasks/blimp/superlative_quantifiers_1.yaml +4 -0
lm_eval/tasks/blimp/superlative_quantifiers_2.yaml +4 -0
lm_eval/tasks/blimp/template_yaml +11 -0
lm_eval/tasks/blimp/npi_present_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: npi_present_2
include: template_yaml
task: blimp_npi_present_2
lm_eval/tasks/blimp/only_npi_licensor_present.yaml
0 → 100644
# Generated by utils.py
dataset_name: only_npi_licensor_present
include: template_yaml
task: blimp_only_npi_licensor_present
lm_eval/tasks/blimp/only_npi_scope.yaml
0 → 100644
# Generated by utils.py
dataset_name: only_npi_scope
include: template_yaml
task: blimp_only_npi_scope
lm_eval/tasks/blimp/passive_1.yaml
0 → 100644
# Generated by utils.py
dataset_name: passive_1
include: template_yaml
task: blimp_passive_1
lm_eval/tasks/blimp/passive_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: passive_2
include: template_yaml
task: blimp_passive_2
lm_eval/tasks/blimp/principle_A_c_command.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_c_command
include: template_yaml
task: blimp_principle_A_c_command
lm_eval/tasks/blimp/principle_A_case_1.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_case_1
include: template_yaml
task: blimp_principle_A_case_1
lm_eval/tasks/blimp/principle_A_case_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_case_2
include: template_yaml
task: blimp_principle_A_case_2
lm_eval/tasks/blimp/principle_A_domain_1.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_domain_1
include: template_yaml
task: blimp_principle_A_domain_1
lm_eval/tasks/blimp/principle_A_domain_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_domain_2
include: template_yaml
task: blimp_principle_A_domain_2
lm_eval/tasks/blimp/principle_A_domain_3.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_domain_3
include: template_yaml
task: blimp_principle_A_domain_3
lm_eval/tasks/blimp/principle_A_reconstruction.yaml
0 → 100644
# Generated by utils.py
dataset_name: principle_A_reconstruction
include: template_yaml
task: blimp_principle_A_reconstruction
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_1.yaml
0 → 100644
# Generated by utils.py
dataset_name: regular_plural_subject_verb_agreement_1
include: template_yaml
task: blimp_regular_plural_subject_verb_agreement_1
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: regular_plural_subject_verb_agreement_2
include: template_yaml
task: blimp_regular_plural_subject_verb_agreement_2
lm_eval/tasks/blimp/sentential_negation_npi_licensor_present.yaml
0 → 100644
# Generated by utils.py
dataset_name: sentential_negation_npi_licensor_present
include: template_yaml
task: blimp_sentential_negation_npi_licensor_present
lm_eval/tasks/blimp/sentential_negation_npi_scope.yaml
0 → 100644
# Generated by utils.py
dataset_name: sentential_negation_npi_scope
include: template_yaml
task: blimp_sentential_negation_npi_scope
lm_eval/tasks/blimp/sentential_subject_island.yaml
0 → 100644
# Generated by utils.py
dataset_name: sentential_subject_island
include: template_yaml
task: blimp_sentential_subject_island
lm_eval/tasks/blimp/superlative_quantifiers_1.yaml
0 → 100644
# Generated by utils.py
dataset_name: superlative_quantifiers_1
include: template_yaml
task: blimp_superlative_quantifiers_1
lm_eval/tasks/blimp/superlative_quantifiers_2.yaml
0 → 100644
# Generated by utils.py
dataset_name: superlative_quantifiers_2
include: template_yaml
task: blimp_superlative_quantifiers_2
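The per-task files above all share the same four-line shape and differ only in the BLiMP subset name. The `utils.py` that produced them is not part of this diff, so the following is only a minimal sketch of how such files could be generated; `render_task_config` and `write_task_configs` are hypothetical names, not functions from the repository.

```python
from pathlib import Path
import tempfile


def render_task_config(dataset_name: str) -> str:
    """Return the four-line YAML body seen in the generated BLiMP task files.

    Hypothetical helper: the real utils.py is not shown in this commit view.
    """
    return (
        "# Generated by utils.py\n"
        f"dataset_name: {dataset_name}\n"
        "include: template_yaml\n"
        f"task: blimp_{dataset_name}\n"
    )


def write_task_configs(dataset_names, out_dir):
    """Write one <dataset_name>.yaml file per subset and return the paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for name in dataset_names:
        path = out_dir / f"{name}.yaml"
        path.write_text(render_task_config(name), encoding="utf-8")
        paths.append(path)
    return paths


if __name__ == "__main__":
    # Write two of the subsets from this commit into a scratch directory.
    with tempfile.TemporaryDirectory() as tmp:
        for p in write_task_configs(["passive_1", "passive_2"], tmp):
            print(p.name)
            print(p.read_text(encoding="utf-8"))
```

Keeping the shared settings in `template_yaml` and generating only the subset-specific keys means a change to the prompt or metric needs to be made in one place.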
lm_eval/tasks/blimp/template_yaml
0 → 100644
group: blimp
dataset_path: blimp
output_type: multiple_choice
validation_split: validation
doc_to_text: ""
doc_to_target: 0
doc_to_choice: "{{[sentence_good, sentence_bad]}}"
should_decontaminate: true
doc_to_decontamination_query: "{{sentence_good}} {{sentence_bad}}"
metric_list:
- metric: acc
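Read together, these settings suggest that each item offers two choices, `sentence_good` and `sentence_bad`, with index 0 (`doc_to_target: 0`) as the correct one and an empty prompt (`doc_to_text: ""`), and that `acc` is the fraction of items where the good sentence wins. The sketch below illustrates that reading only; the harness's own scoring code is not shown in this diff, and `score_pair` and `accuracy` are hypothetical helpers that take precomputed log-likelihoods rather than calling a model.

```python
def score_pair(loglikelihoods, target_index=0):
    """Return 1.0 if the highest-scoring choice is the target, else 0.0.

    `loglikelihoods` holds one score per choice, in the order given by
    doc_to_choice: [sentence_good, sentence_bad]. On a tie, max() keeps the
    first index, which is an artifact of this sketch.
    """
    predicted = max(range(len(loglikelihoods)), key=lambda i: loglikelihoods[i])
    return 1.0 if predicted == target_index else 0.0


def accuracy(scored_docs, target_index=0):
    """Average score_pair over a list of per-document score lists."""
    if not scored_docs:
        raise ValueError("need at least one scored document")
    total = sum(score_pair(scores, target_index) for scores in scored_docs)
    return total / len(scored_docs)


if __name__ == "__main__":
    # Made-up scores for illustration: [score(sentence_good), score(sentence_bad)].
    toy_scores = [
        [-12.5, -14.0],  # good sentence scored higher: counted correct
        [-20.0, -18.5],  # bad sentence scored higher: counted wrong
    ]
    print(accuracy(toy_scores))
```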