Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
95360bc2
Commit
95360bc2
authored
Aug 15, 2023
by
lintangsutawika
Browse files
Merge branch 'big-refactor' of
https://github.com/EleutherAI/lm-evaluation-harness
into add-readme
parents
545fb8fc
30aa9c33
Changes
137
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
87 additions
and
0 deletions
+87
-0
lm_eval/tasks/blimp/principle_A_case_2.yaml
lm_eval/tasks/blimp/principle_A_case_2.yaml
+4
-0
lm_eval/tasks/blimp/principle_A_domain_1.yaml
lm_eval/tasks/blimp/principle_A_domain_1.yaml
+4
-0
lm_eval/tasks/blimp/principle_A_domain_2.yaml
lm_eval/tasks/blimp/principle_A_domain_2.yaml
+4
-0
lm_eval/tasks/blimp/principle_A_domain_3.yaml
lm_eval/tasks/blimp/principle_A_domain_3.yaml
+4
-0
lm_eval/tasks/blimp/principle_A_reconstruction.yaml
lm_eval/tasks/blimp/principle_A_reconstruction.yaml
+4
-0
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_1.yaml
.../tasks/blimp/regular_plural_subject_verb_agreement_1.yaml
+4
-0
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_2.yaml
.../tasks/blimp/regular_plural_subject_verb_agreement_2.yaml
+4
-0
lm_eval/tasks/blimp/sentential_negation_npi_licensor_present.yaml
...tasks/blimp/sentential_negation_npi_licensor_present.yaml
+4
-0
lm_eval/tasks/blimp/sentential_negation_npi_scope.yaml
lm_eval/tasks/blimp/sentential_negation_npi_scope.yaml
+4
-0
lm_eval/tasks/blimp/sentential_subject_island.yaml
lm_eval/tasks/blimp/sentential_subject_island.yaml
+4
-0
lm_eval/tasks/blimp/superlative_quantifiers_1.yaml
lm_eval/tasks/blimp/superlative_quantifiers_1.yaml
+4
-0
lm_eval/tasks/blimp/superlative_quantifiers_2.yaml
lm_eval/tasks/blimp/superlative_quantifiers_2.yaml
+4
-0
lm_eval/tasks/blimp/template_yaml
lm_eval/tasks/blimp/template_yaml
+11
-0
lm_eval/tasks/blimp/tough_vs_raising_1.yaml
lm_eval/tasks/blimp/tough_vs_raising_1.yaml
+4
-0
lm_eval/tasks/blimp/tough_vs_raising_2.yaml
lm_eval/tasks/blimp/tough_vs_raising_2.yaml
+4
-0
lm_eval/tasks/blimp/transitive.yaml
lm_eval/tasks/blimp/transitive.yaml
+4
-0
lm_eval/tasks/blimp/wh_island.yaml
lm_eval/tasks/blimp/wh_island.yaml
+4
-0
lm_eval/tasks/blimp/wh_questions_object_gap.yaml
lm_eval/tasks/blimp/wh_questions_object_gap.yaml
+4
-0
lm_eval/tasks/blimp/wh_questions_subject_gap.yaml
lm_eval/tasks/blimp/wh_questions_subject_gap.yaml
+4
-0
lm_eval/tasks/blimp/wh_questions_subject_gap_long_distance.yaml
...l/tasks/blimp/wh_questions_subject_gap_long_distance.yaml
+4
-0
No files found.
lm_eval/tasks/blimp/principle_A_case_2.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
principle_A_case_2
include
:
template_yaml
task
:
blimp_principle_A_case_2
lm_eval/tasks/blimp/principle_A_domain_1.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
principle_A_domain_1
include
:
template_yaml
task
:
blimp_principle_A_domain_1
lm_eval/tasks/blimp/principle_A_domain_2.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
principle_A_domain_2
include
:
template_yaml
task
:
blimp_principle_A_domain_2
lm_eval/tasks/blimp/principle_A_domain_3.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
principle_A_domain_3
include
:
template_yaml
task
:
blimp_principle_A_domain_3
lm_eval/tasks/blimp/principle_A_reconstruction.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
principle_A_reconstruction
include
:
template_yaml
task
:
blimp_principle_A_reconstruction
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_1.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
regular_plural_subject_verb_agreement_1
include
:
template_yaml
task
:
blimp_regular_plural_subject_verb_agreement_1
lm_eval/tasks/blimp/regular_plural_subject_verb_agreement_2.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
regular_plural_subject_verb_agreement_2
include
:
template_yaml
task
:
blimp_regular_plural_subject_verb_agreement_2
lm_eval/tasks/blimp/sentential_negation_npi_licensor_present.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
sentential_negation_npi_licensor_present
include
:
template_yaml
task
:
blimp_sentential_negation_npi_licensor_present
lm_eval/tasks/blimp/sentential_negation_npi_scope.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
sentential_negation_npi_scope
include
:
template_yaml
task
:
blimp_sentential_negation_npi_scope
lm_eval/tasks/blimp/sentential_subject_island.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
sentential_subject_island
include
:
template_yaml
task
:
blimp_sentential_subject_island
lm_eval/tasks/blimp/superlative_quantifiers_1.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
superlative_quantifiers_1
include
:
template_yaml
task
:
blimp_superlative_quantifiers_1
lm_eval/tasks/blimp/superlative_quantifiers_2.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
superlative_quantifiers_2
include
:
template_yaml
task
:
blimp_superlative_quantifiers_2
lm_eval/tasks/blimp/template_yaml
0 → 100644
View file @
95360bc2
group: blimp
dataset_path: blimp
output_type: multiple_choice
validation_split: train
doc_to_text: ""
doc_to_target: 0
doc_to_choice: "{{[sentence_good, sentence_bad]}}"
should_decontaminate: true
doc_to_decontamination_query: "{{sentence_good}} {{sentence_bad}}"
metric_list:
- metric: acc
lm_eval/tasks/blimp/tough_vs_raising_1.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
tough_vs_raising_1
include
:
template_yaml
task
:
blimp_tough_vs_raising_1
lm_eval/tasks/blimp/tough_vs_raising_2.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
tough_vs_raising_2
include
:
template_yaml
task
:
blimp_tough_vs_raising_2
lm_eval/tasks/blimp/transitive.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
transitive
include
:
template_yaml
task
:
blimp_transitive
lm_eval/tasks/blimp/wh_island.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
wh_island
include
:
template_yaml
task
:
blimp_wh_island
lm_eval/tasks/blimp/wh_questions_object_gap.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
wh_questions_object_gap
include
:
template_yaml
task
:
blimp_wh_questions_object_gap
lm_eval/tasks/blimp/wh_questions_subject_gap.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
wh_questions_subject_gap
include
:
template_yaml
task
:
blimp_wh_questions_subject_gap
lm_eval/tasks/blimp/wh_questions_subject_gap_long_distance.yaml
0 → 100644
View file @
95360bc2
# Generated by utils.py
dataset_name
:
wh_questions_subject_gap_long_distance
include
:
template_yaml
task
:
blimp_wh_questions_subject_gap_long_distance
Prev
1
2
3
4
5
6
7
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment