Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
5f48dfc2
Commit
5f48dfc2
authored
Dec 21, 2021
by
Igor Ostrovsky
Browse files
Add BLiMP
parent
df5d7cf0
Changes
138
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
0 deletions
+20
-0
tests/tests/testdata/blimp_coordinate_structure_constraint_object_extraction-v0-loglikelihood
...e_structure_constraint_object_extraction-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_coordinate_structure_constraint_object_extraction-v0-res.json
...dinate_structure_constraint_object_extraction-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_1-v0-loglikelihood
...stdata/blimp_determiner_noun_agreement_1-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_1-v0-res.json
...ts/testdata/blimp_determiner_noun_agreement_1-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_2-v0-loglikelihood
...stdata/blimp_determiner_noun_agreement_2-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_2-v0-res.json
...ts/testdata/blimp_determiner_noun_agreement_2-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_1-v0-loglikelihood
...mp_determiner_noun_agreement_irregular_1-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_1-v0-res.json
...a/blimp_determiner_noun_agreement_irregular_1-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_2-v0-loglikelihood
...mp_determiner_noun_agreement_irregular_2-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_2-v0-res.json
...a/blimp_determiner_noun_agreement_irregular_2-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_2-v0-loglikelihood
...imp_determiner_noun_agreement_with_adj_2-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_2-v0-res.json
...ta/blimp_determiner_noun_agreement_with_adj_2-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_1-v0-loglikelihood
...iner_noun_agreement_with_adj_irregular_1-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_1-v0-res.json
...eterminer_noun_agreement_with_adj_irregular_1-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_2-v0-loglikelihood
...iner_noun_agreement_with_adj_irregular_2-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_2-v0-res.json
...eterminer_noun_agreement_with_adj_irregular_2-v0-res.json
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adjective_1-v0-loglikelihood
...terminer_noun_agreement_with_adjective_1-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_determiner_noun_agreement_with_adjective_1-v0-res.json
...mp_determiner_noun_agreement_with_adjective_1-v0-res.json
+1
-0
tests/tests/testdata/blimp_distractor_agreement_relational_noun-v0-loglikelihood
...imp_distractor_agreement_relational_noun-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_distractor_agreement_relational_noun-v0-res.json
...ta/blimp_distractor_agreement_relational_noun-v0-res.json
+1
-0
No files found.
tests/tests/testdata/blimp_coordinate_structure_constraint_object_extraction-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
23ddafdff7b1ebe331b146e23b2c21aa109fe57aa1ce8ca201a0d239fcbdd166
\ No newline at end of file
tests/tests/testdata/blimp_coordinate_structure_constraint_object_extraction-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_coordinate_structure_constraint_object_extraction"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_coordinate_structure_constraint_object_extraction"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_1-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
2df8cc7f17089f7e8c7d974dcb324c809d30ef059a5be22aed6b69f44230809f
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_1-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_1"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_1"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_2-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
123e2acd00fbba60aba1fbae607c79a062e512c9e79c7d8dfafff63e30111d76
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_2-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_2"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_2"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_1-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
7fab9f02e71a224ae7931aa77f8a9a61d887a7480756adc965d4746e97fb04a5
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_1-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_irregular_1"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_irregular_1"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_2-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
ddb24ddfaebe076b3aa7107937d71bf5f4503a78283bc889e39200368603681e
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_irregular_2-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_irregular_2"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_irregular_2"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_2-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
95acb74fac7d57ae2c9d208361a5f8ad36b0b19a055f02e648ed8e99505f4b43
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_2-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_with_adj_2"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_with_adj_2"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_1-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
ad61c619aa79433d02f1aeacde2ab87291fd5d5c370032c24d41c4f0065ed1f9
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_1-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_with_adj_irregular_1"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_with_adj_irregular_1"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_2-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
ccc64b4d5e80c081d5161aae5828212ba49d277ca8c5a4281f181744727a6a99
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adj_irregular_2-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_with_adj_irregular_2"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_with_adj_irregular_2"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adjective_1-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
007c47e5fbf88119c5180feef75e1345d448e56adcd4c7ab2d52fb8d67350d34
\ No newline at end of file
tests/tests/testdata/blimp_determiner_noun_agreement_with_adjective_1-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_determiner_noun_agreement_with_adjective_1"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_determiner_noun_agreement_with_adjective_1"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_distractor_agreement_relational_noun-v0-loglikelihood
0 → 100644
View file @
5f48dfc2
8aab641bd5933f84f46a14f5c1208a3c855cace7e67b44abcd5aff8fec96717d
\ No newline at end of file
tests/tests/testdata/blimp_distractor_agreement_relational_noun-v0-res.json
0 → 100644
View file @
5f48dfc2
{
"results"
:
{
"blimp_distractor_agreement_relational_noun"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_distractor_agreement_relational_noun"
:
0
}}
\ No newline at end of file
Prev
1
2
3
4
5
6
7
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment