Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
8c997e53
Commit
8c997e53
authored
May 03, 2022
by
jon-tow
Browse files
Revert `tests/testdata` changes and address flake8 issues
parent
d95a4333
Changes
627
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/blimp_wh_vs_that_with_gap-v0-res.json
tests/testdata/blimp_wh_vs_that_with_gap-v0-res.json
+1
-1
tests/testdata/blimp_wh_vs_that_with_gap_long_distance-v0-loglikelihood
.../blimp_wh_vs_that_with_gap_long_distance-v0-loglikelihood
+1
-1
tests/testdata/blimp_wh_vs_that_with_gap_long_distance-v0-res.json
...tdata/blimp_wh_vs_that_with_gap_long_distance-v0-res.json
+1
-1
tests/testdata/boolq-v0-loglikelihood
tests/testdata/boolq-v0-loglikelihood
+1
-1
tests/testdata/boolq-v0-res.json
tests/testdata/boolq-v0-res.json
+1
-1
tests/testdata/boolq-v1-loglikelihood
tests/testdata/boolq-v1-loglikelihood
+1
-1
tests/testdata/boolq-v1-res.json
tests/testdata/boolq-v1-res.json
+1
-1
tests/testdata/cb-v0-loglikelihood
tests/testdata/cb-v0-loglikelihood
+1
-1
tests/testdata/cb-v0-res.json
tests/testdata/cb-v0-res.json
+1
-1
tests/testdata/cb-v1-loglikelihood
tests/testdata/cb-v1-loglikelihood
+1
-1
tests/testdata/cb-v1-res.json
tests/testdata/cb-v1-res.json
+1
-1
tests/testdata/cola-v0-loglikelihood
tests/testdata/cola-v0-loglikelihood
+1
-1
tests/testdata/cola-v0-res.json
tests/testdata/cola-v0-res.json
+1
-1
tests/testdata/copa-v0-loglikelihood
tests/testdata/copa-v0-loglikelihood
+1
-1
tests/testdata/copa-v0-res.json
tests/testdata/copa-v0-res.json
+1
-1
tests/testdata/coqa-v0-greedy_until
tests/testdata/coqa-v0-greedy_until
+1
-1
tests/testdata/coqa-v0-res.json
tests/testdata/coqa-v0-res.json
+1
-1
tests/testdata/coqa-v1-greedy_until
tests/testdata/coqa-v1-greedy_until
+1
-1
tests/testdata/coqa-v1-res.json
tests/testdata/coqa-v1-res.json
+1
-1
tests/testdata/cycle_letters-v0-greedy_until
tests/testdata/cycle_letters-v0-greedy_until
+1
-1
No files found.
tests/
tests/
testdata/blimp_wh_vs_that_with_gap-v0-res.json
→
tests/testdata/blimp_wh_vs_that_with_gap-v0-res.json
View file @
8c997e53
{
"results"
:
{
"blimp_wh_vs_that_with_gap"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_wh_vs_that_with_gap"
:
0
}}
{
"results"
:
{
"blimp_wh_vs_that_with_gap"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_wh_vs_that_with_gap"
:
0
}}
\ No newline at end of file
tests/
tests/
testdata/blimp_wh_vs_that_with_gap_long_distance-v0-loglikelihood
→
tests/testdata/blimp_wh_vs_that_with_gap_long_distance-v0-loglikelihood
View file @
8c997e53
eed67491bdf493a1dad8f1d9766bc7bd0e79946365b833c0f7eb81ac998e3dca
eed67491bdf493a1dad8f1d9766bc7bd0e79946365b833c0f7eb81ac998e3dca
\ No newline at end of file
tests/
tests/
testdata/blimp_wh_vs_that_with_gap_long_distance-v0-res.json
→
tests/testdata/blimp_wh_vs_that_with_gap_long_distance-v0-res.json
View file @
8c997e53
{
"results"
:
{
"blimp_wh_vs_that_with_gap_long_distance"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_wh_vs_that_with_gap_long_distance"
:
0
}}
{
"results"
:
{
"blimp_wh_vs_that_with_gap_long_distance"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_wh_vs_that_with_gap_long_distance"
:
0
}}
\ No newline at end of file
tests/testdata/boolq-v0-loglikelihood
View file @
8c997e53
de5aa6f77a2e0fd050b9c272f10c4d5d5581e4f75ffa60926f79e60ae1738960
de5aa6f77a2e0fd050b9c272f10c4d5d5581e4f75ffa60926f79e60ae1738960
\ No newline at end of file
tests/testdata/boolq-v0-res.json
View file @
8c997e53
{
"results"
:
{
"boolq"
:
{
"acc"
:
0.5048929663608562
,
"acc_stderr"
:
0.00874463623355505
}},
"versions"
:
{
"boolq"
:
0
}}
{
"results"
:
{
"boolq"
:
{
"acc"
:
0.5048929663608562
,
"acc_stderr"
:
0.00874463623355505
}},
"versions"
:
{
"boolq"
:
0
}}
\ No newline at end of file
tests/testdata/boolq-v1-loglikelihood
View file @
8c997e53
6577e0d88572772ef08e64f624c0e3df0953286ae1f118ccef15623b59ffeabf
6577e0d88572772ef08e64f624c0e3df0953286ae1f118ccef15623b59ffeabf
\ No newline at end of file
tests/testdata/boolq-v1-res.json
View file @
8c997e53
{
"results"
:
{
"boolq"
:
{
"acc"
:
0.5048929663608562
,
"acc_stderr"
:
0.00874463623355505
}},
"versions"
:
{
"boolq"
:
1
}}
{
"results"
:
{
"boolq"
:
{
"acc"
:
0.5048929663608562
,
"acc_stderr"
:
0.00874463623355505
}},
"versions"
:
{
"boolq"
:
1
}}
\ No newline at end of file
tests/testdata/cb-v0-loglikelihood
View file @
8c997e53
ec3b1bbb9561e39c43c6f77a23b4060b15c606141c5346e3d0791b3e92aaa5d0
ec3b1bbb9561e39c43c6f77a23b4060b15c606141c5346e3d0791b3e92aaa5d0
\ No newline at end of file
tests/testdata/cb-v0-res.json
View file @
8c997e53
{
"results"
:
{
"cb"
:
{
"acc"
:
0.3392857142857143
,
"acc_stderr"
:
0.06384226561930825
,
"f1"
:
0.2819143819143819
}},
"versions"
:
{
"cb"
:
0
}}
{
"results"
:
{
"cb"
:
{
"acc"
:
0.3392857142857143
,
"acc_stderr"
:
0.06384226561930825
,
"f1"
:
0.2819143819143819
}},
"versions"
:
{
"cb"
:
0
}}
\ No newline at end of file
tests/testdata/cb-v1-loglikelihood
View file @
8c997e53
77b11f4348eb8a7f57faf95c531fda01ab4bf0e729f91a82451ed8e71ec8e66d
77b11f4348eb8a7f57faf95c531fda01ab4bf0e729f91a82451ed8e71ec8e66d
\ No newline at end of file
tests/testdata/cb-v1-res.json
View file @
8c997e53
{
"results"
:
{
"cb"
:
{
"acc"
:
0.3392857142857143
,
"acc_stderr"
:
0.06384226561930825
,
"f1"
:
0.2819143819143819
}},
"versions"
:
{
"cb"
:
1
}}
{
"results"
:
{
"cb"
:
{
"acc"
:
0.3392857142857143
,
"acc_stderr"
:
0.06384226561930825
,
"f1"
:
0.2819143819143819
}},
"versions"
:
{
"cb"
:
1
}}
\ No newline at end of file
tests/testdata/cola-v0-loglikelihood
View file @
8c997e53
e8635578ed8ee70b707a666d35e468b9321db24470f80c92080651e2bfa01751
e8635578ed8ee70b707a666d35e468b9321db24470f80c92080651e2bfa01751
\ No newline at end of file
tests/testdata/cola-v0-res.json
View file @
8c997e53
{
"results"
:
{
"cola"
:
{
"mcc"
:
-0.04538802810223175
,
"mcc_stderr"
:
0.023100371589225246
}},
"versions"
:
{
"cola"
:
0
}}
{
"results"
:
{
"cola"
:
{
"mcc"
:
-0.04538802810223175
,
"mcc_stderr"
:
0.023100371589225246
}},
"versions"
:
{
"cola"
:
0
}}
\ No newline at end of file
tests/testdata/copa-v0-loglikelihood
View file @
8c997e53
66276b9045b5300cba4b81340db06f674f031fa0b8883714ad0d03be464cd799
66276b9045b5300cba4b81340db06f674f031fa0b8883714ad0d03be464cd799
\ No newline at end of file
tests/testdata/copa-v0-res.json
View file @
8c997e53
{
"results"
:
{
"copa"
:
{
"acc"
:
0.48
,
"acc_stderr"
:
0.050211673156867795
}},
"versions"
:
{
"copa"
:
0
}}
{
"results"
:
{
"copa"
:
{
"acc"
:
0.48
,
"acc_stderr"
:
0.050211673156867795
}},
"versions"
:
{
"copa"
:
0
}}
\ No newline at end of file
tests/testdata/coqa-v0-greedy_until
View file @
8c997e53
4a8605d5deed0423ec095700251ed93325b45d320aca35d4ce1e94702094435e
4a8605d5deed0423ec095700251ed93325b45d320aca35d4ce1e94702094435e
\ No newline at end of file
tests/testdata/coqa-v0-res.json
View file @
8c997e53
{
"results"
:
{
"coqa"
:
{
"em"
:
0.0
,
"em_stderr"
:
0.0
,
"f1"
:
0.0
,
"f1_stderr"
:
0.0
}},
"versions"
:
{
"coqa"
:
0
}}
{
"results"
:
{
"coqa"
:
{
"em"
:
0.0
,
"em_stderr"
:
0.0
,
"f1"
:
0.0
,
"f1_stderr"
:
0.0
}},
"versions"
:
{
"coqa"
:
0
}}
\ No newline at end of file
tests/testdata/coqa-v1-greedy_until
View file @
8c997e53
57581470b921435d40da97872bb1cfda6ecf963ccc4b0240a3b04e3fea8c8e3a
57581470b921435d40da97872bb1cfda6ecf963ccc4b0240a3b04e3fea8c8e3a
\ No newline at end of file
tests/testdata/coqa-v1-res.json
View file @
8c997e53
{
"results"
:
{
"coqa"
:
{
"em"
:
0.0
,
"em_stderr"
:
0.0
,
"f1"
:
0.0
,
"f1_stderr"
:
0.0
}},
"versions"
:
{
"coqa"
:
1
}}
{
"results"
:
{
"coqa"
:
{
"em"
:
0.0
,
"em_stderr"
:
0.0
,
"f1"
:
0.0
,
"f1_stderr"
:
0.0
}},
"versions"
:
{
"coqa"
:
1
}}
\ No newline at end of file
tests/testdata/cycle_letters-v0-greedy_until
View file @
8c997e53
eb23f7d5de7528eefd8ed5f8054c402ff947319cccfef7195995946f99389201
eb23f7d5de7528eefd8ed5f8054c402ff947319cccfef7195995946f99389201
\ No newline at end of file
Prev
1
…
6
7
8
9
10
11
12
13
14
…
32
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment