gaoqiong / lm-evaluation-harness
Commit 121b7096
authored May 02, 2022 by Fabrizio Milo
add pre-commit
parent 7a038118
Changes 732
Showing 20 changed files with 20 additions and 20 deletions
tests/testdata/cb-v1-loglikelihood +1 -1
tests/testdata/cb-v1-res.json +1 -1
tests/testdata/cola-v0-loglikelihood +1 -1
tests/testdata/cola-v0-res.json +1 -1
tests/testdata/copa-v0-loglikelihood +1 -1
tests/testdata/copa-v0-res.json +1 -1
tests/testdata/coqa-v0-greedy_until +1 -1
tests/testdata/coqa-v0-res.json +1 -1
tests/testdata/coqa-v1-greedy_until +1 -1
tests/testdata/coqa-v1-res.json +1 -1
tests/testdata/cycle_letters-v0-greedy_until +1 -1
tests/testdata/cycle_letters-v0-res.json +1 -1
tests/testdata/drop-v0-greedy_until +1 -1
tests/testdata/drop-v0-res.json +1 -1
tests/testdata/drop-v1-greedy_until +1 -1
tests/testdata/drop-v1-res.json +1 -1
tests/testdata/ethics_cm-v0-loglikelihood +1 -1
tests/testdata/ethics_cm-v0-res.json +1 -1
tests/testdata/ethics_deontology-v0-loglikelihood +1 -1
tests/testdata/ethics_deontology-v0-res.json +1 -1
tests/testdata/cb-v1-loglikelihood
-77b11f4348eb8a7f57faf95c531fda01ab4bf0e729f91a82451ed8e71ec8e66d
\ No newline at end of file
+77b11f4348eb8a7f57faf95c531fda01ab4bf0e729f91a82451ed8e71ec8e66d
tests/testdata/cb-v1-res.json
-{"results": {"cb": {"acc": 0.3392857142857143, "acc_stderr": 0.06384226561930825, "f1": 0.2819143819143819}}, "versions": {"cb": 1}}
\ No newline at end of file
+{"results": {"cb": {"acc": 0.3392857142857143, "acc_stderr": 0.06384226561930825, "f1": 0.2819143819143819}}, "versions": {"cb": 1}}
tests/testdata/cola-v0-loglikelihood
-e8635578ed8ee70b707a666d35e468b9321db24470f80c92080651e2bfa01751
\ No newline at end of file
+e8635578ed8ee70b707a666d35e468b9321db24470f80c92080651e2bfa01751
tests/testdata/cola-v0-res.json
-{"results": {"cola": {"mcc": -0.04538802810223175, "mcc_stderr": 0.023100371589225246}}, "versions": {"cola": 0}}
\ No newline at end of file
+{"results": {"cola": {"mcc": -0.04538802810223175, "mcc_stderr": 0.023100371589225246}}, "versions": {"cola": 0}}
tests/testdata/copa-v0-loglikelihood
-66276b9045b5300cba4b81340db06f674f031fa0b8883714ad0d03be464cd799
\ No newline at end of file
+66276b9045b5300cba4b81340db06f674f031fa0b8883714ad0d03be464cd799
tests/testdata/copa-v0-res.json
-{"results": {"copa": {"acc": 0.48, "acc_stderr": 0.050211673156867795}}, "versions": {"copa": 0}}
\ No newline at end of file
+{"results": {"copa": {"acc": 0.48, "acc_stderr": 0.050211673156867795}}, "versions": {"copa": 0}}
tests/testdata/coqa-v0-greedy_until
-4a8605d5deed0423ec095700251ed93325b45d320aca35d4ce1e94702094435e
\ No newline at end of file
+4a8605d5deed0423ec095700251ed93325b45d320aca35d4ce1e94702094435e
tests/testdata/coqa-v0-res.json
-{"results": {"coqa": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"coqa": 0}}
\ No newline at end of file
+{"results": {"coqa": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"coqa": 0}}
tests/testdata/coqa-v1-greedy_until
-57581470b921435d40da97872bb1cfda6ecf963ccc4b0240a3b04e3fea8c8e3a
\ No newline at end of file
+57581470b921435d40da97872bb1cfda6ecf963ccc4b0240a3b04e3fea8c8e3a
tests/testdata/coqa-v1-res.json
-{"results": {"coqa": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"coqa": 1}}
\ No newline at end of file
+{"results": {"coqa": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"coqa": 1}}
tests/testdata/cycle_letters-v0-greedy_until
-eb23f7d5de7528eefd8ed5f8054c402ff947319cccfef7195995946f99389201
\ No newline at end of file
+eb23f7d5de7528eefd8ed5f8054c402ff947319cccfef7195995946f99389201
tests/testdata/cycle_letters-v0-res.json
-{"results": {"cycle_letters": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"cycle_letters": 0}}
\ No newline at end of file
+{"results": {"cycle_letters": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"cycle_letters": 0}}
tests/testdata/drop-v0-greedy_until
-ca566c630d8ac853d5785d4b5c40a5137172c34b48af3350e1f79e6d548b36ba
\ No newline at end of file
+ca566c630d8ac853d5785d4b5c40a5137172c34b48af3350e1f79e6d548b36ba
tests/testdata/drop-v0-res.json
-{"results": {"drop": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"drop": 0}}
\ No newline at end of file
+{"results": {"drop": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"drop": 0}}
tests/testdata/drop-v1-greedy_until
-a670f911ab2999d72db15f534b22703d19e7837edbda4f9f199ad587f7aae6b2
\ No newline at end of file
+a670f911ab2999d72db15f534b22703d19e7837edbda4f9f199ad587f7aae6b2
tests/testdata/drop-v1-res.json
-{"results": {"drop": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"drop": 1}}
\ No newline at end of file
+{"results": {"drop": {"em": 0.0, "em_stderr": 0.0, "f1": 0.0, "f1_stderr": 0.0}}, "versions": {"drop": 1}}
tests/testdata/ethics_cm-v0-loglikelihood
-92d136ebb2bd86cd036e61699ad9a1417dbb48651f0a3afa5045cf57cef5a3f6
\ No newline at end of file
+92d136ebb2bd86cd036e61699ad9a1417dbb48651f0a3afa5045cf57cef5a3f6
tests/testdata/ethics_cm-v0-res.json
-{"results": {"ethics_cm": {"acc": 0.49987129987129986, "acc_stderr": 0.008022881531793336}}, "versions": {"ethics_cm": 0}}
\ No newline at end of file
+{"results": {"ethics_cm": {"acc": 0.49987129987129986, "acc_stderr": 0.008022881531793336}}, "versions": {"ethics_cm": 0}}
tests/testdata/ethics_deontology-v0-loglikelihood
-74ecebe322457d70afc16fde848978410a09b854dc65c47f428d100bd1593248
\ No newline at end of file
+74ecebe322457d70afc16fde848978410a09b854dc65c47f428d100bd1593248
tests/testdata/ethics_deontology-v0-res.json
-{"results": {"ethics_deontology": {"acc": 0.503615127919911, "acc_stderr": 0.008338908432085105, "em": 0.07119021134593993}}, "versions": {"ethics_deontology": 0}}
\ No newline at end of file
+{"results": {"ethics_deontology": {"acc": 0.503615127919911, "acc_stderr": 0.008338908432085105, "em": 0.07119021134593993}}, "versions": {"ethics_deontology": 0}}