Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
121b7096
Commit
121b7096
authored
May 02, 2022
by
Fabrizio Milo
Browse files
add pre-commit
parent
7a038118
Changes
732
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/qa4mre_2012-v0-loglikelihood
tests/testdata/qa4mre_2012-v0-loglikelihood
+1
-1
tests/testdata/qa4mre_2012-v0-res.json
tests/testdata/qa4mre_2012-v0-res.json
+1
-1
tests/testdata/qa4mre_2013-v0-loglikelihood
tests/testdata/qa4mre_2013-v0-loglikelihood
+1
-1
tests/testdata/qa4mre_2013-v0-res.json
tests/testdata/qa4mre_2013-v0-res.json
+1
-1
tests/testdata/qnli-v0-loglikelihood
tests/testdata/qnli-v0-loglikelihood
+1
-1
tests/testdata/qnli-v0-res.json
tests/testdata/qnli-v0-res.json
+1
-1
tests/testdata/qqp-v0-loglikelihood
tests/testdata/qqp-v0-loglikelihood
+1
-1
tests/testdata/qqp-v0-res.json
tests/testdata/qqp-v0-res.json
+1
-1
tests/testdata/race-v0-loglikelihood
tests/testdata/race-v0-loglikelihood
+1
-1
tests/testdata/race-v0-res.json
tests/testdata/race-v0-res.json
+1
-1
tests/testdata/random_insertion-v0-greedy_until
tests/testdata/random_insertion-v0-greedy_until
+1
-1
tests/testdata/random_insertion-v0-res.json
tests/testdata/random_insertion-v0-res.json
+1
-1
tests/testdata/record-v0-loglikelihood
tests/testdata/record-v0-loglikelihood
+1
-1
tests/testdata/record-v0-res.json
tests/testdata/record-v0-res.json
+1
-1
tests/testdata/reversed_words-v0-greedy_until
tests/testdata/reversed_words-v0-greedy_until
+1
-1
tests/testdata/reversed_words-v0-res.json
tests/testdata/reversed_words-v0-res.json
+1
-1
tests/testdata/rte-v0-loglikelihood
tests/testdata/rte-v0-loglikelihood
+1
-1
tests/testdata/rte-v0-res.json
tests/testdata/rte-v0-res.json
+1
-1
tests/testdata/sciq-v0-loglikelihood
tests/testdata/sciq-v0-loglikelihood
+1
-1
tests/testdata/sciq-v0-res.json
tests/testdata/sciq-v0-res.json
+1
-1
No files found.
tests/testdata/qa4mre_2012-v0-loglikelihood
View file @
121b7096
7e17261820acb365966cb9431d93aec983b14393eaeefbc96e30a11cf58bc6df
\ No newline at end of file
7e17261820acb365966cb9431d93aec983b14393eaeefbc96e30a11cf58bc6df
tests/testdata/qa4mre_2012-v0-res.json
View file @
121b7096
{
"results"
:
{
"qa4mre_2012"
:
{
"acc"
:
0.15625
,
"acc_norm"
:
0.16875
,
"acc_norm_stderr"
:
0.029702236908328808
,
"acc_stderr"
:
0.02879508360159146
}},
"versions"
:
{
"qa4mre_2012"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"qa4mre_2012"
:
{
"acc"
:
0.15625
,
"acc_norm"
:
0.16875
,
"acc_norm_stderr"
:
0.029702236908328808
,
"acc_stderr"
:
0.02879508360159146
}},
"versions"
:
{
"qa4mre_2012"
:
0
}}
tests/testdata/qa4mre_2013-v0-loglikelihood
View file @
121b7096
52fc431e94c67f983e28ebc70cf45e6c14116b0ae77dc1bf22347c705a65d054
\ No newline at end of file
52fc431e94c67f983e28ebc70cf45e6c14116b0ae77dc1bf22347c705a65d054
tests/testdata/qa4mre_2013-v0-res.json
View file @
121b7096
{
"results"
:
{
"qa4mre_2013"
:
{
"acc"
:
0.18309859154929578
,
"acc_norm"
:
0.22183098591549297
,
"acc_norm_stderr"
:
0.02469760575535269
,
"acc_stderr"
:
0.022989742475464973
}},
"versions"
:
{
"qa4mre_2013"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"qa4mre_2013"
:
{
"acc"
:
0.18309859154929578
,
"acc_norm"
:
0.22183098591549297
,
"acc_norm_stderr"
:
0.02469760575535269
,
"acc_stderr"
:
0.022989742475464973
}},
"versions"
:
{
"qa4mre_2013"
:
0
}}
tests/testdata/qnli-v0-loglikelihood
View file @
121b7096
4281d4ff5cf1244358b0ea0220c67863c69fbade850696b43e8ff05138e01e12
\ No newline at end of file
4281d4ff5cf1244358b0ea0220c67863c69fbade850696b43e8ff05138e01e12
tests/testdata/qnli-v0-res.json
View file @
121b7096
{
"results"
:
{
"qnli"
:
{
"acc"
:
0.5108914515833791
,
"acc_stderr"
:
0.00676380528502966
}},
"versions"
:
{
"qnli"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"qnli"
:
{
"acc"
:
0.5108914515833791
,
"acc_stderr"
:
0.00676380528502966
}},
"versions"
:
{
"qnli"
:
0
}}
tests/testdata/qqp-v0-loglikelihood
View file @
121b7096
97b551b0fc3d239aad4929a2e8e79c986891aefd9fcd19441fea0382d507889e
\ No newline at end of file
97b551b0fc3d239aad4929a2e8e79c986891aefd9fcd19441fea0382d507889e
tests/testdata/qqp-v0-res.json
View file @
121b7096
{
"results"
:
{
"qqp"
:
{
"acc"
:
0.49782339846648527
,
"acc_stderr"
:
0.0024866770696239894
,
"f1"
:
0.42322661288031593
,
"f1_stderr"
:
0.002695903831328166
}},
"versions"
:
{
"qqp"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"qqp"
:
{
"acc"
:
0.49782339846648527
,
"acc_stderr"
:
0.0024866770696239894
,
"f1"
:
0.42322661288031593
,
"f1_stderr"
:
0.002695903831328166
}},
"versions"
:
{
"qqp"
:
0
}}
tests/testdata/race-v0-loglikelihood
View file @
121b7096
bdfdfab7fa1c7af0c1e161785e347b1b8071a15cbf971f6f2a9ae8c8e845199f
\ No newline at end of file
bdfdfab7fa1c7af0c1e161785e347b1b8071a15cbf971f6f2a9ae8c8e845199f
tests/testdata/race-v0-res.json
View file @
121b7096
{
"results"
:
{
"race"
:
{
"acc"
:
0.23253588516746412
,
"acc_stderr"
:
0.013074460615265295
}},
"versions"
:
{
"race"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"race"
:
{
"acc"
:
0.23253588516746412
,
"acc_stderr"
:
0.013074460615265295
}},
"versions"
:
{
"race"
:
0
}}
tests/testdata/random_insertion-v0-greedy_until
View file @
121b7096
6c48baa6924f3635120f33062251c4b571b3d4e9fe46b14d91f54ddd1c857997
\ No newline at end of file
6c48baa6924f3635120f33062251c4b571b3d4e9fe46b14d91f54ddd1c857997
tests/testdata/random_insertion-v0-res.json
View file @
121b7096
{
"results"
:
{
"random_insertion"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"random_insertion"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"random_insertion"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"random_insertion"
:
0
}}
tests/testdata/record-v0-loglikelihood
View file @
121b7096
a3e378fbde4e28f375cac1561bbfc7d7673c2af193628a774ad012d5192393aa
\ No newline at end of file
a3e378fbde4e28f375cac1561bbfc7d7673c2af193628a774ad012d5192393aa
tests/testdata/record-v0-res.json
View file @
121b7096
{
"results"
:
{
"record"
:
{
"em"
:
0.1521
,
"em_stderr"
:
0.0035913575128186616
,
"f1"
:
0.1581870634920636
,
"f1_stderr"
:
0.0036146895141474576
}},
"versions"
:
{
"record"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"record"
:
{
"em"
:
0.1521
,
"em_stderr"
:
0.0035913575128186616
,
"f1"
:
0.1581870634920636
,
"f1_stderr"
:
0.0036146895141474576
}},
"versions"
:
{
"record"
:
0
}}
tests/testdata/reversed_words-v0-greedy_until
View file @
121b7096
1d79fc4f0177f9624a487b9973f4e0e1d3f8404993b419a7b807a690ebbbb290
\ No newline at end of file
1d79fc4f0177f9624a487b9973f4e0e1d3f8404993b419a7b807a690ebbbb290
tests/testdata/reversed_words-v0-res.json
View file @
121b7096
{
"results"
:
{
"reversed_words"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"reversed_words"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"reversed_words"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"reversed_words"
:
0
}}
tests/testdata/rte-v0-loglikelihood
View file @
121b7096
c80ce13c8c736087f1557f8736d5d318b540ff01e4bb7f55e568890dc8b0393e
\ No newline at end of file
c80ce13c8c736087f1557f8736d5d318b540ff01e4bb7f55e568890dc8b0393e
tests/testdata/rte-v0-res.json
View file @
121b7096
{
"results"
:
{
"rte"
:
{
"acc"
:
0.5379061371841155
,
"acc_stderr"
:
0.030009848912529117
}},
"versions"
:
{
"rte"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"rte"
:
{
"acc"
:
0.5379061371841155
,
"acc_stderr"
:
0.030009848912529117
}},
"versions"
:
{
"rte"
:
0
}}
tests/testdata/sciq-v0-loglikelihood
View file @
121b7096
71cbb6e2a7ac4512c3761ea801d420eb3fac49d158c7e4deaa3ab8727bea923c
\ No newline at end of file
71cbb6e2a7ac4512c3761ea801d420eb3fac49d158c7e4deaa3ab8727bea923c
tests/testdata/sciq-v0-res.json
View file @
121b7096
{
"results"
:
{
"sciq"
:
{
"acc"
:
0.234
,
"acc_norm"
:
0.239
,
"acc_norm_stderr"
:
0.01349300044693758
,
"acc_stderr"
:
0.01339490288966001
}},
"versions"
:
{
"sciq"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"sciq"
:
{
"acc"
:
0.234
,
"acc_norm"
:
0.239
,
"acc_norm_stderr"
:
0.01349300044693758
,
"acc_stderr"
:
0.01339490288966001
}},
"versions"
:
{
"sciq"
:
0
}}
Prev
1
…
21
22
23
24
25
26
27
28
29
…
37
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment