Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
a73e80c7
Commit
a73e80c7
authored
Nov 03, 2021
by
Leo Gao
Browse files
Add testdata
parent
05ed92a4
Changes
4
Show whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
4 additions
and
0 deletions
+4
-0
tests/testdata/truthfulqa_gen-v1-greedy_until
tests/testdata/truthfulqa_gen-v1-greedy_until
+1
-0
tests/testdata/truthfulqa_gen-v1-res.json
tests/testdata/truthfulqa_gen-v1-res.json
+1
-0
tests/testdata/truthfulqa_mc-v1-loglikelihood
tests/testdata/truthfulqa_mc-v1-loglikelihood
+1
-0
tests/testdata/truthfulqa_mc-v1-res.json
tests/testdata/truthfulqa_mc-v1-res.json
+1
-0
No files found.
tests/testdata/truthfulqa_gen-v1-greedy_until
0 → 100644
View file @
a73e80c7
0d7c56e1aa71ffd8f94bde28f6e8dfdd35f7aaadffa0620bd2a27704253d6c14
\ No newline at end of file
tests/testdata/truthfulqa_gen-v1-res.json
0 → 100644
View file @
a73e80c7
{
"results"
:
{
"truthfulqa_gen"
:
{
"bleu_acc"
:
0.0
,
"bleu_acc_stderr"
:
0.0
,
"bleu_diff"
:
0.0
,
"bleu_diff_stderr"
:
0.0
,
"bleu_max"
:
0.0
,
"bleu_max_stderr"
:
0.0
,
"bleurt_acc"
:
0.8372093023255814
,
"bleurt_acc_stderr"
:
0.012923696051772253
,
"bleurt_diff"
:
0.13967358205134603
,
"bleurt_diff_stderr"
:
0.00532907098769571
,
"bleurt_max"
:
-1.4402793981454072
,
"bleurt_max_stderr"
:
0.0021884846359458963
,
"rouge1_acc"
:
0.0
,
"rouge1_acc_stderr"
:
0.0
,
"rouge1_diff"
:
0.0
,
"rouge1_diff_stderr"
:
0.0
,
"rouge1_max"
:
0.0
,
"rouge1_max_stderr"
:
0.0
,
"rouge2_acc"
:
0.0
,
"rouge2_acc_stderr"
:
0.0
,
"rouge2_diff"
:
0.0
,
"rouge2_diff_stderr"
:
0.0
,
"rouge2_max"
:
0.0
,
"rouge2_max_stderr"
:
0.0
,
"rougeL_acc"
:
0.0
,
"rougeL_acc_stderr"
:
0.0
,
"rougeL_diff"
:
0.0
,
"rougeL_diff_stderr"
:
0.0
,
"rougeL_max"
:
0.0
,
"rougeL_max_stderr"
:
0.0
}},
"versions"
:
{
"truthfulqa_gen"
:
1
}}
\ No newline at end of file
tests/testdata/truthfulqa_mc-v1-loglikelihood
0 → 100644
View file @
a73e80c7
226a6783976177dc9ceda5688623ff37023242eff30ddf270b886bf7b9b32228
\ No newline at end of file
tests/testdata/truthfulqa_mc-v1-res.json
0 → 100644
View file @
a73e80c7
{
"results"
:
{
"truthfulqa_mc"
:
{
"mc1"
:
0.2141982864137087
,
"mc1_stderr"
:
0.01436214815569045
,
"mc2"
:
0.465436996173817
,
"mc2_stderr"
:
0.0048422530880316405
}},
"versions"
:
{
"truthfulqa_mc"
:
1
}}
\ No newline at end of file
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment