Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
8c997e53
Commit
8c997e53
authored
May 03, 2022
by
jon-tow
Browse files
Revert `tests/testdata` changes and address flake8 issues
parent
d95a4333
Changes
627
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/hendrycksTest-world_religions-v0-res.json
tests/testdata/hendrycksTest-world_religions-v0-res.json
+1
-1
tests/testdata/iwslt17-ar-en-v0-greedy_until
tests/testdata/iwslt17-ar-en-v0-greedy_until
+1
-1
tests/testdata/iwslt17-ar-en-v0-res.json
tests/testdata/iwslt17-ar-en-v0-res.json
+1
-1
tests/testdata/iwslt17-en-ar-v0-greedy_until
tests/testdata/iwslt17-en-ar-v0-greedy_until
+1
-1
tests/testdata/iwslt17-en-ar-v0-res.json
tests/testdata/iwslt17-en-ar-v0-res.json
+1
-1
tests/testdata/lambada-v0-loglikelihood
tests/testdata/lambada-v0-loglikelihood
+1
-1
tests/testdata/lambada-v0-res.json
tests/testdata/lambada-v0-res.json
+1
-1
tests/testdata/lambada_cloze-v0-loglikelihood
tests/testdata/lambada_cloze-v0-loglikelihood
+1
-1
tests/testdata/lambada_cloze-v0-res.json
tests/testdata/lambada_cloze-v0-res.json
+1
-1
tests/testdata/lambada_mt_de-v0-loglikelihood
tests/testdata/lambada_mt_de-v0-loglikelihood
+1
-1
tests/testdata/lambada_mt_de-v0-res.json
tests/testdata/lambada_mt_de-v0-res.json
+1
-1
tests/testdata/lambada_mt_en-v0-loglikelihood
tests/testdata/lambada_mt_en-v0-loglikelihood
+1
-1
tests/testdata/lambada_mt_en-v0-res.json
tests/testdata/lambada_mt_en-v0-res.json
+1
-1
tests/testdata/lambada_mt_es-v0-loglikelihood
tests/testdata/lambada_mt_es-v0-loglikelihood
+1
-1
tests/testdata/lambada_mt_es-v0-res.json
tests/testdata/lambada_mt_es-v0-res.json
+1
-1
tests/testdata/lambada_mt_fr-v0-loglikelihood
tests/testdata/lambada_mt_fr-v0-loglikelihood
+1
-1
tests/testdata/lambada_mt_fr-v0-res.json
tests/testdata/lambada_mt_fr-v0-res.json
+1
-1
tests/testdata/lambada_mt_it-v0-loglikelihood
tests/testdata/lambada_mt_it-v0-loglikelihood
+1
-1
tests/testdata/lambada_mt_it-v0-res.json
tests/testdata/lambada_mt_it-v0-res.json
+1
-1
tests/testdata/logiqa-v0-loglikelihood
tests/testdata/logiqa-v0-loglikelihood
+1
-1
No files found.
tests/testdata/hendrycksTest-world_religions-v0-res.json
View file @
8c997e53
{
"results"
:
{
"hendrycksTest-world_religions"
:
{
"acc"
:
0.21637426900584794
,
"acc_norm"
:
0.22807017543859648
,
"acc_norm_stderr"
:
0.03218093795602357
,
"acc_stderr"
:
0.03158149539338734
}},
"versions"
:
{
"hendrycksTest-world_religions"
:
0
}}
{
"results"
:
{
"hendrycksTest-world_religions"
:
{
"acc"
:
0.21637426900584794
,
"acc_norm"
:
0.22807017543859648
,
"acc_norm_stderr"
:
0.03218093795602357
,
"acc_stderr"
:
0.03158149539338734
}},
"versions"
:
{
"hendrycksTest-world_religions"
:
0
}}
\ No newline at end of file
tests/testdata/iwslt17-ar-en-v0-greedy_until
View file @
8c997e53
e94d310de91fad7ce36f4cf3305552020221482c5588f2efcefaa019893504f1
e94d310de91fad7ce36f4cf3305552020221482c5588f2efcefaa019893504f1
\ No newline at end of file
tests/testdata/iwslt17-ar-en-v0-res.json
View file @
8c997e53
{
"results"
:
{
"iwslt17-ar-en"
:
{
"bleu"
:
0.0
,
"bleu_stderr"
:
0.0
,
"chrf"
:
0.015049895477752772
,
"chrf_stderr"
:
0.0002940315671893584
,
"ter"
:
1.0
,
"ter_stderr"
:
0.0
}},
"versions"
:
{
"iwslt17-ar-en"
:
0
}}
{
"results"
:
{
"iwslt17-ar-en"
:
{
"bleu"
:
0.0
,
"bleu_stderr"
:
0.0
,
"chrf"
:
0.015049895477752772
,
"chrf_stderr"
:
0.0002940315671893584
,
"ter"
:
1.0
,
"ter_stderr"
:
0.0
}},
"versions"
:
{
"iwslt17-ar-en"
:
0
}}
\ No newline at end of file
tests/testdata/iwslt17-en-ar-v0-greedy_until
View file @
8c997e53
b20adbcd2c6d135e28600b427113532c5df624cb3a90e8c5e48715c09a3a38fa
b20adbcd2c6d135e28600b427113532c5df624cb3a90e8c5e48715c09a3a38fa
\ No newline at end of file
tests/testdata/iwslt17-en-ar-v0-res.json
View file @
8c997e53
{
"results"
:
{
"iwslt17-en-ar"
:
{
"bleu"
:
0.0
,
"bleu_stderr"
:
0.0
,
"chrf"
:
0.0
,
"chrf_stderr"
:
0.0
,
"ter"
:
1.0
,
"ter_stderr"
:
0.0
}},
"versions"
:
{
"iwslt17-en-ar"
:
0
}}
{
"results"
:
{
"iwslt17-en-ar"
:
{
"bleu"
:
0.0
,
"bleu_stderr"
:
0.0
,
"chrf"
:
0.0
,
"chrf_stderr"
:
0.0
,
"ter"
:
1.0
,
"ter_stderr"
:
0.0
}},
"versions"
:
{
"iwslt17-en-ar"
:
0
}}
\ No newline at end of file
tests/testdata/lambada-v0-loglikelihood
View file @
8c997e53
6829e6a8aa5922e6c92dd31403cc060f242dc0ede4a775e085a70da095ab2e20
6829e6a8aa5922e6c92dd31403cc060f242dc0ede4a775e085a70da095ab2e20
\ No newline at end of file
tests/testdata/lambada-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada"
:
0
}}
{
"results"
:
{
"lambada"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_cloze-v0-loglikelihood
View file @
8c997e53
7655e748b63ae7e9911411d2d2a2577221d6c861ca4448509992541294d689f3
7655e748b63ae7e9911411d2d2a2577221d6c861ca4448509992541294d689f3
\ No newline at end of file
tests/testdata/lambada_cloze-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_cloze"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_cloze"
:
0
}}
{
"results"
:
{
"lambada_cloze"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_cloze"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_mt_de-v0-loglikelihood
View file @
8c997e53
5ad125e1708499832b2cee8c3388f89f9c0277010fd96fbd3359039ce8105984
5ad125e1708499832b2cee8c3388f89f9c0277010fd96fbd3359039ce8105984
\ No newline at end of file
tests/testdata/lambada_mt_de-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_mt_de"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_de"
:
0
}}
{
"results"
:
{
"lambada_mt_de"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_de"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_mt_en-v0-loglikelihood
View file @
8c997e53
6829e6a8aa5922e6c92dd31403cc060f242dc0ede4a775e085a70da095ab2e20
6829e6a8aa5922e6c92dd31403cc060f242dc0ede4a775e085a70da095ab2e20
\ No newline at end of file
tests/testdata/lambada_mt_en-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_mt_en"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_en"
:
0
}}
{
"results"
:
{
"lambada_mt_en"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_en"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_mt_es-v0-loglikelihood
View file @
8c997e53
4a88f4b316c72fe0396c382d6cbb33568ac4d0ad225150d3536635c085359fc9
4a88f4b316c72fe0396c382d6cbb33568ac4d0ad225150d3536635c085359fc9
\ No newline at end of file
tests/testdata/lambada_mt_es-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_mt_es"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_es"
:
0
}}
{
"results"
:
{
"lambada_mt_es"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_es"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_mt_fr-v0-loglikelihood
View file @
8c997e53
5d16f4a0c51dc6d7b6df2ebeba2bbfa51e700b843779b559b3d90183d7b02a11
5d16f4a0c51dc6d7b6df2ebeba2bbfa51e700b843779b559b3d90183d7b02a11
\ No newline at end of file
tests/testdata/lambada_mt_fr-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_mt_fr"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_fr"
:
0
}}
{
"results"
:
{
"lambada_mt_fr"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_fr"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_mt_it-v0-loglikelihood
View file @
8c997e53
fd87c6c5cf4e0499c5f9f80e5bd7ee6a4f3d2991902a0cc3ec9e6eaf22d6760a
fd87c6c5cf4e0499c5f9f80e5bd7ee6a4f3d2991902a0cc3ec9e6eaf22d6760a
\ No newline at end of file
tests/testdata/lambada_mt_it-v0-res.json
View file @
8c997e53
{
"results"
:
{
"lambada_mt_it"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_it"
:
0
}}
{
"results"
:
{
"lambada_mt_it"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_mt_it"
:
0
}}
\ No newline at end of file
tests/testdata/logiqa-v0-loglikelihood
View file @
8c997e53
12495c50454ba5e1ce0753bd18c09aaca516bebd27648d815e37b15229dbf198
12495c50454ba5e1ce0753bd18c09aaca516bebd27648d815e37b15229dbf198
\ No newline at end of file
Prev
1
…
14
15
16
17
18
19
20
21
22
…
32
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment