Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
baa8b0d3
Commit
baa8b0d3
authored
May 18, 2023
by
bzantium
Browse files
fix for merge from master
parent
a956bc63
Changes
392
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
13 additions
and
0 deletions
+13
-0
tests/testdata/lambada_openai_mt_en-v0-res.json
tests/testdata/lambada_openai_mt_en-v0-res.json
+1
-0
tests/testdata/lambada_openai_mt_es-v0-loglikelihood
tests/testdata/lambada_openai_mt_es-v0-loglikelihood
+1
-0
tests/testdata/lambada_openai_mt_es-v0-res.json
tests/testdata/lambada_openai_mt_es-v0-res.json
+1
-0
tests/testdata/lambada_openai_mt_fr-v0-loglikelihood
tests/testdata/lambada_openai_mt_fr-v0-loglikelihood
+1
-0
tests/testdata/lambada_openai_mt_fr-v0-res.json
tests/testdata/lambada_openai_mt_fr-v0-res.json
+1
-0
tests/testdata/lambada_openai_mt_it-v0-loglikelihood
tests/testdata/lambada_openai_mt_it-v0-loglikelihood
+1
-0
tests/testdata/lambada_openai_mt_it-v0-res.json
tests/testdata/lambada_openai_mt_it-v0-res.json
+1
-0
tests/testdata/lambada_standard-v0-loglikelihood
tests/testdata/lambada_standard-v0-loglikelihood
+1
-0
tests/testdata/lambada_standard-v0-res.json
tests/testdata/lambada_standard-v0-res.json
+1
-0
tests/testdata/lambada_standard_cloze-v0-loglikelihood
tests/testdata/lambada_standard_cloze-v0-loglikelihood
+1
-0
tests/testdata/lambada_standard_cloze-v0-res.json
tests/testdata/lambada_standard_cloze-v0-res.json
+1
-0
tests/testdata/swag-v0-loglikelihood
tests/testdata/swag-v0-loglikelihood
+1
-0
tests/testdata/swag-v0-res.json
tests/testdata/swag-v0-res.json
+1
-0
tests/testdata/textsynth_test_0a89c2739f9598b4be2674b0a8e43931d7f3f0b696970bcba31f9b52bdf12297.pkl
...598b4be2674b0a8e43931d7f3f0b696970bcba31f9b52bdf12297.pkl
+0
-0
tests/testdata/textsynth_test_0c1c14571add7903b89e588c8212572b95bb57b334fc0752c89a7e045a5f63ae.pkl
...d7903b89e588c8212572b95bb57b334fc0752c89a7e045a5f63ae.pkl
+0
-0
tests/testdata/textsynth_test_3092d07756f3e1d010c07524cc8a2ecba7f0c19f9e39f2aaf2bf440bfe328004.pkl
...3e1d010c07524cc8a2ecba7f0c19f9e39f2aaf2bf440bfe328004.pkl
+0
-0
tests/testdata/textsynth_test_434076260b6af3a46b7a5eaceec3306a5872c400a3872f744280b237455a0f8e.pkl
...af3a46b7a5eaceec3306a5872c400a3872f744280b237455a0f8e.pkl
+0
-0
tests/testdata/textsynth_test_49c47ae40e11f349f2f6b492128188b1b2bc103a421c676ee4b2142a68b43516.pkl
...1f349f2f6b492128188b1b2bc103a421c676ee4b2142a68b43516.pkl
+0
-0
tests/testdata/textsynth_test_4fd8d66a6dad7f602b40e5d7dc298d6fe329299d086a4659743a41f4a4012659.pkl
...d7f602b40e5d7dc298d6fe329299d086a4659743a41f4a4012659.pkl
+0
-0
tests/testdata/textsynth_test_51b5302f157cf224f694ccad973f255ae19e9e061d533256bdf75b04e0a917ab.pkl
...cf224f694ccad973f255ae19e9e061d533256bdf75b04e0a917ab.pkl
+0
-0
No files found.
tests/testdata/lambada_openai_mt_en-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_openai_mt_en"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_openai_mt_en"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_openai_mt_es-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
4a88f4b316c72fe0396c382d6cbb33568ac4d0ad225150d3536635c085359fc9
\ No newline at end of file
tests/testdata/lambada_openai_mt_es-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_openai_mt_es"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_openai_mt_es"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_openai_mt_fr-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
5d16f4a0c51dc6d7b6df2ebeba2bbfa51e700b843779b559b3d90183d7b02a11
\ No newline at end of file
tests/testdata/lambada_openai_mt_fr-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_openai_mt_fr"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_openai_mt_fr"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_openai_mt_it-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
fd87c6c5cf4e0499c5f9f80e5bd7ee6a4f3d2991902a0cc3ec9e6eaf22d6760a
\ No newline at end of file
tests/testdata/lambada_openai_mt_it-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_openai_mt_it"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_openai_mt_it"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_standard-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
8958d9f8d8145046b692fadd8a9cc9c8bad5617c10774280cf7c24c21d2be160
\ No newline at end of file
tests/testdata/lambada_standard-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_standard"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_standard"
:
0
}}
\ No newline at end of file
tests/testdata/lambada_standard_cloze-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
b604f00bc9f2a77ef41f8cfdb5a8509b3ae9266893b9e90abc665f5399ecba4e
\ No newline at end of file
tests/testdata/lambada_standard_cloze-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"lambada_standard_cloze"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
,
"ppl"
:
1.6479047769869253
,
"ppl_stderr"
:
0.006497321146240192
}},
"versions"
:
{
"lambada_standard_cloze"
:
0
}}
\ No newline at end of file
tests/testdata/swag-v0-loglikelihood
0 → 100644
View file @
baa8b0d3
be4fcbad876124c4ba3c71970538a97fec0e36a9cc677c70b6c9243a7bcee0ec
\ No newline at end of file
tests/testdata/swag-v0-res.json
0 → 100644
View file @
baa8b0d3
{
"results"
:
{
"swag"
:
{
"acc"
:
0.2482255323402979
,
"acc_norm"
:
0.24882535239428172
,
"acc_norm_stderr"
:
0.00305666959496067
,
"acc_stderr"
:
0.003054201832644171
}},
"versions"
:
{
"swag"
:
0
}}
\ No newline at end of file
tests/testdata/textsynth_test_0a89c2739f9598b4be2674b0a8e43931d7f3f0b696970bcba31f9b52bdf12297.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_0c1c14571add7903b89e588c8212572b95bb57b334fc0752c89a7e045a5f63ae.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_3092d07756f3e1d010c07524cc8a2ecba7f0c19f9e39f2aaf2bf440bfe328004.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_434076260b6af3a46b7a5eaceec3306a5872c400a3872f744280b237455a0f8e.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_49c47ae40e11f349f2f6b492128188b1b2bc103a421c676ee4b2142a68b43516.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_4fd8d66a6dad7f602b40e5d7dc298d6fe329299d086a4659743a41f4a4012659.pkl
0 → 100644
View file @
baa8b0d3
File added
tests/testdata/textsynth_test_51b5302f157cf224f694ccad973f255ae19e9e061d533256bdf75b04e0a917ab.pkl
0 → 100644
View file @
baa8b0d3
File added
Prev
1
…
15
16
17
18
19
20
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment