Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
78824d7f
Unverified
Commit
78824d7f
authored
Jan 08, 2022
by
Thomas Wang
Committed by
GitHub
Jan 08, 2022
Browse files
Merge branch 'master' into thomas/fix_best_download_version
parents
c65412e5
cc238121
Changes
114
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
0 deletions
+20
-0
tests/testdata/boolq-v1-loglikelihood
tests/testdata/boolq-v1-loglikelihood
+1
-0
tests/testdata/boolq-v1-res.json
tests/testdata/boolq-v1-res.json
+1
-0
tests/testdata/headqa_en-v0-loglikelihood
tests/testdata/headqa_en-v0-loglikelihood
+1
-0
tests/testdata/headqa_en-v0-res.json
tests/testdata/headqa_en-v0-res.json
+1
-0
tests/testdata/headqa_es-v0-loglikelihood
tests/testdata/headqa_es-v0-loglikelihood
+1
-0
tests/testdata/headqa_es-v0-res.json
tests/testdata/headqa_es-v0-res.json
+1
-0
tests/testdata/multirc-v1-loglikelihood
tests/testdata/multirc-v1-loglikelihood
+1
-0
tests/testdata/multirc-v1-res.json
tests/testdata/multirc-v1-res.json
+1
-0
tests/testdata/pile_arxiv-v1-loglikelihood_rolling
tests/testdata/pile_arxiv-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_arxiv-v1-res.json
tests/testdata/pile_arxiv-v1-res.json
+1
-0
tests/testdata/pile_bookcorpus2-v1-loglikelihood_rolling
tests/testdata/pile_bookcorpus2-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_bookcorpus2-v1-res.json
tests/testdata/pile_bookcorpus2-v1-res.json
+1
-0
tests/testdata/pile_books3-v1-loglikelihood_rolling
tests/testdata/pile_books3-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_books3-v1-res.json
tests/testdata/pile_books3-v1-res.json
+1
-0
tests/testdata/pile_dm-mathematics-v1-loglikelihood_rolling
tests/testdata/pile_dm-mathematics-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_dm-mathematics-v1-res.json
tests/testdata/pile_dm-mathematics-v1-res.json
+1
-0
tests/testdata/pile_enron-v1-loglikelihood_rolling
tests/testdata/pile_enron-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_enron-v1-res.json
tests/testdata/pile_enron-v1-res.json
+1
-0
tests/testdata/pile_europarl-v1-loglikelihood_rolling
tests/testdata/pile_europarl-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_europarl-v1-res.json
tests/testdata/pile_europarl-v1-res.json
+1
-0
No files found.
tests/testdata/boolq-v1-loglikelihood
0 → 100644
View file @
78824d7f
6577e0d88572772ef08e64f624c0e3df0953286ae1f118ccef15623b59ffeabf
\ No newline at end of file
tests/testdata/boolq-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"boolq"
:
{
"acc"
:
0.5048929663608562
,
"acc_stderr"
:
0.00874463623355505
}},
"versions"
:
{
"boolq"
:
1
}}
\ No newline at end of file
tests/testdata/headqa_en-v0-loglikelihood
0 → 100644
View file @
78824d7f
09da45119b12a0144e3081f8fb790c2a22af7b9c3aac42f54423d348a711fbf5
\ No newline at end of file
tests/testdata/headqa_en-v0-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"headqa_en"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.2447118891320204
,
"acc_norm_stderr"
:
0.008211629406841468
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_en"
:
0
}}
\ No newline at end of file
tests/testdata/headqa_es-v0-loglikelihood
0 → 100644
View file @
78824d7f
767ca34d9714edd9fb030ddbcc35a64e5180d1e247b0cb557fbb22fdf971ad1f
\ No newline at end of file
tests/testdata/headqa_es-v0-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"headqa_es"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.25018234865062
,
"acc_norm_stderr"
:
0.008272783230806014
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_es"
:
0
}}
\ No newline at end of file
tests/testdata/multirc-v1-loglikelihood
0 → 100644
View file @
78824d7f
0e793bd6f637a70a04c6f2cda080188fc037961b2f909095fe63f7bdbc4a90c6
\ No newline at end of file
tests/testdata/multirc-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"multirc"
:
{
"acc"
:
0.046169989506820566
,
"acc_stderr"
:
0.006801377886208738
}},
"versions"
:
{
"multirc"
:
1
}}
\ No newline at end of file
tests/testdata/pile_arxiv-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
814f9954e44368559602c00f7e85fa3971acdfd0315f508ec7df6318a79c55ec
\ No newline at end of file
tests/testdata/pile_arxiv-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_arxiv"
:
{
"bits_per_byte"
:
1.55095665856779e-05
,
"byte_perplexity"
:
1.0000107504701365
,
"word_perplexity"
:
1.0000819333090385
}},
"versions"
:
{
"pile_arxiv"
:
1
}}
\ No newline at end of file
tests/testdata/pile_bookcorpus2-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
5c17ddfebeab8c41dabadb6fc216ceda91e3fe5dc95aaf1b2c843d7f11828b03
\ No newline at end of file
tests/testdata/pile_bookcorpus2-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_bookcorpus2"
:
{
"bits_per_byte"
:
1.6780040419457868e-06
,
"byte_perplexity"
:
1.000001163104447
,
"word_perplexity"
:
1.0000066499426599
}},
"versions"
:
{
"pile_bookcorpus2"
:
1
}}
\ No newline at end of file
tests/testdata/pile_books3-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
0f8f36f705b999b6d55fa72ff89a82793dd1cb568ab1f8727a6a2086a12b9410
\ No newline at end of file
tests/testdata/pile_books3-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_books3"
:
{
"bits_per_byte"
:
1.2901280503011222e-06
,
"byte_perplexity"
:
1.0000008942490204
,
"word_perplexity"
:
1.0000052870063607
}},
"versions"
:
{
"pile_books3"
:
1
}}
\ No newline at end of file
tests/testdata/pile_dm-mathematics-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
d5b7967c0ece8b816f3921a8bd0fad23365349e935b491595e2ad1135af42da6
\ No newline at end of file
tests/testdata/pile_dm-mathematics-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_dm-mathematics"
:
{
"bits_per_byte"
:
8.910951449933553e-05
,
"byte_perplexity"
:
1.0000617679162955
,
"word_perplexity"
:
1.0002875035042451
}},
"versions"
:
{
"pile_dm-mathematics"
:
1
}}
\ No newline at end of file
tests/testdata/pile_enron-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
4baa6ccdc9e3aa9921675ab4400d5e89d7b546b844a8ea28f6461d649066418a
\ No newline at end of file
tests/testdata/pile_enron-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_enron"
:
{
"bits_per_byte"
:
0.0004564546920781453
,
"byte_perplexity"
:
1.000316440339552
,
"word_perplexity"
:
1.00224668051869
}},
"versions"
:
{
"pile_enron"
:
1
}}
\ No newline at end of file
tests/testdata/pile_europarl-v1-loglikelihood_rolling
0 → 100644
View file @
78824d7f
e67d3dbccd47d308bfc5b0e66b76d0dfc5e386ebfa94e056562c2281c395543f
\ No newline at end of file
tests/testdata/pile_europarl-v1-res.json
0 → 100644
View file @
78824d7f
{
"results"
:
{
"pile_europarl"
:
{
"bits_per_byte"
:
1.2477664839621123e-05
,
"byte_perplexity"
:
1.000008648895605
,
"word_perplexity"
:
1.000063506523818
}},
"versions"
:
{
"pile_europarl"
:
1
}}
\ No newline at end of file
Prev
1
2
3
4
5
6
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment