Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
55e62507
Commit
55e62507
authored
Jan 31, 2022
by
researcher2
Browse files
Merge branch 'master' into researcher2
parents
bb0eafbb
26f0233f
Changes
269
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
0 deletions
+20
-0
tests/testdata/pile_pubmed-central-v1-res.json
tests/testdata/pile_pubmed-central-v1-res.json
+1
-0
tests/testdata/pile_stackexchange-v1-loglikelihood_rolling
tests/testdata/pile_stackexchange-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_stackexchange-v1-res.json
tests/testdata/pile_stackexchange-v1-res.json
+1
-0
tests/testdata/pile_ubuntu-irc-v1-loglikelihood_rolling
tests/testdata/pile_ubuntu-irc-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_ubuntu-irc-v1-res.json
tests/testdata/pile_ubuntu-irc-v1-res.json
+1
-0
tests/testdata/pile_uspto-v1-loglikelihood_rolling
tests/testdata/pile_uspto-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_uspto-v1-res.json
tests/testdata/pile_uspto-v1-res.json
+1
-0
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_wikipedia-v1-res.json
tests/testdata/pile_wikipedia-v1-res.json
+1
-0
tests/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
...s/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
+1
-0
tests/testdata/pile_youtubesubtitles-v1-res.json
tests/testdata/pile_youtubesubtitles-v1-res.json
+1
-0
tests/testdata/wikitext-v1-loglikelihood_rolling
tests/testdata/wikitext-v1-loglikelihood_rolling
+1
-0
tests/testdata/wikitext-v1-res.json
tests/testdata/wikitext-v1-res.json
+1
-0
tests/testdata/wnli-v1-loglikelihood
tests/testdata/wnli-v1-loglikelihood
+1
-0
tests/testdata/wnli-v1-res.json
tests/testdata/wnli-v1-res.json
+1
-0
tests/tests/testdata/blimp_adjunct_island-v0-loglikelihood
tests/tests/testdata/blimp_adjunct_island-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_adjunct_island-v0-res.json
tests/tests/testdata/blimp_adjunct_island-v0-res.json
+1
-0
tests/tests/testdata/blimp_anaphor_gender_agreement-v0-loglikelihood
.../testdata/blimp_anaphor_gender_agreement-v0-loglikelihood
+1
-0
tests/tests/testdata/blimp_anaphor_gender_agreement-v0-res.json
...tests/testdata/blimp_anaphor_gender_agreement-v0-res.json
+1
-0
tests/tests/testdata/blimp_anaphor_number_agreement-v0-loglikelihood
.../testdata/blimp_anaphor_number_agreement-v0-loglikelihood
+1
-0
No files found.
tests/testdata/pile_pubmed-central-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_pubmed-central"
:
{
"bits_per_byte"
:
2.2812488135667854e-05
,
"byte_perplexity"
:
1.0000158125368497
,
"word_perplexity"
:
1.000123107107861
}},
"versions"
:
{
"pile_pubmed-central"
:
1
}}
\ No newline at end of file
tests/testdata/pile_stackexchange-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
e524bfb3e21cbdaddc117403a50df598520c7bf5b2c60ad8f2372cfa564e79be
\ No newline at end of file
tests/testdata/pile_stackexchange-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_stackexchange"
:
{
"bits_per_byte"
:
0.0003302063346758449
,
"byte_perplexity"
:
1.0002289077852733
,
"word_perplexity"
:
1.0016993562258851
}},
"versions"
:
{
"pile_stackexchange"
:
1
}}
\ No newline at end of file
tests/testdata/pile_ubuntu-irc-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
4eb69e314f0864ec8890e2323d7e76f8a8309692c4f090e2b41bf4be681a811d
\ No newline at end of file
tests/testdata/pile_ubuntu-irc-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_ubuntu-irc"
:
{
"bits_per_byte"
:
2.3513498942121155e-06
,
"byte_perplexity"
:
1.0000016298328778
,
"word_perplexity"
:
1.0000108866656874
}},
"versions"
:
{
"pile_ubuntu-irc"
:
1
}}
\ No newline at end of file
tests/testdata/pile_uspto-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
789b2bdb31564d512b70f801316f49320a26c83ba361226bac0afb255341d477
\ No newline at end of file
tests/testdata/pile_uspto-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_uspto"
:
{
"bits_per_byte"
:
0.000174024142670342
,
"byte_perplexity"
:
1.00012063161925
,
"word_perplexity"
:
1.0007716198916954
}},
"versions"
:
{
"pile_uspto"
:
1
}}
\ No newline at end of file
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
ef9ec0dd408316ca6537228a6812e839f14b30608973081d41efc47c138338da
\ No newline at end of file
tests/testdata/pile_wikipedia-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_wikipedia"
:
{
"bits_per_byte"
:
0.00024287370359008176
,
"byte_perplexity"
:
1.0001683613940646
,
"word_perplexity"
:
1.001084677949439
}},
"versions"
:
{
"pile_wikipedia"
:
1
}}
\ No newline at end of file
tests/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
68263c52adc0086011e2220b619983935cabb1cc1f5f9f8ee1a74ab2a7457967
\ No newline at end of file
tests/testdata/pile_youtubesubtitles-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"pile_youtubesubtitles"
:
{
"bits_per_byte"
:
3.3827117222045906e-05
,
"byte_perplexity"
:
1.000023447445816
,
"word_perplexity"
:
1.0001529192262875
}},
"versions"
:
{
"pile_youtubesubtitles"
:
1
}}
\ No newline at end of file
tests/testdata/wikitext-v1-loglikelihood_rolling
0 → 100644
View file @
55e62507
b6f83e6cf7535ee41b0057c3e2ec2cf7f2fa5a9119b305c479a83091d1142b2c
\ No newline at end of file
tests/testdata/wikitext-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"wikitext"
:
{
"bits_per_byte"
:
3.202519859941674e-05
,
"byte_perplexity"
:
1.0000221984224973
,
"word_perplexity"
:
1.000118710696617
}},
"versions"
:
{
"wikitext"
:
1
}}
\ No newline at end of file
tests/testdata/wnli-v1-loglikelihood
0 → 100644
View file @
55e62507
8a0f81661d2ab2334bbc8031fac31c0c8882f1d9271dd51599d21dfdbb726dea
\ No newline at end of file
tests/testdata/wnli-v1-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"wnli"
:
{
"acc"
:
0.5633802816901409
,
"acc_stderr"
:
0.0592793555841297
}},
"versions"
:
{
"wnli"
:
1
}}
\ No newline at end of file
tests/tests/testdata/blimp_adjunct_island-v0-loglikelihood
0 → 100644
View file @
55e62507
976a5cac4bdb724632eebd4cb9e522203ce3da8d5525288a597c86e80469f3f2
\ No newline at end of file
tests/tests/testdata/blimp_adjunct_island-v0-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"blimp_adjunct_island"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_adjunct_island"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_anaphor_gender_agreement-v0-loglikelihood
0 → 100644
View file @
55e62507
2d8964e56a17661502ecf3f09c0befba63915360ddf2145b0bd845816950515d
\ No newline at end of file
tests/tests/testdata/blimp_anaphor_gender_agreement-v0-res.json
0 → 100644
View file @
55e62507
{
"results"
:
{
"blimp_anaphor_gender_agreement"
:
{
"acc"
:
0.485
,
"acc_stderr"
:
0.0158121796418149
}},
"versions"
:
{
"blimp_anaphor_gender_agreement"
:
0
}}
\ No newline at end of file
tests/tests/testdata/blimp_anaphor_number_agreement-v0-loglikelihood
0 → 100644
View file @
55e62507
0bdad31c974ba064e1f1ba931841ec2ba7461e8b0ca54ea5f79f08b6bae0bab5
\ No newline at end of file
Prev
1
…
3
4
5
6
7
8
9
10
11
…
14
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment