Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
121b7096
Commit
121b7096
authored
May 02, 2022
by
Fabrizio Milo
Browse files
add pre-commit
parent
7a038118
Changes
732
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/pile_uspto-v0-loglikelihood_rolling
tests/testdata/pile_uspto-v0-loglikelihood_rolling
+1
-1
tests/testdata/pile_uspto-v0-res.json
tests/testdata/pile_uspto-v0-res.json
+1
-1
tests/testdata/pile_uspto-v1-loglikelihood_rolling
tests/testdata/pile_uspto-v1-loglikelihood_rolling
+1
-1
tests/testdata/pile_uspto-v1-res.json
tests/testdata/pile_uspto-v1-res.json
+1
-1
tests/testdata/pile_wikipedia-v0-loglikelihood_rolling
tests/testdata/pile_wikipedia-v0-loglikelihood_rolling
+1
-1
tests/testdata/pile_wikipedia-v0-res.json
tests/testdata/pile_wikipedia-v0-res.json
+1
-1
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
+1
-1
tests/testdata/pile_wikipedia-v1-res.json
tests/testdata/pile_wikipedia-v1-res.json
+1
-1
tests/testdata/pile_youtubesubtitles-v0-loglikelihood_rolling
...s/testdata/pile_youtubesubtitles-v0-loglikelihood_rolling
+1
-1
tests/testdata/pile_youtubesubtitles-v0-res.json
tests/testdata/pile_youtubesubtitles-v0-res.json
+1
-1
tests/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
...s/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
+1
-1
tests/testdata/pile_youtubesubtitles-v1-res.json
tests/testdata/pile_youtubesubtitles-v1-res.json
+1
-1
tests/testdata/piqa-v0-loglikelihood
tests/testdata/piqa-v0-loglikelihood
+1
-1
tests/testdata/piqa-v0-res.json
tests/testdata/piqa-v0-res.json
+1
-1
tests/testdata/prost-v0-loglikelihood
tests/testdata/prost-v0-loglikelihood
+1
-1
tests/testdata/prost-v0-res.json
tests/testdata/prost-v0-res.json
+1
-1
tests/testdata/pubmedqa-v0-loglikelihood
tests/testdata/pubmedqa-v0-loglikelihood
+1
-1
tests/testdata/pubmedqa-v0-res.json
tests/testdata/pubmedqa-v0-res.json
+1
-1
tests/testdata/qa4mre_2011-v0-loglikelihood
tests/testdata/qa4mre_2011-v0-loglikelihood
+1
-1
tests/testdata/qa4mre_2011-v0-res.json
tests/testdata/qa4mre_2011-v0-res.json
+1
-1
No files found.
tests/testdata/pile_uspto-v0-loglikelihood_rolling
View file @
121b7096
789b2bdb31564d512b70f801316f49320a26c83ba361226bac0afb255341d477
\ No newline at end of file
789b2bdb31564d512b70f801316f49320a26c83ba361226bac0afb255341d477
tests/testdata/pile_uspto-v0-res.json
View file @
121b7096
{
"results"
:
{
"pile_uspto"
:
{
"bits_per_byte"
:
0.00012062434384130924
,
"byte_perplexity"
:
1.00012063161925
,
"word_perplexity"
:
1.0007716198916954
}},
"versions"
:
{
"pile_uspto"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"pile_uspto"
:
{
"bits_per_byte"
:
0.00012062434384130924
,
"byte_perplexity"
:
1.00012063161925
,
"word_perplexity"
:
1.0007716198916954
}},
"versions"
:
{
"pile_uspto"
:
0
}}
tests/testdata/pile_uspto-v1-loglikelihood_rolling
View file @
121b7096
789b2bdb31564d512b70f801316f49320a26c83ba361226bac0afb255341d477
\ No newline at end of file
789b2bdb31564d512b70f801316f49320a26c83ba361226bac0afb255341d477
tests/testdata/pile_uspto-v1-res.json
View file @
121b7096
{
"results"
:
{
"pile_uspto"
:
{
"bits_per_byte"
:
0.000174024142670342
,
"byte_perplexity"
:
1.00012063161925
,
"word_perplexity"
:
1.0007716198916954
}},
"versions"
:
{
"pile_uspto"
:
1
}}
\ No newline at end of file
{
"results"
:
{
"pile_uspto"
:
{
"bits_per_byte"
:
0.000174024142670342
,
"byte_perplexity"
:
1.00012063161925
,
"word_perplexity"
:
1.0007716198916954
}},
"versions"
:
{
"pile_uspto"
:
1
}}
tests/testdata/pile_wikipedia-v0-loglikelihood_rolling
View file @
121b7096
ef9ec0dd408316ca6537228a6812e839f14b30608973081d41efc47c138338da
\ No newline at end of file
ef9ec0dd408316ca6537228a6812e839f14b30608973081d41efc47c138338da
tests/testdata/pile_wikipedia-v0-res.json
View file @
121b7096
{
"results"
:
{
"pile_wikipedia"
:
{
"bits_per_byte"
:
0.00016834722287561703
,
"byte_perplexity"
:
1.0001683613940646
,
"word_perplexity"
:
1.001084677949439
}},
"versions"
:
{
"pile_wikipedia"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"pile_wikipedia"
:
{
"bits_per_byte"
:
0.00016834722287561703
,
"byte_perplexity"
:
1.0001683613940646
,
"word_perplexity"
:
1.001084677949439
}},
"versions"
:
{
"pile_wikipedia"
:
0
}}
tests/testdata/pile_wikipedia-v1-loglikelihood_rolling
View file @
121b7096
ef9ec0dd408316ca6537228a6812e839f14b30608973081d41efc47c138338da
\ No newline at end of file
ef9ec0dd408316ca6537228a6812e839f14b30608973081d41efc47c138338da
tests/testdata/pile_wikipedia-v1-res.json
View file @
121b7096
{
"results"
:
{
"pile_wikipedia"
:
{
"bits_per_byte"
:
0.00024287370359008176
,
"byte_perplexity"
:
1.0001683613940646
,
"word_perplexity"
:
1.001084677949439
}},
"versions"
:
{
"pile_wikipedia"
:
1
}}
\ No newline at end of file
{
"results"
:
{
"pile_wikipedia"
:
{
"bits_per_byte"
:
0.00024287370359008176
,
"byte_perplexity"
:
1.0001683613940646
,
"word_perplexity"
:
1.001084677949439
}},
"versions"
:
{
"pile_wikipedia"
:
1
}}
tests/testdata/pile_youtubesubtitles-v0-loglikelihood_rolling
View file @
121b7096
68263c52adc0086011e2220b619983935cabb1cc1f5f9f8ee1a74ab2a7457967
\ No newline at end of file
68263c52adc0086011e2220b619983935cabb1cc1f5f9f8ee1a74ab2a7457967
tests/testdata/pile_youtubesubtitles-v0-res.json
View file @
121b7096
{
"results"
:
{
"pile_youtubesubtitles"
:
{
"bits_per_byte"
:
2.3447170928931888e-05
,
"byte_perplexity"
:
1.000023447445816
,
"word_perplexity"
:
1.0001529192262875
}},
"versions"
:
{
"pile_youtubesubtitles"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"pile_youtubesubtitles"
:
{
"bits_per_byte"
:
2.3447170928931888e-05
,
"byte_perplexity"
:
1.000023447445816
,
"word_perplexity"
:
1.0001529192262875
}},
"versions"
:
{
"pile_youtubesubtitles"
:
0
}}
tests/testdata/pile_youtubesubtitles-v1-loglikelihood_rolling
View file @
121b7096
68263c52adc0086011e2220b619983935cabb1cc1f5f9f8ee1a74ab2a7457967
\ No newline at end of file
68263c52adc0086011e2220b619983935cabb1cc1f5f9f8ee1a74ab2a7457967
tests/testdata/pile_youtubesubtitles-v1-res.json
View file @
121b7096
{
"results"
:
{
"pile_youtubesubtitles"
:
{
"bits_per_byte"
:
3.3827117222045906e-05
,
"byte_perplexity"
:
1.000023447445816
,
"word_perplexity"
:
1.0001529192262875
}},
"versions"
:
{
"pile_youtubesubtitles"
:
1
}}
\ No newline at end of file
{
"results"
:
{
"pile_youtubesubtitles"
:
{
"bits_per_byte"
:
3.3827117222045906e-05
,
"byte_perplexity"
:
1.000023447445816
,
"word_perplexity"
:
1.0001529192262875
}},
"versions"
:
{
"pile_youtubesubtitles"
:
1
}}
tests/testdata/piqa-v0-loglikelihood
View file @
121b7096
6048a3a2bb3ad1e6a3d98139618e06b4d7de766edd685bd38837596199c3f69f
\ No newline at end of file
6048a3a2bb3ad1e6a3d98139618e06b4d7de766edd685bd38837596199c3f69f
tests/testdata/piqa-v0-res.json
View file @
121b7096
{
"results"
:
{
"piqa"
:
{
"acc"
:
0.514145810663765
,
"acc_norm"
:
0.5114254624591947
,
"acc_norm_stderr"
:
0.01166277802645167
,
"acc_stderr"
:
0.011661154475524836
}},
"versions"
:
{
"piqa"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"piqa"
:
{
"acc"
:
0.514145810663765
,
"acc_norm"
:
0.5114254624591947
,
"acc_norm_stderr"
:
0.01166277802645167
,
"acc_stderr"
:
0.011661154475524836
}},
"versions"
:
{
"piqa"
:
0
}}
tests/testdata/prost-v0-loglikelihood
View file @
121b7096
7c475f5b36a8b79f94c2be035441e7fd59dac021b0713b1fc72d256424c70b0b
\ No newline at end of file
7c475f5b36a8b79f94c2be035441e7fd59dac021b0713b1fc72d256424c70b0b
tests/testdata/prost-v0-res.json
View file @
121b7096
{
"results"
:
{
"prost"
:
{
"acc"
:
0.24631725021349274
,
"acc_norm"
:
0.2581127241673783
,
"acc_norm_stderr"
:
0.00319703079646546
,
"acc_stderr"
:
0.003147855968061357
}},
"versions"
:
{
"prost"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"prost"
:
{
"acc"
:
0.24631725021349274
,
"acc_norm"
:
0.2581127241673783
,
"acc_norm_stderr"
:
0.00319703079646546
,
"acc_stderr"
:
0.003147855968061357
}},
"versions"
:
{
"prost"
:
0
}}
tests/testdata/pubmedqa-v0-loglikelihood
View file @
121b7096
7a04a1fb1d2b19db84fd15c224015d6c0306a41195a4e71fe6abd48fb4d53b9f
\ No newline at end of file
7a04a1fb1d2b19db84fd15c224015d6c0306a41195a4e71fe6abd48fb4d53b9f
tests/testdata/pubmedqa-v0-res.json
View file @
121b7096
{
"results"
:
{
"pubmedqa"
:
{
"acc"
:
0.324
,
"acc_stderr"
:
0.01480686473373886
}},
"versions"
:
{
"pubmedqa"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"pubmedqa"
:
{
"acc"
:
0.324
,
"acc_stderr"
:
0.01480686473373886
}},
"versions"
:
{
"pubmedqa"
:
0
}}
tests/testdata/qa4mre_2011-v0-loglikelihood
View file @
121b7096
0d09f17c65768e797633494d2d218e4e46a26f718cab8b0bf3d156b073a8c437
\ No newline at end of file
0d09f17c65768e797633494d2d218e4e46a26f718cab8b0bf3d156b073a8c437
tests/testdata/qa4mre_2011-v0-res.json
View file @
121b7096
{
"results"
:
{
"qa4mre_2011"
:
{
"acc"
:
0.225
,
"acc_norm"
:
0.23333333333333334
,
"acc_norm_stderr"
:
0.03877199986918664
,
"acc_stderr"
:
0.0382797091741014
}},
"versions"
:
{
"qa4mre_2011"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"qa4mre_2011"
:
{
"acc"
:
0.225
,
"acc_norm"
:
0.23333333333333334
,
"acc_norm_stderr"
:
0.03877199986918664
,
"acc_stderr"
:
0.0382797091741014
}},
"versions"
:
{
"qa4mre_2011"
:
0
}}
Prev
1
…
20
21
22
23
24
25
26
27
28
…
37
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment