gaoqiong / lm-evaluation-harness
Commit 121b7096, authored May 02, 2022 by Fabrizio Milo
add pre-commit
Parent: 7a038118
Changes: 732
Showing 20 changed files with 20 additions and 20 deletions
tests/testdata/hendrycksTest-computer_security-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-computer_security-v0-res.json  +1 -1
tests/testdata/hendrycksTest-conceptual_physics-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-conceptual_physics-v0-res.json  +1 -1
tests/testdata/hendrycksTest-econometrics-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-econometrics-v0-res.json  +1 -1
tests/testdata/hendrycksTest-electrical_engineering-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-electrical_engineering-v0-res.json  +1 -1
tests/testdata/hendrycksTest-elementary_mathematics-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-elementary_mathematics-v0-res.json  +1 -1
tests/testdata/hendrycksTest-formal_logic-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-formal_logic-v0-res.json  +1 -1
tests/testdata/hendrycksTest-global_facts-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-global_facts-v0-res.json  +1 -1
tests/testdata/hendrycksTest-high_school_biology-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-high_school_biology-v0-res.json  +1 -1
tests/testdata/hendrycksTest-high_school_chemistry-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-high_school_chemistry-v0-res.json  +1 -1
tests/testdata/hendrycksTest-high_school_computer_science-v0-loglikelihood  +1 -1
tests/testdata/hendrycksTest-high_school_computer_science-v0-res.json  +1 -1
tests/testdata/hendrycksTest-computer_security-v0-loglikelihood
-a8a1892d1906cc3e7ffd321043f0a60f3b8b69ef76e5c6ff03c6ea41dc87d0cb
\ No newline at end of file
+a8a1892d1906cc3e7ffd321043f0a60f3b8b69ef76e5c6ff03c6ea41dc87d0cb

tests/testdata/hendrycksTest-computer_security-v0-res.json
-{"results": {"hendrycksTest-computer_security": {"acc": 0.24, "acc_norm": 0.27, "acc_norm_stderr": 0.044619604333847394, "acc_stderr": 0.042923469599092816}}, "versions": {"hendrycksTest-computer_security": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-computer_security": {"acc": 0.24, "acc_norm": 0.27, "acc_norm_stderr": 0.044619604333847394, "acc_stderr": 0.042923469599092816}}, "versions": {"hendrycksTest-computer_security": 0}}

tests/testdata/hendrycksTest-conceptual_physics-v0-loglikelihood
-622f191ccfc7a597d99f39897ebe3f95a9ddce0e662fcfb411aa554b289bb355
\ No newline at end of file
+622f191ccfc7a597d99f39897ebe3f95a9ddce0e662fcfb411aa554b289bb355

tests/testdata/hendrycksTest-conceptual_physics-v0-res.json
-{"results": {"hendrycksTest-conceptual_physics": {"acc": 0.2680851063829787, "acc_norm": 0.2553191489361702, "acc_norm_stderr": 0.028504856470514185, "acc_stderr": 0.028957342788342347}}, "versions": {"hendrycksTest-conceptual_physics": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-conceptual_physics": {"acc": 0.2680851063829787, "acc_norm": 0.2553191489361702, "acc_norm_stderr": 0.028504856470514185, "acc_stderr": 0.028957342788342347}}, "versions": {"hendrycksTest-conceptual_physics": 0}}

tests/testdata/hendrycksTest-econometrics-v0-loglikelihood
-cde76ba2c7382b4876e17136c94f52aca2774e50342ab757b2a2d18da370dcb6
\ No newline at end of file
+cde76ba2c7382b4876e17136c94f52aca2774e50342ab757b2a2d18da370dcb6

tests/testdata/hendrycksTest-econometrics-v0-res.json
-{"results": {"hendrycksTest-econometrics": {"acc": 0.24561403508771928, "acc_norm": 0.24561403508771928, "acc_norm_stderr": 0.04049339297748142, "acc_stderr": 0.040493392977481425}}, "versions": {"hendrycksTest-econometrics": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-econometrics": {"acc": 0.24561403508771928, "acc_norm": 0.24561403508771928, "acc_norm_stderr": 0.04049339297748142, "acc_stderr": 0.040493392977481425}}, "versions": {"hendrycksTest-econometrics": 0}}

tests/testdata/hendrycksTest-electrical_engineering-v0-loglikelihood
-b9b5d8b8bb02696302ec6bc2a99bf987a5504d3bae0e529d2c8f263538c97518
\ No newline at end of file
+b9b5d8b8bb02696302ec6bc2a99bf987a5504d3bae0e529d2c8f263538c97518

tests/testdata/hendrycksTest-electrical_engineering-v0-res.json
-{"results": {"hendrycksTest-electrical_engineering": {"acc": 0.2689655172413793, "acc_norm": 0.2827586206896552, "acc_norm_stderr": 0.037528339580033376, "acc_stderr": 0.036951833116502325}}, "versions": {"hendrycksTest-electrical_engineering": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-electrical_engineering": {"acc": 0.2689655172413793, "acc_norm": 0.2827586206896552, "acc_norm_stderr": 0.037528339580033376, "acc_stderr": 0.036951833116502325}}, "versions": {"hendrycksTest-electrical_engineering": 0}}

tests/testdata/hendrycksTest-elementary_mathematics-v0-loglikelihood
-6b21f5cd5606268421a667152ec989424b66905c02adbab8d4ff6bb9d21b77d1
\ No newline at end of file
+6b21f5cd5606268421a667152ec989424b66905c02adbab8d4ff6bb9d21b77d1

tests/testdata/hendrycksTest-elementary_mathematics-v0-res.json
-{"results": {"hendrycksTest-elementary_mathematics": {"acc": 0.2724867724867725, "acc_norm": 0.2830687830687831, "acc_norm_stderr": 0.023201392938194978, "acc_stderr": 0.022930973071633345}}, "versions": {"hendrycksTest-elementary_mathematics": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-elementary_mathematics": {"acc": 0.2724867724867725, "acc_norm": 0.2830687830687831, "acc_norm_stderr": 0.023201392938194978, "acc_stderr": 0.022930973071633345}}, "versions": {"hendrycksTest-elementary_mathematics": 0}}

tests/testdata/hendrycksTest-formal_logic-v0-loglikelihood
-c0d0f0c008a5f3faf2f6f4268d87bbc09c40bb66ae08cf38eea0bf2e519c5a59
\ No newline at end of file
+c0d0f0c008a5f3faf2f6f4268d87bbc09c40bb66ae08cf38eea0bf2e519c5a59

tests/testdata/hendrycksTest-formal_logic-v0-res.json
-{"results": {"hendrycksTest-formal_logic": {"acc": 0.25396825396825395, "acc_norm": 0.2698412698412698, "acc_norm_stderr": 0.03970158273235172, "acc_stderr": 0.03893259610604674}}, "versions": {"hendrycksTest-formal_logic": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-formal_logic": {"acc": 0.25396825396825395, "acc_norm": 0.2698412698412698, "acc_norm_stderr": 0.03970158273235172, "acc_stderr": 0.03893259610604674}}, "versions": {"hendrycksTest-formal_logic": 0}}

tests/testdata/hendrycksTest-global_facts-v0-loglikelihood
-9fdc85240b8170839278b1e883ee0868611d84dce202cb8aa037c841ec76d089
\ No newline at end of file
+9fdc85240b8170839278b1e883ee0868611d84dce202cb8aa037c841ec76d089

tests/testdata/hendrycksTest-global_facts-v0-res.json
-{"results": {"hendrycksTest-global_facts": {"acc": 0.23, "acc_norm": 0.23, "acc_norm_stderr": 0.04229525846816507, "acc_stderr": 0.04229525846816507}}, "versions": {"hendrycksTest-global_facts": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-global_facts": {"acc": 0.23, "acc_norm": 0.23, "acc_norm_stderr": 0.04229525846816507, "acc_stderr": 0.04229525846816507}}, "versions": {"hendrycksTest-global_facts": 0}}

tests/testdata/hendrycksTest-high_school_biology-v0-loglikelihood
-d4dc051f37a49dc75c218741e87bc826fd44f31ee1309b55e0f33bd191c1bc78
\ No newline at end of file
+d4dc051f37a49dc75c218741e87bc826fd44f31ee1309b55e0f33bd191c1bc78

tests/testdata/hendrycksTest-high_school_biology-v0-res.json
-{"results": {"hendrycksTest-high_school_biology": {"acc": 0.23870967741935484, "acc_norm": 0.2709677419354839, "acc_norm_stderr": 0.025284416114900152, "acc_stderr": 0.024251071262208834}}, "versions": {"hendrycksTest-high_school_biology": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-high_school_biology": {"acc": 0.23870967741935484, "acc_norm": 0.2709677419354839, "acc_norm_stderr": 0.025284416114900152, "acc_stderr": 0.024251071262208834}}, "versions": {"hendrycksTest-high_school_biology": 0}}

tests/testdata/hendrycksTest-high_school_chemistry-v0-loglikelihood
-f4f338e45415c4b5ee7f1d249155bcd910c8401bd1436760a5ec61cb6bb211b6
\ No newline at end of file
+f4f338e45415c4b5ee7f1d249155bcd910c8401bd1436760a5ec61cb6bb211b6

tests/testdata/hendrycksTest-high_school_chemistry-v0-res.json
-{"results": {"hendrycksTest-high_school_chemistry": {"acc": 0.2857142857142857, "acc_norm": 0.2660098522167488, "acc_norm_stderr": 0.031089826002937523, "acc_stderr": 0.031785297106427496}}, "versions": {"hendrycksTest-high_school_chemistry": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-high_school_chemistry": {"acc": 0.2857142857142857, "acc_norm": 0.2660098522167488, "acc_norm_stderr": 0.031089826002937523, "acc_stderr": 0.031785297106427496}}, "versions": {"hendrycksTest-high_school_chemistry": 0}}

tests/testdata/hendrycksTest-high_school_computer_science-v0-loglikelihood
-870d5a6300c527077aaf6baa3e750e75fa840b41657cf82549f39b768b14862d
\ No newline at end of file
+870d5a6300c527077aaf6baa3e750e75fa840b41657cf82549f39b768b14862d

tests/testdata/hendrycksTest-high_school_computer_science-v0-res.json
-{"results": {"hendrycksTest-high_school_computer_science": {"acc": 0.2, "acc_norm": 0.22, "acc_norm_stderr": 0.04163331998932269, "acc_stderr": 0.04020151261036845}}, "versions": {"hendrycksTest-high_school_computer_science": 0}}
\ No newline at end of file
+{"results": {"hendrycksTest-high_school_computer_science": {"acc": 0.2, "acc_norm": 0.22, "acc_norm_stderr": 0.04163331998932269, "acc_stderr": 0.04020151261036845}}, "versions": {"hendrycksTest-high_school_computer_science": 0}}
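
Every file shown here changes the same way: the old version has no newline at end of file and the new version does, while the content itself is unchanged. That is the kind of change an end-of-file fixer run through pre-commit would typically produce, although this page does not show the hook configuration. A minimal Python sketch of that normalization follows; the helper name ensure_trailing_newline is hypothetical and this is not the hook's actual code.

```python
def ensure_trailing_newline(data: bytes) -> bytes:
    """Return data ending in exactly one newline (hypothetical helper).

    Empty input is returned unchanged, so empty files stay empty.
    """
    if not data:
        return data
    # Drop any run of trailing CR/LF bytes, then add a single LF.
    return data.rstrip(b"\r\n") + b"\n"


# Example with the hash stored in the computer_security loglikelihood file.
old = b"a8a1892d1906cc3e7ffd321043f0a60f3b8b69ef76e5c6ff03c6ea41dc87d0cb"
new = ensure_trailing_newline(old)

# Only the final newline differs; the payload is untouched.
assert new == old + b"\n"
# Applying the fix a second time changes nothing.
assert ensure_trailing_newline(new) == new
```

Because the function is idempotent, running it over files that already end in a newline leaves them unchanged, which matches the one-line addition and one-line deletion reported for each file.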