Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
121b7096
"megatron/optimizer_param_scheduler.py" did not exist on "d8c85650ed9b946703383f4b6e12691597c40ccd"
Commit
121b7096
authored
May 02, 2022
by
Fabrizio Milo
Browse files
add pre-commit
parent
7a038118
Changes
732
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/ethics_justice-v0-loglikelihood
tests/testdata/ethics_justice-v0-loglikelihood
+1
-1
tests/testdata/ethics_justice-v0-res.json
tests/testdata/ethics_justice-v0-res.json
+1
-1
tests/testdata/ethics_utilitarianism-v0-loglikelihood
tests/testdata/ethics_utilitarianism-v0-loglikelihood
+1
-1
tests/testdata/ethics_utilitarianism-v0-res.json
tests/testdata/ethics_utilitarianism-v0-res.json
+1
-1
tests/testdata/ethics_utilitarianism_original-v0-loglikelihood
.../testdata/ethics_utilitarianism_original-v0-loglikelihood
+1
-1
tests/testdata/ethics_utilitarianism_original-v0-res.json
tests/testdata/ethics_utilitarianism_original-v0-res.json
+1
-1
tests/testdata/ethics_virtue-v0-loglikelihood
tests/testdata/ethics_virtue-v0-loglikelihood
+1
-1
tests/testdata/ethics_virtue-v0-res.json
tests/testdata/ethics_virtue-v0-res.json
+1
-1
tests/testdata/gsm8k-v0-greedy_until
tests/testdata/gsm8k-v0-greedy_until
+1
-1
tests/testdata/gsm8k-v0-res.json
tests/testdata/gsm8k-v0-res.json
+1
-1
tests/testdata/headqa-v0-loglikelihood
tests/testdata/headqa-v0-loglikelihood
+1
-1
tests/testdata/headqa-v0-res.json
tests/testdata/headqa-v0-res.json
+1
-1
tests/testdata/headqa_en-v0-loglikelihood
tests/testdata/headqa_en-v0-loglikelihood
+1
-1
tests/testdata/headqa_en-v0-res.json
tests/testdata/headqa_en-v0-res.json
+1
-1
tests/testdata/headqa_es-v0-loglikelihood
tests/testdata/headqa_es-v0-loglikelihood
+1
-1
tests/testdata/headqa_es-v0-res.json
tests/testdata/headqa_es-v0-res.json
+1
-1
tests/testdata/hellaswag-v0-loglikelihood
tests/testdata/hellaswag-v0-loglikelihood
+1
-1
tests/testdata/hellaswag-v0-res.json
tests/testdata/hellaswag-v0-res.json
+1
-1
tests/testdata/hendrycksTest-abstract_algebra-v0-loglikelihood
.../testdata/hendrycksTest-abstract_algebra-v0-loglikelihood
+1
-1
tests/testdata/hendrycksTest-abstract_algebra-v0-res.json
tests/testdata/hendrycksTest-abstract_algebra-v0-res.json
+1
-1
No files found.
tests/testdata/ethics_justice-v0-loglikelihood
View file @
121b7096
d7dfc44fea507b5c5c3a8218f79ed8197da8599ebb396d85feb91c25512126b6
\ No newline at end of file
d7dfc44fea507b5c5c3a8218f79ed8197da8599ebb396d85feb91c25512126b6
tests/testdata/ethics_justice-v0-res.json
View file @
121b7096
{
"results"
:
{
"ethics_justice"
:
{
"acc"
:
0.49556213017751477
,
"acc_stderr"
:
0.009616784279885177
,
"em"
:
0.057692307692307696
}},
"versions"
:
{
"ethics_justice"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"ethics_justice"
:
{
"acc"
:
0.49556213017751477
,
"acc_stderr"
:
0.009616784279885177
,
"em"
:
0.057692307692307696
}},
"versions"
:
{
"ethics_justice"
:
0
}}
tests/testdata/ethics_utilitarianism-v0-loglikelihood
View file @
121b7096
88872f1ed1b203f9649a4ced4fb4627d18c17af455d713de6e17c05eced4ec60
\ No newline at end of file
88872f1ed1b203f9649a4ced4fb4627d18c17af455d713de6e17c05eced4ec60
tests/testdata/ethics_utilitarianism-v0-res.json
View file @
121b7096
{
"results"
:
{
"ethics_utilitarianism"
:
{
"acc"
:
0.49771214642262895
,
"acc_stderr"
:
0.007211546310787838
}},
"versions"
:
{
"ethics_utilitarianism"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"ethics_utilitarianism"
:
{
"acc"
:
0.49771214642262895
,
"acc_stderr"
:
0.007211546310787838
}},
"versions"
:
{
"ethics_utilitarianism"
:
0
}}
tests/testdata/ethics_utilitarianism_original-v0-loglikelihood
View file @
121b7096
5b42ba1faf5ece6a6ec9a3976ce79c1fac8df5b98272aab85457188c2142693c
\ No newline at end of file
5b42ba1faf5ece6a6ec9a3976ce79c1fac8df5b98272aab85457188c2142693c
tests/testdata/ethics_utilitarianism_original-v0-res.json
View file @
121b7096
{
"results"
:
{
"ethics_utilitarianism_original"
:
{
"acc"
:
0.5214226289517471
,
"acc_stderr"
:
0.007204999520618661
}},
"versions"
:
{
"ethics_utilitarianism_original"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"ethics_utilitarianism_original"
:
{
"acc"
:
0.5214226289517471
,
"acc_stderr"
:
0.007204999520618661
}},
"versions"
:
{
"ethics_utilitarianism_original"
:
0
}}
tests/testdata/ethics_virtue-v0-loglikelihood
View file @
121b7096
8021db8de46850090ddae6e6ec2d382029c3027b7c69884607503f916d09b709
\ No newline at end of file
8021db8de46850090ddae6e6ec2d382029c3027b7c69884607503f916d09b709
tests/testdata/ethics_virtue-v0-res.json
View file @
121b7096
{
"results"
:
{
"ethics_virtue"
:
{
"acc"
:
0.5035175879396985
,
"acc_stderr"
:
0.0070893491553555765
,
"em"
:
0.036180904522613064
}},
"versions"
:
{
"ethics_virtue"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"ethics_virtue"
:
{
"acc"
:
0.5035175879396985
,
"acc_stderr"
:
0.0070893491553555765
,
"em"
:
0.036180904522613064
}},
"versions"
:
{
"ethics_virtue"
:
0
}}
tests/testdata/gsm8k-v0-greedy_until
View file @
121b7096
e7292dbdd7fd8419ba954f2e0701e04c8d0e8842fe053dbf2fe47d926630e35e
\ No newline at end of file
e7292dbdd7fd8419ba954f2e0701e04c8d0e8842fe053dbf2fe47d926630e35e
tests/testdata/gsm8k-v0-res.json
View file @
121b7096
{
"results"
:
{
"gsm8k"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"gsm8k"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"gsm8k"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"gsm8k"
:
0
}}
tests/testdata/headqa-v0-loglikelihood
View file @
121b7096
767ca34d9714edd9fb030ddbcc35a64e5180d1e247b0cb557fbb22fdf971ad1f
\ No newline at end of file
767ca34d9714edd9fb030ddbcc35a64e5180d1e247b0cb557fbb22fdf971ad1f
tests/testdata/headqa-v0-res.json
View file @
121b7096
{
"results"
:
{
"headqa"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.25018234865062
,
"acc_norm_stderr"
:
0.008272783230806014
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"headqa"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.25018234865062
,
"acc_norm_stderr"
:
0.008272783230806014
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa"
:
0
}}
tests/testdata/headqa_en-v0-loglikelihood
View file @
121b7096
09da45119b12a0144e3081f8fb790c2a22af7b9c3aac42f54423d348a711fbf5
\ No newline at end of file
09da45119b12a0144e3081f8fb790c2a22af7b9c3aac42f54423d348a711fbf5
tests/testdata/headqa_en-v0-res.json
View file @
121b7096
{
"results"
:
{
"headqa_en"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.2447118891320204
,
"acc_norm_stderr"
:
0.008211629406841468
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_en"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"headqa_en"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.2447118891320204
,
"acc_norm_stderr"
:
0.008211629406841468
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_en"
:
0
}}
tests/testdata/headqa_es-v0-loglikelihood
View file @
121b7096
767ca34d9714edd9fb030ddbcc35a64e5180d1e247b0cb557fbb22fdf971ad1f
\ No newline at end of file
767ca34d9714edd9fb030ddbcc35a64e5180d1e247b0cb557fbb22fdf971ad1f
tests/testdata/headqa_es-v0-res.json
View file @
121b7096
{
"results"
:
{
"headqa_es"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.25018234865062
,
"acc_norm_stderr"
:
0.008272783230806014
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_es"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"headqa_es"
:
{
"acc"
:
0.23559445660102116
,
"acc_norm"
:
0.25018234865062
,
"acc_norm_stderr"
:
0.008272783230806014
,
"acc_stderr"
:
0.008105688874297972
}},
"versions"
:
{
"headqa_es"
:
0
}}
tests/testdata/hellaswag-v0-loglikelihood
View file @
121b7096
abb808c97d6529eda6c11067837a132c62d25cba0394d720f80cca6df9f7196e
\ No newline at end of file
abb808c97d6529eda6c11067837a132c62d25cba0394d720f80cca6df9f7196e
tests/testdata/hellaswag-v0-res.json
View file @
121b7096
{
"results"
:
{
"hellaswag"
:
{
"acc"
:
0.24965146385182235
,
"acc_norm"
:
0.24756024696275641
,
"acc_norm_stderr"
:
0.004307128573285236
,
"acc_stderr"
:
0.004319267432460666
}},
"versions"
:
{
"hellaswag"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"hellaswag"
:
{
"acc"
:
0.24965146385182235
,
"acc_norm"
:
0.24756024696275641
,
"acc_norm_stderr"
:
0.004307128573285236
,
"acc_stderr"
:
0.004319267432460666
}},
"versions"
:
{
"hellaswag"
:
0
}}
tests/testdata/hendrycksTest-abstract_algebra-v0-loglikelihood
View file @
121b7096
e35d1eeb356ac1084d4e9773f028cb3c81ba1c6e5574d598ac4a78aa467cd797
\ No newline at end of file
e35d1eeb356ac1084d4e9773f028cb3c81ba1c6e5574d598ac4a78aa467cd797
tests/testdata/hendrycksTest-abstract_algebra-v0-res.json
View file @
121b7096
{
"results"
:
{
"hendrycksTest-abstract_algebra"
:
{
"acc"
:
0.32
,
"acc_norm"
:
0.34
,
"acc_norm_stderr"
:
0.04760952285695235
,
"acc_stderr"
:
0.04688261722621504
}},
"versions"
:
{
"hendrycksTest-abstract_algebra"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"hendrycksTest-abstract_algebra"
:
{
"acc"
:
0.32
,
"acc_norm"
:
0.34
,
"acc_norm_stderr"
:
0.04760952285695235
,
"acc_stderr"
:
0.04688261722621504
}},
"versions"
:
{
"hendrycksTest-abstract_algebra"
:
0
}}
Prev
1
…
6
7
8
9
10
11
12
13
14
…
37
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment