Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
121b7096
Commit
121b7096
authored
May 02, 2022
by
Fabrizio Milo
Browse files
add pre-commit
parent
7a038118
Changes
732
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
20 additions
and
20 deletions
+20
-20
tests/testdata/anagrams1-v0-greedy_until
tests/testdata/anagrams1-v0-greedy_until
+1
-1
tests/testdata/anagrams1-v0-res.json
tests/testdata/anagrams1-v0-res.json
+1
-1
tests/testdata/anagrams2-v0-greedy_until
tests/testdata/anagrams2-v0-greedy_until
+1
-1
tests/testdata/anagrams2-v0-res.json
tests/testdata/anagrams2-v0-res.json
+1
-1
tests/testdata/anli_r1-v0-loglikelihood
tests/testdata/anli_r1-v0-loglikelihood
+1
-1
tests/testdata/anli_r1-v0-res.json
tests/testdata/anli_r1-v0-res.json
+1
-1
tests/testdata/anli_r2-v0-loglikelihood
tests/testdata/anli_r2-v0-loglikelihood
+1
-1
tests/testdata/anli_r2-v0-res.json
tests/testdata/anli_r2-v0-res.json
+1
-1
tests/testdata/anli_r3-v0-loglikelihood
tests/testdata/anli_r3-v0-loglikelihood
+1
-1
tests/testdata/anli_r3-v0-res.json
tests/testdata/anli_r3-v0-res.json
+1
-1
tests/testdata/arc_challenge-v0-loglikelihood
tests/testdata/arc_challenge-v0-loglikelihood
+1
-1
tests/testdata/arc_challenge-v0-res.json
tests/testdata/arc_challenge-v0-res.json
+1
-1
tests/testdata/arc_easy-v0-loglikelihood
tests/testdata/arc_easy-v0-loglikelihood
+1
-1
tests/testdata/arc_easy-v0-res.json
tests/testdata/arc_easy-v0-res.json
+1
-1
tests/testdata/arithmetic_1dc-v0-loglikelihood
tests/testdata/arithmetic_1dc-v0-loglikelihood
+1
-1
tests/testdata/arithmetic_1dc-v0-res.json
tests/testdata/arithmetic_1dc-v0-res.json
+1
-1
tests/testdata/arithmetic_2da-v0-loglikelihood
tests/testdata/arithmetic_2da-v0-loglikelihood
+1
-1
tests/testdata/arithmetic_2da-v0-res.json
tests/testdata/arithmetic_2da-v0-res.json
+1
-1
tests/testdata/arithmetic_2dm-v0-loglikelihood
tests/testdata/arithmetic_2dm-v0-loglikelihood
+1
-1
tests/testdata/arithmetic_2dm-v0-res.json
tests/testdata/arithmetic_2dm-v0-res.json
+1
-1
No files found.
tests/testdata/anagrams1-v0-greedy_until
View file @
121b7096
7c0c5246d3f751f39119a5629ac1d4b2c6fd2a315f78d6de9b2c387e24e3fef1
\ No newline at end of file
7c0c5246d3f751f39119a5629ac1d4b2c6fd2a315f78d6de9b2c387e24e3fef1
tests/testdata/anagrams1-v0-res.json
View file @
121b7096
{
"results"
:
{
"anagrams1"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"anagrams1"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"anagrams1"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"anagrams1"
:
0
}}
tests/testdata/anagrams2-v0-greedy_until
View file @
121b7096
6700a3c44e48abe8337238dcbe3b54cf4abafe0c204c52d921e590872fbd05e7
\ No newline at end of file
6700a3c44e48abe8337238dcbe3b54cf4abafe0c204c52d921e590872fbd05e7
tests/testdata/anagrams2-v0-res.json
View file @
121b7096
{
"results"
:
{
"anagrams2"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"anagrams2"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"anagrams2"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"anagrams2"
:
0
}}
tests/testdata/anli_r1-v0-loglikelihood
View file @
121b7096
3a84baf2f170e138c6ce0bc9f06f905def35d705fa2b8781f10c87aef404c4cb
\ No newline at end of file
3a84baf2f170e138c6ce0bc9f06f905def35d705fa2b8781f10c87aef404c4cb
tests/testdata/anli_r1-v0-res.json
View file @
121b7096
{
"results"
:
{
"anli_r1"
:
{
"acc"
:
0.334
,
"acc_stderr"
:
0.014922019523732967
}},
"versions"
:
{
"anli_r1"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"anli_r1"
:
{
"acc"
:
0.334
,
"acc_stderr"
:
0.014922019523732967
}},
"versions"
:
{
"anli_r1"
:
0
}}
tests/testdata/anli_r2-v0-loglikelihood
View file @
121b7096
d0ea3c3e09d533982c15b4c034439896d6af4bbafb2254d305e20215534a251d
\ No newline at end of file
d0ea3c3e09d533982c15b4c034439896d6af4bbafb2254d305e20215534a251d
tests/testdata/anli_r2-v0-res.json
View file @
121b7096
{
"results"
:
{
"anli_r2"
:
{
"acc"
:
0.356
,
"acc_stderr"
:
0.015149042659306628
}},
"versions"
:
{
"anli_r2"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"anli_r2"
:
{
"acc"
:
0.356
,
"acc_stderr"
:
0.015149042659306628
}},
"versions"
:
{
"anli_r2"
:
0
}}
tests/testdata/anli_r3-v0-loglikelihood
View file @
121b7096
6b6e5c6a794f2fbff78b7aa24fe0c90156039334bbd1cb34f7af9fc6e6183845
\ No newline at end of file
6b6e5c6a794f2fbff78b7aa24fe0c90156039334bbd1cb34f7af9fc6e6183845
tests/testdata/anli_r3-v0-res.json
View file @
121b7096
{
"results"
:
{
"anli_r3"
:
{
"acc"
:
0.31916666666666665
,
"acc_stderr"
:
0.01346230971200514
}},
"versions"
:
{
"anli_r3"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"anli_r3"
:
{
"acc"
:
0.31916666666666665
,
"acc_stderr"
:
0.01346230971200514
}},
"versions"
:
{
"anli_r3"
:
0
}}
tests/testdata/arc_challenge-v0-loglikelihood
View file @
121b7096
41c34c96cca8ace661911d0033d630c554b283f5a3953bcdc50720ae6b00a9c1
\ No newline at end of file
41c34c96cca8ace661911d0033d630c554b283f5a3953bcdc50720ae6b00a9c1
tests/testdata/arc_challenge-v0-res.json
View file @
121b7096
{
"results"
:
{
"arc_challenge"
:
{
"acc"
:
0.24488054607508533
,
"acc_norm"
:
0.2440273037542662
,
"acc_norm_stderr"
:
0.012551447627856257
,
"acc_stderr"
:
0.012566273985131354
}},
"versions"
:
{
"arc_challenge"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"arc_challenge"
:
{
"acc"
:
0.24488054607508533
,
"acc_norm"
:
0.2440273037542662
,
"acc_norm_stderr"
:
0.012551447627856257
,
"acc_stderr"
:
0.012566273985131354
}},
"versions"
:
{
"arc_challenge"
:
0
}}
tests/testdata/arc_easy-v0-loglikelihood
View file @
121b7096
ffa6e39a35a16299dcb015f17f986aaa598ad8b4840c4cebe0339a7042232741
\ No newline at end of file
ffa6e39a35a16299dcb015f17f986aaa598ad8b4840c4cebe0339a7042232741
tests/testdata/arc_easy-v0-res.json
View file @
121b7096
{
"results"
:
{
"arc_easy"
:
{
"acc"
:
0.2474747474747475
,
"acc_norm"
:
0.24074074074074073
,
"acc_norm_stderr"
:
0.008772796145221907
,
"acc_stderr"
:
0.008855114414834707
}},
"versions"
:
{
"arc_easy"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"arc_easy"
:
{
"acc"
:
0.2474747474747475
,
"acc_norm"
:
0.24074074074074073
,
"acc_norm_stderr"
:
0.008772796145221907
,
"acc_stderr"
:
0.008855114414834707
}},
"versions"
:
{
"arc_easy"
:
0
}}
tests/testdata/arithmetic_1dc-v0-loglikelihood
View file @
121b7096
04c3a63a6b3c579bd3775d92b3076ba9130041d5ce7cf9244d3f86e95c804387
\ No newline at end of file
04c3a63a6b3c579bd3775d92b3076ba9130041d5ce7cf9244d3f86e95c804387
tests/testdata/arithmetic_1dc-v0-res.json
View file @
121b7096
{
"results"
:
{
"arithmetic_1dc"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_1dc"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"arithmetic_1dc"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_1dc"
:
0
}}
tests/testdata/arithmetic_2da-v0-loglikelihood
View file @
121b7096
6ca1ca6ebd7cac4420d5005f7f35b0edbc921377f5e4f8874cc176e4fb6d79d4
\ No newline at end of file
6ca1ca6ebd7cac4420d5005f7f35b0edbc921377f5e4f8874cc176e4fb6d79d4
tests/testdata/arithmetic_2da-v0-res.json
View file @
121b7096
{
"results"
:
{
"arithmetic_2da"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_2da"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"arithmetic_2da"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_2da"
:
0
}}
tests/testdata/arithmetic_2dm-v0-loglikelihood
View file @
121b7096
14ac5e510cdf82967d6827a9ca059906ee1db2e347be1b17f36403a157e73552
\ No newline at end of file
14ac5e510cdf82967d6827a9ca059906ee1db2e347be1b17f36403a157e73552
tests/testdata/arithmetic_2dm-v0-res.json
View file @
121b7096
{
"results"
:
{
"arithmetic_2dm"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_2dm"
:
0
}}
\ No newline at end of file
{
"results"
:
{
"arithmetic_2dm"
:
{
"acc"
:
0.0
,
"acc_stderr"
:
0.0
}},
"versions"
:
{
"arithmetic_2dm"
:
0
}}
Prev
1
…
3
4
5
6
7
8
9
10
11
…
37
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment