gaoqiong / lm-evaluation-harness
Commit 8c997e53, authored May 03, 2022 by jon-tow
Revert `tests/testdata` changes and address flake8 issues
parent d95a4333
Changes: 627
Showing 20 changed files with 20 additions and 20 deletions (+20 -20)
tests/testdata/logiqa-v0-res.json +1 -1
tests/testdata/math_algebra-v0-greedy_until +1 -1
tests/testdata/math_algebra-v0-res.json +1 -1
tests/testdata/math_algebra-v1-greedy_until +1 -1
tests/testdata/math_algebra-v1-res.json +1 -1
tests/testdata/math_counting_and_prob-v0-greedy_until +1 -1
tests/testdata/math_counting_and_prob-v0-res.json +1 -1
tests/testdata/math_counting_and_prob-v1-greedy_until +1 -1
tests/testdata/math_counting_and_prob-v1-res.json +1 -1
tests/testdata/math_geometry-v0-greedy_until +1 -1
tests/testdata/math_geometry-v0-res.json +1 -1
tests/testdata/math_geometry-v1-greedy_until +1 -1
tests/testdata/math_geometry-v1-res.json +1 -1
tests/testdata/math_intermediate_algebra-v0-greedy_until +1 -1
tests/testdata/math_intermediate_algebra-v0-res.json +1 -1
tests/testdata/math_intermediate_algebra-v1-greedy_until +1 -1
tests/testdata/math_intermediate_algebra-v1-res.json +1 -1
tests/testdata/math_num_theory-v0-greedy_until +1 -1
tests/testdata/math_num_theory-v0-res.json +1 -1
tests/testdata/math_num_theory-v1-greedy_until +1 -1
tests/testdata/logiqa-v0-res.json
-{"results": {"logiqa": {"acc": 0.25806451612903225, "acc_norm": 0.2764976958525346, "acc_norm_stderr": 0.017543209075825194, "acc_stderr": 0.017162894755127077}}, "versions": {"logiqa": 0}}
+{"results": {"logiqa": {"acc": 0.25806451612903225, "acc_norm": 0.2764976958525346, "acc_norm_stderr": 0.017543209075825194, "acc_stderr": 0.017162894755127077}}, "versions": {"logiqa": 0}}
\ No newline at end of file
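The removed and added lines in this hunk are textually identical, and the "No newline at end of file" marker follows the new side, which suggests the change affects only the final newline. A minimal sketch of how such a difference could be detected, assuming both versions are available as bytes; the function name and the sample inputs are hypothetical and not part of the repository:

```python
def differs_only_by_trailing_newline(old: bytes, new: bytes) -> bool:
    """True when the inputs differ, but only in trailing CR/LF characters."""
    return old != new and old.rstrip(b"\r\n") == new.rstrip(b"\r\n")


# Illustrative inputs: the same payload with and without a final newline.
old_side = b'{"versions": {"logiqa": 0}}\n'
new_side = b'{"versions": {"logiqa": 0}}'
print(differs_only_by_trailing_newline(old_side, new_side))
```

Comparing bytes rather than decoded text keeps the check independent of any newline translation done when opening files in text mode.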
tests/testdata/math_algebra-v0-greedy_until
-f19182ce697a2c095d9e5b56ee6659dc38c93994b69ca75d7c3d3f5fd87572b4
+f19182ce697a2c095d9e5b56ee6659dc38c93994b69ca75d7c3d3f5fd87572b4
\ No newline at end of file
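The `greedy_until` fixture holds a single 64-character lowercase hexadecimal string, which is the length of a SHA-256 hex digest. The page does not say what input is hashed, so the sketch below only checks the format; the helper name is hypothetical:

```python
import re

# 64 lowercase hex characters: the shape of a SHA-256 hex digest.
SHA256_HEX = re.compile(r"[0-9a-f]{64}")


def looks_like_sha256(text: str) -> bool:
    """Format check only; it cannot confirm which algorithm produced the value."""
    return SHA256_HEX.fullmatch(text.strip()) is not None


fixture_value = "f19182ce697a2c095d9e5b56ee6659dc38c93994b69ca75d7c3d3f5fd87572b4"
print(looks_like_sha256(fixture_value))
```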
tests/testdata/math_algebra-v0-res.json
-{"results": {"math_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_algebra": 0}}
+{"results": {"math_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_algebra": 0}}
\ No newline at end of file
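Each `-res.json` fixture on this page is a single JSON object with a `results` map (task name to metrics) and a `versions` map (task name to an integer). A minimal sketch of reading one, using the content shown above as a string literal; the `summarize` helper is hypothetical and not part of the repository:

```python
import json

# Fixture content copied from the diff above.
fixture_text = (
    '{"results": {"math_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, '
    '"versions": {"math_algebra": 0}}'
)


def summarize(text: str) -> dict:
    """Merge each task's metrics with its version number into one record."""
    data = json.loads(text)
    return {
        task: {"version": data["versions"].get(task), **metrics}
        for task, metrics in data["results"].items()
    }


summary = summarize(fixture_text)
print(summary)
```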
tests/testdata/math_algebra-v1-greedy_until
-f19182ce697a2c095d9e5b56ee6659dc38c93994b69ca75d7c3d3f5fd87572b4
+f19182ce697a2c095d9e5b56ee6659dc38c93994b69ca75d7c3d3f5fd87572b4
\ No newline at end of file
tests/testdata/math_algebra-v1-res.json
-{"results": {"math_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_algebra": 1}}
+{"results": {"math_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_algebra": 1}}
\ No newline at end of file
tests/testdata/math_counting_and_prob-v0-greedy_until
-2aa9ae43ee9dbb2457525247d7b65358632c5eaa9cbfc40cf95a4f17f5d942ad
+2aa9ae43ee9dbb2457525247d7b65358632c5eaa9cbfc40cf95a4f17f5d942ad
\ No newline at end of file
tests/testdata/math_counting_and_prob-v0-res.json
-{"results": {"math_counting_and_prob": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_counting_and_prob": 0}}
+{"results": {"math_counting_and_prob": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_counting_and_prob": 0}}
\ No newline at end of file
tests/testdata/math_counting_and_prob-v1-greedy_until
-2aa9ae43ee9dbb2457525247d7b65358632c5eaa9cbfc40cf95a4f17f5d942ad
+2aa9ae43ee9dbb2457525247d7b65358632c5eaa9cbfc40cf95a4f17f5d942ad
\ No newline at end of file
tests/testdata/math_counting_and_prob-v1-res.json
-{"results": {"math_counting_and_prob": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_counting_and_prob": 1}}
+{"results": {"math_counting_and_prob": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_counting_and_prob": 1}}
\ No newline at end of file
tests/testdata/math_geometry-v0-greedy_until
-46bc4cb219b6903397da782699a684bdbb982c0c954ff82e6beeed5c84878f42
+46bc4cb219b6903397da782699a684bdbb982c0c954ff82e6beeed5c84878f42
\ No newline at end of file
tests/testdata/math_geometry-v0-res.json
-{"results": {"math_geometry": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_geometry": 0}}
+{"results": {"math_geometry": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_geometry": 0}}
\ No newline at end of file
tests/testdata/math_geometry-v1-greedy_until
-46bc4cb219b6903397da782699a684bdbb982c0c954ff82e6beeed5c84878f42
+46bc4cb219b6903397da782699a684bdbb982c0c954ff82e6beeed5c84878f42
\ No newline at end of file
tests/testdata/math_geometry-v1-res.json
-{"results": {"math_geometry": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_geometry": 1}}
+{"results": {"math_geometry": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_geometry": 1}}
\ No newline at end of file
tests/testdata/math_intermediate_algebra-v0-greedy_until
-d53c699de272d517ed7ad783b4e692302be9f9f97a8d4ac7a6541e538a7cabe0
+d53c699de272d517ed7ad783b4e692302be9f9f97a8d4ac7a6541e538a7cabe0
\ No newline at end of file
tests/testdata/math_intermediate_algebra-v0-res.json
-{"results": {"math_intermediate_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_intermediate_algebra": 0}}
+{"results": {"math_intermediate_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_intermediate_algebra": 0}}
\ No newline at end of file
tests/testdata/math_intermediate_algebra-v1-greedy_until
-d53c699de272d517ed7ad783b4e692302be9f9f97a8d4ac7a6541e538a7cabe0
+d53c699de272d517ed7ad783b4e692302be9f9f97a8d4ac7a6541e538a7cabe0
\ No newline at end of file
tests/testdata/math_intermediate_algebra-v1-res.json
-{"results": {"math_intermediate_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_intermediate_algebra": 1}}
+{"results": {"math_intermediate_algebra": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_intermediate_algebra": 1}}
\ No newline at end of file
tests/testdata/math_num_theory-v0-greedy_until
-b920ccb507afdcf3ef6f4c04891913731e9f32ec914801791c6d9f8abf6e1897
+b920ccb507afdcf3ef6f4c04891913731e9f32ec914801791c6d9f8abf6e1897
\ No newline at end of file
tests/testdata/math_num_theory-v0-res.json
-{"results": {"math_num_theory": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_num_theory": 0}}
+{"results": {"math_num_theory": {"acc": 0.0, "acc_stderr": 0.0}}, "versions": {"math_num_theory": 0}}
\ No newline at end of file
tests/testdata/math_num_theory-v1-greedy_until
-b920ccb507afdcf3ef6f4c04891913731e9f32ec914801791c6d9f8abf6e1897
+b920ccb507afdcf3ef6f4c04891913731e9f32ec914801791c6d9f8abf6e1897
\ No newline at end of file