Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
cd130e17
Unverified
Commit
cd130e17
authored
Oct 01, 2023
by
Michael Goin
Committed by
GitHub
Oct 01, 2023
Browse files
Merge branch 'EleutherAI:master' into deepsparselm
parents
cec27dad
f2de8609
Changes
12
Hide whitespace changes
Inline
Side-by-side
Showing
12 changed files
with
18 additions
and
22 deletions
+18
-22
lm_eval/datasets/asdiv/asdiv.py
lm_eval/datasets/asdiv/asdiv.py
+2
-2
lm_eval/datasets/coqa/coqa.py
lm_eval/datasets/coqa/coqa.py
+1
-2
lm_eval/datasets/drop/drop.py
lm_eval/datasets/drop/drop.py
+2
-2
lm_eval/datasets/headqa/headqa.py
lm_eval/datasets/headqa/headqa.py
+1
-0
lm_eval/datasets/hendrycks_ethics/hendrycks_ethics.py
lm_eval/datasets/hendrycks_ethics/hendrycks_ethics.py
+2
-2
lm_eval/datasets/hendrycks_math/hendrycks_math.py
lm_eval/datasets/hendrycks_math/hendrycks_math.py
+2
-2
lm_eval/datasets/logiqa/logiqa.py
lm_eval/datasets/logiqa/logiqa.py
+1
-2
lm_eval/datasets/mutual/mutual.py
lm_eval/datasets/mutual/mutual.py
+1
-2
lm_eval/datasets/pile/pile.py
lm_eval/datasets/pile/pile.py
+2
-2
lm_eval/datasets/quac/quac.py
lm_eval/datasets/quac/quac.py
+2
-2
lm_eval/datasets/sat_analogies/sat_analogies.py
lm_eval/datasets/sat_analogies/sat_analogies.py
+1
-2
lm_eval/datasets/unscramble/unscramble.py
lm_eval/datasets/unscramble/unscramble.py
+1
-2
No files found.
lm_eval/datasets/asdiv/asdiv.py
View file @
cd130e17
...
...
@@ -43,8 +43,8 @@ level (for indicating the level of difficulty).
_HOMEPAGE
=
"https://github.com/chaochun/nlu-asdiv-dataset"
#
TODO: Add the l
icen
c
e
for th
e
d
at
aset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
availabl
e at
https://github.com/chaochun/nlu-asdiv-dataset/blob/master/README.md
_LICENSE
=
"
CC BY-NC 4.0
"
_URLS
=
"https://github.com/chaochun/nlu-asdiv-dataset/archive/55790e5270bb91ccfa5053194b25732534696b50.zip"
...
...
lm_eval/datasets/coqa/coqa.py
View file @
cd130e17
...
...
@@ -44,8 +44,7 @@ appear in a conversation.
_HOMEPAGE
=
"https://stanfordnlp.github.io/coqa/"
# TODO: Add the licence for the dataset here if you can find it
_LICENSE
=
""
_LICENSE
=
"Different licenses depending on the content (see https://stanfordnlp.github.io/coqa/ for details)"
_URLS
=
{
"train"
:
"https://nlp.stanford.edu/data/coqa/coqa-train-v1.0.json"
,
...
...
lm_eval/datasets/drop/drop.py
View file @
cd130e17
...
...
@@ -43,8 +43,8 @@ and perform discrete operations over them (such as addition, counting, or sortin
_HOMEPAGE
=
"https://allenai.org/data/drop"
#
TODO: Add the l
icen
c
e
for th
e
d
at
aset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
availabl
e at
https://allenai.org/data/drop
_LICENSE
=
"
CC BY
"
_URLS
=
{
"drop"
:
"https://s3-us-west-2.amazonaws.com/allennlp/datasets/drop/drop_dataset.zip"
,
...
...
lm_eval/datasets/headqa/headqa.py
View file @
cd130e17
...
...
@@ -51,6 +51,7 @@ The dataset contains questions about the following topics: medicine, nursing, ps
_HOMEPAGE
=
"https://aghie.github.io/head-qa/"
# License available at https://github.com/aghie/head-qa/blob/master/LICENSE
_LICENSE
=
"MIT License"
_URL
=
"https://drive.google.com/uc?export=download&confirm=t&id=1a_95N5zQQoUCq8IBNVZgziHbeM-QxG2t"
...
...
lm_eval/datasets/hendrycks_ethics/hendrycks_ethics.py
View file @
cd130e17
...
...
@@ -41,8 +41,8 @@ learning agents.
_HOMEPAGE
=
"https://github.com/hendrycks/ethics"
#
TODO: Add the l
icen
c
e
for th
e
d
at
aset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
availabl
e at
https://github.com/hendrycks/ethics/blob/master/LICENSE
_LICENSE
=
"
MIT License
"
_URLS
=
"https://people.eecs.berkeley.edu/~hendrycks/ethics.tar"
...
...
lm_eval/datasets/hendrycks_math/hendrycks_math.py
View file @
cd130e17
...
...
@@ -38,8 +38,8 @@ models to generate answer derivations and explanations.
_HOMEPAGE
=
"https://github.com/hendrycks/math"
#
TODO: Add the l
icen
c
e
for the dataset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
available at https://github.com/hendrycks/math/blob/main/LICENSE
_LICENSE
=
"
MIT License
"
_URLS
=
"https://people.eecs.berkeley.edu/~hendrycks/MATH.tar"
...
...
lm_eval/datasets/logiqa/logiqa.py
View file @
cd130e17
...
...
@@ -38,8 +38,7 @@ NLP setting.
_HOMEPAGE
=
"https://github.com/lgw863/LogiQA-dataset"
# TODO: Add the licence for the dataset here if you can find it
_LICENSE
=
""
_LICENSE
=
"No license found"
_URLS
=
{
"train"
:
"https://raw.githubusercontent.com/lgw863/LogiQA-dataset/master/Train.txt"
,
...
...
lm_eval/datasets/mutual/mutual.py
View file @
cd130e17
...
...
@@ -38,8 +38,7 @@ modified from Chinese high school English listening comprehension test data.
_HOMEPAGE
=
"https://github.com/Nealcly/MuTual"
# TODO: Add the licence for the dataset here if you can find it
_LICENSE
=
""
_LICENSE
=
"No license found"
_URLS
=
"https://github.com/Nealcly/MuTual/archive/master.zip"
...
...
lm_eval/datasets/pile/pile.py
View file @
cd130e17
...
...
@@ -38,8 +38,8 @@ math, computer science, and philosophy papers.
_HOMEPAGE
=
"https://pile.eleuther.ai/"
#
TODO: Add the l
icen
c
e
for th
e
d
at
aset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
availabl
e at
https://github.com/EleutherAI/the-pile/blob/master/LICENSE
_LICENSE
=
"
MIT License
"
_URLS
=
{
"validation"
:
"https://the-eye.eu/public/AI/pile/val.jsonl.zst"
,
...
...
lm_eval/datasets/quac/quac.py
View file @
cd130e17
...
...
@@ -39,8 +39,8 @@ a teacher who answers the questions by providing short excerpts (spans) from the
_HOMEPAGE
=
"https://quac.ai/"
#
TODO: Add the l
icen
c
e
for the dataset here if you can find it
_LICENSE
=
""
#
L
icen
s
e
available at https://quac.ai/
_LICENSE
=
"
CC BY-SA 4.0
"
_URLS
=
{
"train"
:
"https://s3.amazonaws.com/my89public/quac/train_v0.2.json"
,
...
...
lm_eval/datasets/sat_analogies/sat_analogies.py
View file @
cd130e17
...
...
@@ -39,8 +39,7 @@ multiple-choice analogy questions; 5 choices per question.
_HOMEPAGE
=
"https://aclweb.org/aclwiki/SAT_Analogy_Questions_(State_of_the_art)"
# TODO: Add the licence for the dataset here if you can find it
_LICENSE
=
""
_LICENSE
=
"No license found"
class
SatAnalogies
(
datasets
.
GeneratorBasedBuilder
):
...
...
lm_eval/datasets/unscramble/unscramble.py
View file @
cd130e17
...
...
@@ -42,8 +42,7 @@ addition, or deletion of characters, and asking it to recover the original word.
_HOMEPAGE
=
"https://github.com/openai/gpt-3/tree/master/data"
# TODO: Add the licence for the dataset here if you can find it
_LICENSE
=
""
_LICENSE
=
"No license found"
_BASE_URL
=
"https://raw.githubusercontent.com/openai/gpt-3/master/data"
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment