Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
b21d2b6e
Unverified
Commit
b21d2b6e
authored
Sep 05, 2020
by
Anish Thite
Committed by
GitHub
Sep 05, 2020
Browse files
Merge pull request #1 from anishthite/master
Add coqa extraction
parents
1c7fb748
0feef0d6
Changes
3
Expand all
Show whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
163 additions
and
0 deletions
+163
-0
README.md
README.md
+6
-0
datasets/coqa/evaluateCoQA.py
datasets/coqa/evaluateCoQA.py
+29
-0
datasets/coqa/testcoquaextraction.ipynb
datasets/coqa/testcoquaextraction.ipynb
+128
-0
No files found.
README.md
0 → 100644
View file @
b21d2b6e
# Evaluation Harness for Large Language Models
### Currently based on evaluations of GPT3 as mentioned in https://arxiv.org/pdf/2005.14165.pdf
## Summary (need to convert from google docs at some point):
https://docs.google.com/document/d/177dwJpH8GHebISXYZSn4NL98sXdCtQMH82b7O5F7jmw/edit?usp=sharing
datasets/coqa/evaluateCoQA.py
0 → 100644
View file @
b21d2b6e
from
...base
import
Dataset
import
os
import
json
class
CoQA
(
Dataset
):
def
training_docs
(
self
):
pass
def
validation_docs
(
self
):
pass
def
test_docs
(
self
):
pass
def
fewshot_examples
(
self
,
k
):
traindocs
=
list
(
self
.
training_docs
())
random
.
seed
(
123
)
random
.
shuffle
(
traindocs
)
return
traindocs
[:
k
]
def
fewshot_description
(
self
):
pass
def
doc_to_text
(
self
,
doc
,
include_target
=
True
):
json
.
load
(
open
(
doc
))
datasets/coqa/testcoquaextraction.ipynb
0 → 100644
View file @
b21d2b6e
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment