Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
fbeaa2c1
Commit
fbeaa2c1
authored
Jun 13, 2024
by
Yu Shi Jie
Browse files
Merge branch 'mmlu-pro' of github.com:ysjprojects/lm-evaluation-harness into mmlu-pro
Resolve conflict.
parents
91b2eec6
5c7cba23
Changes
27
Hide whitespace changes
Inline
Side-by-side
Showing
7 changed files
with
6 additions
and
4 deletions
+6
-4
lm_eval/tasks/squad_completion/task.py
lm_eval/tasks/squad_completion/task.py
+0
-2
lm_eval/tasks/squadv2/task.py
lm_eval/tasks/squadv2/task.py
+1
-0
lm_eval/tasks/tinyBenchmarks/utils_winogrande.py
lm_eval/tasks/tinyBenchmarks/utils_winogrande.py
+1
-1
lm_eval/tasks/tmmluplus/default/_generate_configs.py
lm_eval/tasks/tmmluplus/default/_generate_configs.py
+1
-0
scripts/clean_training_data/README.md
scripts/clean_training_data/README.md
+1
-1
scripts/make_table_results.py
scripts/make_table_results.py
+1
-0
scripts/make_table_tasks.py
scripts/make_table_tasks.py
+1
-0
No files found.
lm_eval/tasks/squad_completion/task.py
View file @
fbeaa2c1
"""
"""
import
re
from
typing
import
List
...
...
lm_eval/tasks/squadv2/task.py
View file @
fbeaa2c1
...
...
@@ -13,6 +13,7 @@ also determine when no answer is supported by the paragraph and abstain from ans
Homepage: https://rajpurkar.github.io/SQuAD-explorer/
"""
from
functools
import
partial
from
math
import
exp
...
...
lm_eval/tasks/tinyBenchmarks/utils_winogrande.py
View file @
fbeaa2c1
"""
This code mirrors the utils of the original winogrande task
"""
"""This code mirrors the utils of the original winogrande task"""
def
doc_to_text
(
doc
):
...
...
lm_eval/tasks/tmmluplus/default/_generate_configs.py
View file @
fbeaa2c1
"""
Take in a YAML, and output all "other" splits with this YAML
"""
import
argparse
import
os
...
...
scripts/clean_training_data/README.md
View file @
fbeaa2c1
...
...
@@ -10,7 +10,7 @@ It uses the approach described in the [GPT-3 paper](https://arxiv.org/abs/2005.1
the match, splitting the training data into chunks
3) Any chunks less than
`minimum_slice_length`
are removed
4) Training data sets split into more than
`too_dirty_cutoff`
are considered
completey contaminated and removed
complete
l
y contaminated and removed
OpenAI used:
```
...
...
scripts/make_table_results.py
View file @
fbeaa2c1
...
...
@@ -2,6 +2,7 @@
Usage:
python make_table_tasks.py --output <markdown_filename>
"""
import
json
import
logging
import
os
...
...
scripts/make_table_tasks.py
View file @
fbeaa2c1
...
...
@@ -2,6 +2,7 @@
Usage:
python make_table_tasks.py --output <markdown_filename>
"""
import
argparse
import
logging
...
...
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment