gaoqiong / lm-evaluation-harness
Commit 05e6505b (Unverified)
Authored Aug 04, 2024 by zhabuye; committed by GitHub, Aug 04, 2024
Update README.md (#2125)
Parent: 836eba52
Showing 1 changed file with 1 addition and 1 deletion
lm_eval/tasks/README.md (+1, -1)
@@ -26,7 +26,7 @@
 | [ceval](ceval/README.md) | Tasks that evaluate language understanding and reasoning in an educational context. | Chinese |
 | [cmmlu](cmmlu/README.md) | Multi-subject multiple choice question tasks for comprehensive academic assessment. | Chinese |
 | code_x_glue | Tasks that involve understanding and generating code across multiple programming languages. | Go, Java, JS, PHP, Python, Ruby |
-| [commonsense_qa](commmonsense_qa/README.md) | CommonsenseQA, a multiple-choice QA dataset for measuring commonsense knowledge. | English |
+| [commonsense_qa](commonsense_qa/README.md) | CommonsenseQA, a multiple-choice QA dataset for measuring commonsense knowledge. | English |
 | [copal_id](copal_id/README.md) | Indonesian causal commonsense reasoning dataset that captures local nuances. | Indonesian |
 | [coqa](coqa/README.md) | Conversational question answering tasks to test dialog understanding. | English |
 | [crows_pairs](crows_pairs/README.md) | Tasks designed to test model biases in various sociodemographic groups. | English, French |
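The change above fixes a link target that was misspelled as `commmonsense_qa/README.md`. A typo like this could be caught automatically by checking that each relative link in the task table points at a file that exists. The following is a minimal sketch of such a check, not part of this commit or of lm-evaluation-harness: the function name `find_broken_links` and the regular expression are assumptions made for illustration, and the pattern only handles simple inline links of the form `[label](path)`.

```python
import re
import tempfile
from pathlib import Path

# Hypothetical helper, not part of lm-evaluation-harness.
# Matches simple inline Markdown links such as [name](path/README.md).
LINK_RE = re.compile(r"\[([^\]]+)\]\(([^)\s]+)\)")


def find_broken_links(markdown_text, base_dir):
    """Return (label, target) pairs whose relative target is missing under base_dir."""
    broken = []
    for label, target in LINK_RE.findall(markdown_text):
        # Skip absolute URLs and in-page anchors; only relative paths are checked.
        if "://" in target or target.startswith("#"):
            continue
        # Drop any "#fragment" suffix before testing the file system.
        path = target.split("#", 1)[0]
        if not (Path(base_dir) / path).exists():
            broken.append((label, target))
    return broken


if __name__ == "__main__":
    # Build a throwaway directory that mimics lm_eval/tasks/ for the demo.
    with tempfile.TemporaryDirectory() as tasks_dir:
        readme = Path(tasks_dir) / "commonsense_qa" / "README.md"
        readme.parent.mkdir()
        readme.write_text("placeholder")

        old_row = "| [commonsense_qa](commmonsense_qa/README.md) | ... | English |"
        new_row = "| [commonsense_qa](commonsense_qa/README.md) | ... | English |"

        print(find_broken_links(old_row, tasks_dir))  # reports the misspelled target
        print(find_broken_links(new_row, tasks_dir))  # reports nothing
```

Run against the real `lm_eval/tasks/README.md` with `base_dir` set to `lm_eval/tasks`, a check along these lines would flag the pre-fix row and pass the post-fix row, assuming the directory layout matches the link targets in the table.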