Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
0d195e90
Commit
0d195e90
authored
Sep 04, 2023
by
lintangsutawika
Browse files
add flan_zeroshot
parent
3531d9c1
Changes
28
Hide whitespace changes
Inline
Side-by-side
Showing
8 changed files
with
40 additions
and
0 deletions
+40
-0
lm_eval/tasks/bbh/flan_zeroshot/snarks.yaml
lm_eval/tasks/bbh/flan_zeroshot/snarks.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/sports_understanding.yaml
lm_eval/tasks/bbh/flan_zeroshot/sports_understanding.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/temporal_sequences.yaml
lm_eval/tasks/bbh/flan_zeroshot/temporal_sequences.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_five_objects.yaml
...flan_zeroshot/tracking_shuffled_objects_five_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_seven_objects.yaml
...lan_zeroshot/tracking_shuffled_objects_seven_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_three_objects.yaml
...lan_zeroshot/tracking_shuffled_objects_three_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/web_of_lies.yaml
lm_eval/tasks/bbh/flan_zeroshot/web_of_lies.yaml
+5
-0
lm_eval/tasks/bbh/flan_zeroshot/word_sorting.yaml
lm_eval/tasks/bbh/flan_zeroshot/word_sorting.yaml
+5
-0
No files found.
lm_eval/tasks/bbh/flan_zeroshot/snarks.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
snarks"
"
description"
:
"
Determine
which
of
two
sentences
is
sarcastic.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_snarks"
lm_eval/tasks/bbh/flan_zeroshot/sports_understanding.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
sports_understanding"
"
description"
:
"
Determine
whether
an
artificially
constructed
sentence
relating
to
sports
is
plausible
or
not.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_sports_understanding"
lm_eval/tasks/bbh/flan_zeroshot/temporal_sequences.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
temporal_sequences"
"
description"
:
"
Task
description:
Answer
questions
about
which
times
certain
events
could
have
occurred.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_temporal_sequences"
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_five_objects.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
tracking_shuffled_objects_five_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_tracking_shuffled_objects_five_objects"
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_seven_objects.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
tracking_shuffled_objects_seven_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_tracking_shuffled_objects_seven_objects"
lm_eval/tasks/bbh/flan_zeroshot/tracking_shuffled_objects_three_objects.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
tracking_shuffled_objects_three_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_tracking_shuffled_objects_three_objects"
lm_eval/tasks/bbh/flan_zeroshot/web_of_lies.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
web_of_lies"
"
description"
:
"
Evaluate
a
random
boolean
function
expressed
as
a
word
problem.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_web_of_lies"
lm_eval/tasks/bbh/flan_zeroshot/word_sorting.yaml
0 → 100644
View file @
0d195e90
"
dataset_name"
:
"
word_sorting"
"
description"
:
"
Sort
a
list
of
words.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_zeroshot_word_sorting"
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment