Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
c06b0d6e
Commit
c06b0d6e
authored
Sep 04, 2023
by
lintangsutawika
Browse files
add flan_cot_zeroshot
parent
13940f1e
Changes
28
Hide whitespace changes
Inline
Side-by-side
Showing
8 changed files
with
40 additions
and
0 deletions
+40
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/snarks.yaml
lm_eval/tasks/bbh/flan_cot_zeroshot/snarks.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/sports_understanding.yaml
...val/tasks/bbh/flan_cot_zeroshot/sports_understanding.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/temporal_sequences.yaml
lm_eval/tasks/bbh/flan_cot_zeroshot/temporal_sequences.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_five_objects.yaml
..._cot_zeroshot/tracking_shuffled_objects_five_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_seven_objects.yaml
...cot_zeroshot/tracking_shuffled_objects_seven_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_three_objects.yaml
...cot_zeroshot/tracking_shuffled_objects_three_objects.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/web_of_lies.yaml
lm_eval/tasks/bbh/flan_cot_zeroshot/web_of_lies.yaml
+5
-0
lm_eval/tasks/bbh/flan_cot_zeroshot/word_sorting.yaml
lm_eval/tasks/bbh/flan_cot_zeroshot/word_sorting.yaml
+5
-0
No files found.
lm_eval/tasks/bbh/flan_cot_zeroshot/snarks.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
snarks"
"
description"
:
"
Determine
which
of
two
sentences
is
sarcastic.
\n\n
According
to
Cambridge
University
Dictionary,
sarcasm
is
\"
the
use
of
remarks
that
clearly
mean
the
opposite
of
what
they
say,
made
in
order
to
hurt
someone's
feelings
or
to
criticize
something
in
a
humorous
way.
\"
Sarcastic
sentences
often
contain
satirical
or
ironic
utterances,
hyperboles,
ambivalent
or
witty
remarks.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_snarks"
lm_eval/tasks/bbh/flan_cot_zeroshot/sports_understanding.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
sports_understanding"
"
description"
:
"
Determine
whether
an
artificially
constructed
sentence
relating
to
sports
is
plausible
or
not.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_sports_understanding"
lm_eval/tasks/bbh/flan_cot_zeroshot/temporal_sequences.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
temporal_sequences"
"
description"
:
"
Task
description:
Answer
questions
about
which
times
certain
events
could
have
occurred.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_temporal_sequences"
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_five_objects.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
tracking_shuffled_objects_five_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_tracking_shuffled_objects_five_objects"
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_seven_objects.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
tracking_shuffled_objects_seven_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_tracking_shuffled_objects_seven_objects"
lm_eval/tasks/bbh/flan_cot_zeroshot/tracking_shuffled_objects_three_objects.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
tracking_shuffled_objects_three_objects"
"
description"
:
"
A
task
requiring
determining
the
final
positions
of
a
set
of
objects
given
their
initial
positions
and
a
description
of
a
sequence
of
swaps.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_tracking_shuffled_objects_three_objects"
lm_eval/tasks/bbh/flan_cot_zeroshot/web_of_lies.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
web_of_lies"
"
description"
:
"
Evaluate
a
random
boolean
function
expressed
as
a
word
problem.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_web_of_lies"
lm_eval/tasks/bbh/flan_cot_zeroshot/word_sorting.yaml
0 → 100644
View file @
c06b0d6e
"
dataset_name"
:
"
word_sorting"
"
description"
:
"
Sort
a
list
of
words.
\n\n
"
"
doc_to_text"
:
"
Q:
{{input}}
\n
A:
Let's
think
step
by
step.
\n
"
"
include"
:
"
_template_yaml"
"
task"
:
"
bbh_flan_cot_zeroshot_word_sorting"
Prev
1
2
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment