Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
601be343
Commit
601be343
authored
Jun 23, 2025
by
Baber
Browse files
Merge branch 'main' into feature/eval_from_config
parents
d0884a96
68c3a811
Changes
1000
Hide whitespace changes
Inline
Side-by-side
Showing
20 changed files
with
153 additions
and
156 deletions
+153
-156
lm_eval/tasks/acpbench/gen_2shot_with_pddl/just.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/just.yaml
+24
-0
lm_eval/tasks/acpbench/gen_2shot_with_pddl/land.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/land.yaml
+23
-0
lm_eval/tasks/acpbench/gen_2shot_with_pddl/next_act.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/next_act.yaml
+23
-0
lm_eval/tasks/acpbench/gen_2shot_with_pddl/prog.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/prog.yaml
+24
-0
lm_eval/tasks/acpbench/gen_2shot_with_pddl/reach.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/reach.yaml
+23
-0
lm_eval/tasks/acpbench/gen_2shot_with_pddl/val.yaml
lm_eval/tasks/acpbench/gen_2shot_with_pddl/val.yaml
+23
-0
lm_eval/tasks/afrimgsm/direct/afrimgsm.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm.yaml
+13
-0
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_amh.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_amh.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_eng.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_eng.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ewe.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ewe.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_fra.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_fra.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_hau.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_hau.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ibo.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ibo.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_kin.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_kin.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lin.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lin.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lug.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lug.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_orm.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_orm.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sna.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sna.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sot.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sot.yaml
+0
-12
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_swa.yaml
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_swa.yaml
+0
-12
No files found.
Too many changes to show.
To preserve performance only
1000 of 1000+
files are displayed.
Plain diff
Email patch
lm_eval/tasks/acpbench/gen_2shot_with_pddl/just.yaml
0 → 100644
View file @
601be343
task
:
acp_just_gen_with_pddl
dataset_name
:
acp_just_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-0
is
of
shape
shape0,
Key
key0-1
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f3-3f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f4-2f
has
shape0
shaped
lock,
f2-0f
has
shape0
shaped
lock.
Key
key0-1
is
at
position
f1-3f.
Key
key0-0
is
at
position
f2-2f.
The
available
actions
are:
(unlock
?curpos
?lockpos
?key
?shape)
-
unlock
place
?lockpos
with
key
?key
of
shape
?shape
from
current
position
place
?curpos,
(move
?curpos
?nextpos)
-
transition
from
the
current
position
?curpos
to
the
next
position
?nextpos,
(pickup
?curpos
?key)
-
acquire
the
key
?key
from
the
place
?curpos,
(pickup-and-loose
?curpos
?newkey
?oldkey)
-
pick
up
the
key
?newkey
from
the
current
position
?curpos
and
loose
the
key
?oldkey
which
is
being
held,
and
(putdown
?curpos
?key)
-
place
the
key
?key
at
the
current
position
place
?curpos.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
Key
key0-1
is
at
f1-3f
location
and
Key
key0-0
is
at
f2-0f
location."
question
:
"
Simplify
the
plan
\"
(move
f3-3f
f3-2f)
(move
f3-2f
f2-2f)
(pickup
f2-2f
key0-0)
(move
f2-2f
f2-1f)
(putdown
f2-1f
key0-0)
(pickup
f2-1f
key0-0)
(unlock
f2-1f
f2-0f
key0-0
shape0)
(move
f2-1f
f2-0f)
(putdown
f2-0f
key0-0)
\"
by
removing
either
a
single
action
or
a
pair
of
consecutive
actions,
while
still
maintaining
a
valid
plan.
Provide
the
resulting
simplified
plan."
answer
:
"
[(move
f3-3f
f3-2f),
(move
f3-2f
f2-2f),
(pickup
f2-2f
key0-0),
(move
f2-2f
f2-1f),
(unlock
f2-1f
f2-0f
key0-0
shape0),
(move
f2-1f
f2-0f),
(putdown
f2-0f
key0-0)]"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f2-2f)
(at
key0-1
f1-3f)
(at-robot
f3-3f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f2-0f)
(locked
f4-2f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l1-0
and
l1-1
are
in
c1;
l0-1
and
l0-0
are
in
c0.
\n
Currently,
p3,
p2,
and
p1
are
at
l1-0,
t0
is
at
l0-1,
p0
and
t1
are
at
l1-1,
a0
is
at
l0-0.
The
available
actions
are:
(load-truck
?obj
?truck
?loc)
-
place
the
object
?obj
into
the
truck
?truck
at
location
?loc,
(load-airplane
?obj
?airplane
?loc)
-
load
the
object
?obj
from
location
?loc
into
the
airplane
?airplane,
(unload-truck
?obj
?truck
?loc)
-
unload
the
object
?obj
from
the
truck
?truck
at
location
?loc,
(unload-airplane
?obj
?airplane
?loc)
-
remove
the
object
?obj
from
the
airplane
?airplane
and
place
it
on
the
location
?loc,
(drive-truck
?truck
?loc-from
?loc-to
?city)
-
navigate
the
truck
?truck
which
is
in
location
?loc-from
in
city
?city
to
another
location
?loc-to
in
the
same
city,
and
(fly-airplane
?airplane
?loc-from
?loc-to)
-
fly
the
airplane
?airplane
from
airport
?loc-from
to
airport
?loc-to.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
p2
is
at
l1-0,
p0
is
at
l0-0,
p3
is
at
l0-1,
and
p1
is
at
l1-0."
question
:
"
Simplify
the
plan
\"
(load-truck
p0
t1
l1-1)
(unload-truck
p0
t1
l1-1)
(load-truck
p0
t1
l1-1)
(drive-truck
t1
l1-1
l1-0
c1)
(unload-truck
p0
t1
l1-0)
(fly-airplane
a0
l0-0
l1-0)
(load-airplane
p0
a0
l1-0)
(load-airplane
p3
a0
l1-0)
(fly-airplane
a0
l1-0
l0-0)
(unload-airplane
p0
a0
l0-0)
(unload-airplane
p3
a0
l0-0)
(drive-truck
t0
l0-1
l0-0
c0)
(load-truck
p3
t0
l0-0)
(drive-truck
t0
l0-0
l0-1
c0)
(unload-truck
p3
t0
l0-1)
\"
by
removing
either
a
single
action
or
a
pair
of
consecutive
actions,
while
still
maintaining
a
valid
plan.
Provide
the
resulting
simplified
plan."
answer
:
"
[(load-truck
p0
t1
l1-1),
(drive-truck
t1
l1-1
l1-0
c1),
(unload-truck
p0
t1
l1-0),
(fly-airplane
a0
l0-0
l1-0),
(load-airplane
p0
a0
l1-0),
(load-airplane
p3
a0
l1-0),
(fly-airplane
a0
l1-0
l0-0),
(unload-airplane
p0
a0
l0-0),
(unload-airplane
p3
a0
l0-0),
(drive-truck
t0
l0-1
l0-0
c0),
(load-truck
p3
t0
l0-0),
(drive-truck
t0
l0-0
l0-1
c0),
(unload-truck
p3
t0
l0-1)]"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l0-0)
(at
p0
l1-1)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l1-0)
(at
t0
l0-1)
(at
t1
l1-1)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
action_list"
clean
:
"
simplified
plan"
-
function
:
"
take_first"
lm_eval/tasks/acpbench/gen_2shot_with_pddl/land.yaml
0 → 100644
View file @
601be343
task
:
acp_land_gen_with_pddl
dataset_name
:
acp_land_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-0
is
of
shape
shape0,
Key
key0-1
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f1-0f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f4-2f
has
shape0
shaped
lock.
Key
key0-1
is
at
position
f1-3f.
Key
key0-0
is
at
position
f1-0f.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
Key
key0-1
is
at
f1-3f
location
and
Key
key0-0
is
at
f2-0f
location.
The
available
propositions
are:
(at
?r
?x)
-
Key
?r
is
at
?x
location,
(at-robot
?x)
-
Robot
is
at
?x
location,
(locked
?x)
-
Location
?x
is
locked,
(holding
?k)
-
Robot
is
holding
?k,
(open
?x)
-
Location
?x
is
open,
and
(arm-empty)
-
Robot's
arm
is
empty."
question
:
"
Generate
a
non-trivial
fact
landmark,
one
that
does
not
hold
in
the
initial
state
or
goal."
answer
:
"
(at-robot
f2-0f)"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f1-0f)
(at
key0-1
f1-3f)
(at-robot
f1-0f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f4-2f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-0f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l0-1
and
l0-0
are
in
c0;
l1-0
and
l1-1
are
in
c1.
\n
Currently,
a0
and
t0
are
at
l0-0,
t1
is
at
l1-1,
p2,
p1,
and
p3
are
at
l1-0,
p0
is
in
t1.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
p1
is
at
l1-0,
p0
is
at
l0-0,
p2
is
at
l1-0,
and
p3
is
at
l0-1.
The
available
propositions
are:
(at
?obj
?loc)
-
?obj
is
at
?loc
and
(in
?obj1
?obj2)
-
?obj1
is
in
?obj2."
question
:
"
Generate
a
non-trivial
fact
landmark,
one
that
does
not
hold
in
the
initial
state
or
goal."
answer
:
"
(in
p3
t0)"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l1-0)
(at
t0
l0-0)
(at
t1
l1-1)
(in
p0
t1)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
Provide
only
the
ground
proposition
or
None.
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
act"
-
function
:
"
take_first"
lm_eval/tasks/acpbench/gen_2shot_with_pddl/next_act.yaml
0 → 100644
View file @
601be343
task
:
acp_nexta_gen_with_pddl
dataset_name
:
acp_nexta_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-0
is
of
shape
shape0,
Key
key0-1
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f1-1f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f4-2f
has
shape0
shaped
lock.
Key
key0-0
is
at
position
f1-0f.
Key
key0-1
is
at
position
f1-3f.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
Key
key0-1
is
at
f1-3f
location
and
Key
key0-0
is
at
f2-0f
location.
The
available
actions
are:
(unlock
?curpos
?lockpos
?key
?shape)
-
use
the
key
?key
of
shape
?shape
to
unlock
the
place
?lockpos
from
the
current
position
?curpos,
(move
?curpos
?nextpos)
-
move
to
place
?nextpos
from
place
?curpos,
(pickup
?curpos
?key)
-
retrieve
the
key
?key
from
its
current
position
?curpos,
(pickup-and-loose
?curpos
?newkey
?oldkey)
-
pick
up
key
?newkey
at
current
position
place
?curpos
and
loose
key
?oldkey
being
held,
and
(putdown
?curpos
?key)
-
put
the
key
?key
at
the
current
position
place
?curpos."
question
:
"
What
is
the
next
action
that
takes
us
towards
the
goal?"
answer
:
"
(move
f1-1f
f1-0f)"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f1-0f)
(at
key0-1
f1-3f)
(at-robot
f1-1f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f4-2f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-0f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l1-1
and
l1-0
are
in
c1;
l0-0
and
l0-1
are
in
c0.
\n
Currently,
p1,
p3,
t1,
p2,
and
a0
are
at
l1-0,
t0
is
at
l0-0,
p0
is
in
a0.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
p1
is
at
l1-0,
p3
is
at
l0-1,
p0
is
at
l0-0,
and
p2
is
at
l1-0.
The
available
actions
are:
(load-truck
?obj
?truck
?loc)
-
load
the
object
?obj
from
location
?loc
into
the
truck
?truck,
(load-airplane
?obj
?airplane
?loc)
-
load
the
object
?obj
from
location
?loc
onto
the
airplane
?airplane,
(unload-truck
?obj
?truck
?loc)
-
unload
the
object
?obj
from
the
truck
?truck
at
location
?loc,
(unload-airplane
?obj
?airplane
?loc)
-
offload
the
object
?obj
from
the
airplane
?airplane
at
location
?loc,
(drive-truck
?truck
?loc-from
?loc-to
?city)
-
navigate
the
truck
?truck
which
is
in
location
?loc-from
in
city
?city
to
another
location
?loc-to
in
the
same
city,
and
(fly-airplane
?airplane
?loc-from
?loc-to)
-
fly
the
airplane
?airplane
from
airport
?loc-from
to
airport
?loc-to."
question
:
"
What
is
the
next
action
that
takes
us
towards
the
goal?"
answer
:
"
(load-airplane
p3
a0
l1-0)"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l1-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l1-0)
(at
t0
l0-0)
(at
t1
l1-0)
(in
p0
a0)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
Each
action
starts
with
an
opening
parenthesis
and
ends
with
closing
parenthesis.
Provide
only
the
action.
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
action_name"
-
function
:
"
take_first"
lm_eval/tasks/acpbench/gen_2shot_with_pddl/prog.yaml
0 → 100644
View file @
601be343
task
:
acp_prog_gen_with_pddl
dataset_name
:
acp_prog_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-0
is
of
shape
shape0,
Key
key0-1
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f2-2f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f4-2f
has
shape0
shaped
lock,
f2-0f
has
shape0
shaped
lock.
Key
key0-1
is
at
position
f1-3f.
Key
key0-0
is
at
position
f2-2f.
The
available
propositions
are:
(at
?r
?x)
-
Key
?r
is
at
?x
location,
(at-robot
?x)
-
Robot
is
at
?x
location,
(locked
?x)
-
Location
?x
is
locked,
(holding
?k)
-
Robot
is
holding
?k,
(open
?x)
-
Location
?x
is
open,
and
(arm-empty)
-
Robot
is
not
holding
anything."
question
:
"
Break
down
the
outcomes
of
performing
the
action
\"
retrieve
the
key
key0-0
from
its
current
position
f0-1f
\"
into
two
lists,
positive
effects
and
negative
effects.
Positive
effects
are
the
propositions
that
are
false
in
the
current
state
but
will
become
true
after
performing
the
action.
Negative
effects
are
the
propositions
that
are
true
in
the
current
state
and
will
become
false
after
performing
the
action."
answer
:
"
[(at-robot
f1-2f)]
[(at-robot
f2-2f)]"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f2-2f)
(at
key0-1
f1-3f)
(at-robot
f2-2f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f2-0f)
(locked
f4-2f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l0-0
and
l0-1
are
in
c0;
l1-0
and
l1-1
are
in
c1.
\n
Currently,
p2
and
t1
are
at
l1-0,
a0
and
t0
are
at
l0-0,
p0
and
p3
are
in
a0,
p1
is
in
t1.
The
available
propositions
are:
(at
?obj
?loc)
-
?obj
is
at
?loc
and
(in
?obj1
?obj2)
-
?obj1
is
in
?obj2."
question
:
"
Break
down
the
outcomes
of
performing
the
action
\"
navigate
the
truck
t1
which
is
in
location
l1-0
in
city
c1
to
another
location
l1-1
in
the
same
city
\"
into
two
lists,
positive
effects
and
negative
effects.
Positive
effects
are
the
propositions
that
are
false
in
the
current
state
but
will
become
true
after
performing
the
action.
Negative
effects
are
the
propositions
that
are
true
in
the
current
state
and
will
become
false
after
performing
the
action."
answer
:
"
[(at
t1
l1-1)]
[(at
t1
l1-0)]"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l0-0)
(at
p2
l1-0)
(at
t0
l0-0)
(at
t1
l1-0)
(in
p0
a0)
(in
p1
t1)
(in
p3
a0)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
Provide
only
the
two
lists
with
the
ground
propositions.
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
progression_list"
clean
:
"
pos_neg"
-
function
:
"
take_first"
lm_eval/tasks/acpbench/gen_2shot_with_pddl/reach.yaml
0 → 100644
View file @
601be343
task
:
acp_reach_gen_with_pddl
dataset_name
:
acp_reach_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-1
is
of
shape
shape0,
Key
key0-0
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f3-1f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f2-0f
has
shape0
shaped
lock.
Key
key0-1
is
at
position
f1-3f.
Key
key0-0
is
at
position
f3-1f.
The
available
propositions
are:
(at
?r
?x)
-
Key
?r
is
at
?x
location,
(at-robot
?x)
-
Robot
is
at
?x
location,
(locked
?x)
-
Location
?x
is
locked,
(holding
?k)
-
Robot
is
holding
?k,
(open
?x)
-
Location
?x
is
open,
and
(arm-empty)
-
Robot's
arm
is
empty."
question
:
"
What
proposition
can
never
hold
in
any
potentially
reachable
state?"
answer
:
"
(locked
f2-2f)"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f3-1f)
(at
key0-1
f1-3f)
(at-robot
f3-1f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f2-0f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-2f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l1-0
and
l1-1
are
in
c1;
l0-1
and
l0-0
are
in
c0.
\n
Currently,
t1,
a0,
and
p2
are
at
l1-0,
t0
and
p0
are
at
l0-0,
p1
is
in
t1,
p3
is
in
t0.
The
available
propositions
are:
(at
?obj
?loc)
-
?obj
is
at
?loc
and
(in
?obj1
?obj2)
-
?obj1
is
in
?obj2."
question
:
"
What
proposition
can
never
hold
in
any
potentially
reachable
state?"
answer
:
"
(at
t0
l1-0)"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l1-0)
(at
p0
l0-0)
(at
p2
l1-0)
(at
t0
l0-0)
(at
t1
l1-0)
(in
p1
t1)
(in
p3
t0)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
Provide
one
proposition
or
None.
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
act"
-
function
:
"
take_first"
lm_eval/tasks/acpbench/gen_2shot_with_pddl/val.yaml
0 → 100644
View file @
601be343
task
:
acp_val_gen_with_pddl
dataset_name
:
acp_val_gen
include
:
_gen_yaml_2shot
fewshot_config
:
sampler
:
first_n
samples
:
-
context
:
"
A
robot
is
in
a
grid
and
can
only
move
to
places
that
are
connected
to
its
current
position.
\n
The
grid
size
is
5x5,
and
the
locations
are
of
the
form
fi-jf
(e.g.,
f3-2f
or
f0-1f).
The
grid
cells
are
connected
to
their
neighbors
(e.g.,
f1-2f
is
connected
to
the
four
neighbors
f0-2f,
f2-2f,
f1-1f,
and
f1-3f).
Some
positions
on
the
grid
are
locked
and
can
be
opened
with
a
key
of
a
matching
shape.
The
robot
has
an
arm
that
can
pick
up
a
key
when
the
key
is
in
same
location
as
the
robot
and
the
arm
is
empty.
\n
There
are
2
keys
in
0
different
shapes:
Key
key0-1
is
of
shape
shape0,
Key
key0-0
is
of
shape
shape0.
\n
Currently,
the
robot
is
at
position
f3-3f
and
its
arm
is
empty.
All
the
positions
are
open
except
the
following:
f2-0f
has
shape0
shaped
lock,
f4-2f
has
shape0
shaped
lock.
Key
key0-1
is
at
position
f1-3f.
Key
key0-0
is
at
position
f2-2f.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
Key
key0-1
is
at
f1-3f
location
and
Key
key0-0
is
at
f2-0f
location.
The
available
actions
are:
(unlock
?curpos
?lockpos
?key
?shape)
-
unlock
the
place
?lockpos
with
the
key
?key
of
the
shape
?shape
from
the
current
position
place
?curpos,
(move
?curpos
?nextpos)
-
move
from
place
?curpos
to
place
?nextpos,
(pickup
?curpos
?key)
-
retrieve
the
key
?key
from
its
current
position
?curpos,
(pickup-and-loose
?curpos
?newkey
?oldkey)
-
pick
up
the
key
?newkey
at
the
current
position
place
?curpos
and
loose
the
key
?oldkey
being
held,
and
(putdown
?curpos
?key)
-
put
the
key
?key
at
the
current
position
place
?curpos."
question
:
"
What
is
the
first
inapplicable
action
in
the
next
sequence
of
actions:
\"
(unlock
f1-0f
f2-0f
key0-1
shape0)
(move
f2-3f
f2-2f)
(pickup
f2-2f
key0-0)
(move
f2-2f
f2-1f)
(unlock
f2-1f
f2-0f
key0-0
shape0)
(move
f2-1f
f2-0f)
(putdown
f2-0f
key0-0)
\"
?"
answer
:
"
0"
PDDL_domain
:
"
(define
(domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:types
key
place
shape
-
object)
\n
(:predicates
(arm-empty)
(at
?r
-
key
?x
-
place)
(at-robot
?x
-
place)
(conn
?x
-
place
?y
-
place)
(holding
?k
-
key)
(key-shape
?k
-
key
?s
-
shape)
(lock-shape
?x
-
place
?s
-
shape)
(locked
?x
-
place)
(open
?x
-
place))
\n
(:action
move
\n
:parameters
(?curpos
-
place
?nextpos
-
place)
\n
:precondition
(and
(at-robot
?curpos)
(conn
?curpos
?nextpos)
(open
?nextpos))
\n
:effect
(and
(at-robot
?nextpos)
(not
(at-robot
?curpos)))
\n
)
\n
(:action
pickup
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(at
?key
?curpos)
(arm-empty))
\n
:effect
(and
(holding
?key)
(not
(at
?key
?curpos))
(not
(arm-empty)))
\n
)
\n
(:action
pickup-and-loose
\n
:parameters
(?curpos
-
place
?newkey
-
key
?oldkey
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?oldkey)
(at
?newkey
?curpos))
\n
:effect
(and
(holding
?newkey)
(at
?oldkey
?curpos)
(not
(holding
?oldkey))
(not
(at
?newkey
?curpos)))
\n
)
\n
(:action
putdown
\n
:parameters
(?curpos
-
place
?key
-
key)
\n
:precondition
(and
(at-robot
?curpos)
(holding
?key))
\n
:effect
(and
(arm-empty)
(at
?key
?curpos)
(not
(holding
?key)))
\n
)
\n
(:action
unlock
\n
:parameters
(?curpos
-
place
?lockpos
-
place
?key
-
key
?shape
-
shape)
\n
:precondition
(and
(conn
?curpos
?lockpos)
(key-shape
?key
?shape)
(lock-shape
?lockpos
?shape)
(at-robot
?curpos)
(locked
?lockpos)
(holding
?key))
\n
:effect
(and
(open
?lockpos)
(not
(locked
?lockpos)))
\n
)
\n
)"
PDDL_problem
:
"
(define
(problem
grid-x5-y5-t1-k2-l2-p100)
\n
(:domain
grid)
\n
(:requirements
:strips
:typing)
\n
(:objects
key0-0
key0-1
-
key
f0-0f
f0-1f
f0-2f
f0-3f
f0-4f
f1-0f
f1-1f
f1-2f
f1-3f
f1-4f
f2-0f
f2-1f
f2-2f
f2-3f
f2-4f
f3-0f
f3-1f
f3-2f
f3-3f
f3-4f
f4-0f
f4-1f
f4-2f
f4-3f
f4-4f
-
place
shape0
-
shape)
\n
(:init
(arm-empty)
(at
key0-0
f2-2f)
(at
key0-1
f1-3f)
(at-robot
f3-3f)
(conn
f0-0f
f0-1f)
(conn
f0-0f
f1-0f)
(conn
f0-1f
f0-0f)
(conn
f0-1f
f0-2f)
(conn
f0-1f
f1-1f)
(conn
f0-2f
f0-1f)
(conn
f0-2f
f0-3f)
(conn
f0-2f
f1-2f)
(conn
f0-3f
f0-2f)
(conn
f0-3f
f0-4f)
(conn
f0-3f
f1-3f)
(conn
f0-4f
f0-3f)
(conn
f0-4f
f1-4f)
(conn
f1-0f
f0-0f)
(conn
f1-0f
f1-1f)
(conn
f1-0f
f2-0f)
(conn
f1-1f
f0-1f)
(conn
f1-1f
f1-0f)
(conn
f1-1f
f1-2f)
(conn
f1-1f
f2-1f)
(conn
f1-2f
f0-2f)
(conn
f1-2f
f1-1f)
(conn
f1-2f
f1-3f)
(conn
f1-2f
f2-2f)
(conn
f1-3f
f0-3f)
(conn
f1-3f
f1-2f)
(conn
f1-3f
f1-4f)
(conn
f1-3f
f2-3f)
(conn
f1-4f
f0-4f)
(conn
f1-4f
f1-3f)
(conn
f1-4f
f2-4f)
(conn
f2-0f
f1-0f)
(conn
f2-0f
f2-1f)
(conn
f2-0f
f3-0f)
(conn
f2-1f
f1-1f)
(conn
f2-1f
f2-0f)
(conn
f2-1f
f2-2f)
(conn
f2-1f
f3-1f)
(conn
f2-2f
f1-2f)
(conn
f2-2f
f2-1f)
(conn
f2-2f
f2-3f)
(conn
f2-2f
f3-2f)
(conn
f2-3f
f1-3f)
(conn
f2-3f
f2-2f)
(conn
f2-3f
f2-4f)
(conn
f2-3f
f3-3f)
(conn
f2-4f
f1-4f)
(conn
f2-4f
f2-3f)
(conn
f2-4f
f3-4f)
(conn
f3-0f
f2-0f)
(conn
f3-0f
f3-1f)
(conn
f3-0f
f4-0f)
(conn
f3-1f
f2-1f)
(conn
f3-1f
f3-0f)
(conn
f3-1f
f3-2f)
(conn
f3-1f
f4-1f)
(conn
f3-2f
f2-2f)
(conn
f3-2f
f3-1f)
(conn
f3-2f
f3-3f)
(conn
f3-2f
f4-2f)
(conn
f3-3f
f2-3f)
(conn
f3-3f
f3-2f)
(conn
f3-3f
f3-4f)
(conn
f3-3f
f4-3f)
(conn
f3-4f
f2-4f)
(conn
f3-4f
f3-3f)
(conn
f3-4f
f4-4f)
(conn
f4-0f
f3-0f)
(conn
f4-0f
f4-1f)
(conn
f4-1f
f3-1f)
(conn
f4-1f
f4-0f)
(conn
f4-1f
f4-2f)
(conn
f4-2f
f3-2f)
(conn
f4-2f
f4-1f)
(conn
f4-2f
f4-3f)
(conn
f4-3f
f3-3f)
(conn
f4-3f
f4-2f)
(conn
f4-3f
f4-4f)
(conn
f4-4f
f3-4f)
(conn
f4-4f
f4-3f)
(key-shape
key0-0
shape0)
(key-shape
key0-1
shape0)
(lock-shape
f2-0f
shape0)
(lock-shape
f4-2f
shape0)
(locked
f2-0f)
(locked
f4-2f)
(open
f0-0f)
(open
f0-1f)
(open
f0-2f)
(open
f0-3f)
(open
f0-4f)
(open
f1-0f)
(open
f1-1f)
(open
f1-2f)
(open
f1-3f)
(open
f1-4f)
(open
f2-1f)
(open
f2-2f)
(open
f2-3f)
(open
f2-4f)
(open
f3-0f)
(open
f3-1f)
(open
f3-2f)
(open
f3-3f)
(open
f3-4f)
(open
f4-0f)
(open
f4-1f)
(open
f4-3f)
(open
f4-4f))
\n
(:goal
(and
(at
key0-0
f2-0f)
(at
key0-1
f1-3f)))
\n
)"
-
context
:
"
There
are
several
cities,
each
containing
several
locations,
some
of
which
are
airports.
There
are
also
trucks,
which
can
drive
within
a
single
city,
and
airplanes,
which
can
fly
between
airports.
The
goal
is
to
get
some
packages
from
various
locations
to
various
new
locations.
\n
There
are
2
trucks
and
1
airplane,
as
well
as
4
packages.
There
are
4
locations
across
2
cities.
\n
The
locations
are
in
cities
as
follows:
l1-1
and
l1-0
are
in
c1;
l0-1
and
l0-0
are
in
c0.
\n
Currently,
t1
and
p0
are
at
l1-1,
p3,
p2,
and
p1
are
at
l1-0,
t0
is
at
l0-1,
a0
is
at
l0-0.
The
goal
is
to
reach
a
state
where
the
following
facts
hold:
p0
is
at
l0-0,
p3
is
at
l0-1,
p2
is
at
l1-0,
and
p1
is
at
l1-0.
The
available
actions
are:
(load-truck
?obj
?truck
?loc)
-
load
object
?obj
into
truck
?truck
at
location
?loc,
(load-airplane
?obj
?airplane
?loc)
-
load
the
object
?obj
from
location
?loc
into
the
airplane
?airplane,
(unload-truck
?obj
?truck
?loc)
-
offload
the
object
?obj
from
the
truck
?truck
at
location
?loc,
(unload-airplane
?obj
?airplane
?loc)
-
remove
the
object
?obj
from
the
airplane
?airplane
and
place
it
on
the
location
?loc,
(drive-truck
?truck
?loc-from
?loc-to
?city)
-
navigate
the
truck
?truck
which
is
in
location
?loc-from
in
city
?city
to
another
location
?loc-to
in
the
same
city,
and
(fly-airplane
?airplane
?loc-from
?loc-to)
-
fly
airplane
?airplane
from
airport
?loc-from
to
airport
?loc-to."
question
:
"
What
is
the
first
inapplicable
action
in
the
next
sequence
of
actions:
\"
(drive-truck
t0
l0-1
l0-0
c0)
(fly-airplane
a0
l0-0
l1-0)
(load-airplane
p3
a0
l1-0)
(load-truck
p0
t1
l1-1)
(drive-truck
t1
l1-1
l1-0
c1)
(unload-truck
p0
t1
l1-0)
(load-airplane
p0
a0
l1-0)
(fly-airplane
a0
l1-0
l0-0)
(unload-airplane
p0
a0
l0-0)
(unload-airplane
p3
a0
l0-0)
(load-truck
p3
t0
l0-0)
(drive-truck
t0
l0-0
l0-1
c0)
(unload-airplane
p3
a0
l0-0)
\"
?"
answer
:
"
12"
PDDL_domain
:
"
(define
(domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n\n
(:types
\n
location
locatable
city
-
object
\n
package
movable
-
locatable
\n
airport
-
location
\n
airplane
truck
-
movable
\n
)
\t\t\n
\n
(:predicates
\t\n\t\t
(at
?obj
-
locatable
?loc
-
location)
\n\t\t
(in
?obj1
-
package
?obj2
-
movable)
\n\t\t
(in-city
?obj
-
location
?city
-
city))
\n\n\n
(:action
LOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(at
?obj
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?truck)))
\n\n
(:action
LOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?obj
?loc)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(at
?obj
?loc))
(in
?obj
?airplane)))
\n\n\n\n
(:action
UNLOAD-TRUCK
\n
:parameters
\n
(?obj
-
package
\n
?truck
-
truck
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(at
?truck
?loc)
(in
?obj
?truck))
\n
:effect
\n
(and
(not
(in
?obj
?truck))
(at
?obj
?loc)))
\n\n
(:action
UNLOAD-AIRPLANE
\n
:parameters
\n
(?obj
-
package
\n
?airplane
-
airplane
\n
?loc
-
location)
\n
:precondition
\n
(and
\n
(in
?obj
?airplane)
(at
?airplane
?loc))
\n
:effect
\n
(and
(not
(in
?obj
?airplane))
(at
?obj
?loc)))
\n\n
(:action
DRIVE-TRUCK
\n
:parameters
\n
(?truck
-
truck
\n
?loc-from
-
location
\n
?loc-to
-
location
\n
?city
-
city)
\n
:precondition
\n
(and
\n
(at
?truck
?loc-from)
\n
(in-city
?loc-from
?city)
\n
(in-city
?loc-to
?city))
\n
:effect
\n
(and
(not
(at
?truck
?loc-from))
(at
?truck
?loc-to)))
\n\n
(:action
FLY-AIRPLANE
\n
:parameters
\n
(?airplane
-
airplane
\n
?loc-from
-
airport
\n
?loc-to
-
airport)
\n
:precondition
\n
(and
\n\t
(at
?airplane
?loc-from))
\n
:effect
\n
(and
(not
(at
?airplane
?loc-from))
(at
?airplane
?loc-to)))
\n
)"
PDDL_problem
:
"
(define
(problem
logistics-c2-s2-p4-a1)
\n
(:domain
logistics-strips)
\n
(:requirements
:strips
:typing)
\n
(:objects
a0
-
airplane
l0-0
l1-0
-
airport
c0
c1
-
city
l0-1
l1-1
-
location
p0
p1
p2
p3
-
package
t0
t1
-
truck)
\n
(:init
(at
a0
l0-0)
(at
p0
l1-1)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l1-0)
(at
t0
l0-1)
(at
t1
l1-1)
(in-city
l0-0
c0)
(in-city
l0-1
c0)
(in-city
l1-0
c1)
(in-city
l1-1
c1))
\n
(:goal
(and
(at
p0
l0-0)
(at
p1
l1-0)
(at
p2
l1-0)
(at
p3
l0-1)))
\n
)"
doc_to_text
:
"
#
PDDL
DOMAIN
\n\n
```
\n
{{PDDL_domain}}
\n
```
\n\n
#
PDDL
PROBLEM
\n\n
```
\n
{{PDDL_problem}}
\n
```
\n\n
**Question**:
{{context}}
{{question}}
Provide
only
the
index
of
the
action.
**Final
Answer**:"
filter_list
:
-
name
:
"
acp_grammar_parse"
filter
:
-
function
:
"
ACP_grammar_filter"
grammar_task
:
"
index"
-
function
:
"
take_first"
lm_eval/tasks/afrimgsm/direct/afrimgsm.yaml
0 → 100644
View file @
601be343
group
:
afrimgsm-irokobench
task
:
-
afrimgsm_tasks_prompt_1
-
afrimgsm_tasks_prompt_2
-
afrimgsm_tasks_prompt_3
-
afrimgsm_tasks_prompt_4
-
afrimgsm_tasks_prompt_5
aggregate_metric_list
:
-
metric
:
acc
aggregation
:
mean
weight_by_size
:
true
metadata
:
version
:
2
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_amh.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
amh
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[15:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_amh
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_eng.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
eng
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_eng
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ewe.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
ewe
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_ewe
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_fra.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
fra
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_fra
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_hau.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
hau
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_hau
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_ibo.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
ibo
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_ibo
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_kin.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
kin
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_kin
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lin.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
lin
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_lin
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_lug.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
lug
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_lug
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_orm.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
orm
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_orm
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sna.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
sna
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_sna
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_sot.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
sot
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_sot
lm_eval/tasks/afrimgsm/direct/afrimgsm_direct_swa.yaml
deleted
100644 → 0
View file @
d0884a96
# Generated by utils.py
dataset_name
:
swa
doc_to_target
:
'
{%
if
answer
is
not
none
%}{{answer[21:]}}{%
else
%}{{answer_number|string}}{%
endif
%}'
doc_to_text
:
'
{%
if
answer
is
not
none
%}{{question+"\nAnswer:"}}{%
else
%}{{"Question:
"+question+"\nAnswer:"}}{%
endif
%}'
generation_kwargs
:
do_sample
:
false
until
:
-
'
Question:'
-
</s>
-
<|im_end|>
include
:
direct_yaml
task
:
afrimgsm_direct_swa
Prev
1
2
3
4
5
6
7
…
50
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment