Introduction to Planning - University of Central Florida
Download
Report
Transcript Introduction to Planning - University of Central Florida
Introduction to Planning
Han Yu
University of Central Florida
Outline
What is planning?
Formal definitions of planning problems
Basic planning algorithms
Recent development
What is Planning?
Planning is a search problem that requires to
find an efficient sequence of actions that
transform a system from a given starting
state to the goal state
What’s Given?
Initial state of the problem
Goal state of the problem
A finite set of actions:
pre-conditions: a finite set of conditions for
the action to be performed
post-conditions: a finite set of conditions
that will be changed after the action is
performed
cost
What’s Output?
A sequence of actions that meet the
following criteria
every action matches the current system
state
can transform system from initial state to
goal state
the total cost of the actions is below a
specified value
Planning as a Real-World Problem
Planning problem has a wide range of
applications in the real world
planning in daily life
game world
workflow management
Planning a Trip
Begin
Preparation
Rental Car Reservation
Confirm Reservations
Airline Reservation
Hotel Reservation
End
Towers of Hanoi
Problem Analysis
Optimal solution contains 2n-1 actions,
where n is the number of disks
Up to 6 possible actions in each system
state
The upper bound on search space to
n-1)
(2
find an optimal solution will be 6
Sliding-Tile Puzzle
Planning in Workflow Management
Start Assembly
Examine Order
Gather
Components
Assembly Box
and Motherboard
Install
Video Card
Install
Network Card
Insert Modem
Plug in CD
Plug in Battery
Install
Internal Disk
Install
Motherboard
Test
End Assembly
Partial-Order Plan versus
Total-Order Plan
Partial-order plan
consists partially ordered set of actions
sequence constraints exist on these actions
plan generation algorithm can be applied to
transform partial-order plan to total-order plan
Total-order plan
consists totally ordered set of actions
Partial-Order Plan
Get brush
Paint ceiling
Start
Get ladder
Finish
Total-Order Plan
Start
Get ladder
Get brush
Paint ceiling
Finish
Start
Get brush
Get ladder
Paint ceiling
Finish
Features of Planning Problems
Large search space
Action is associated with system states
Restrictions on the action sequence
Valid solution may not exist
Optimization requirement
STRIPS Planning System
A tuple T = (P, O, I, G), where
P is a finite set of ground literals, the conditions
O is a finite set of operators
I is the initial state, a subset of P
G is the goal state, a subset of P
STRIPS Operators
Each operator O has the following attributes
PC, a set of ground literals, defines the precondition of
the operator
D, a set of ground literals, defines the conditions that
will be removed after the operation is executed
A, a set of ground literals, defines the conditions that
will be added after the operation is executed
C, the cost of the operation
A Simple Example - Blocks World
A
C
A
B
B
Initial State
On(C, A)
Clear(Fl)
On(A, Fl)
Clear(B)
On(B, Fl)
Clear(C)
C
Goal State
On(A, B)
On(B, C)
On(C, Fl)
Clear(A)
Clear(Fl)
Graphical Representation of
Initial State
On(C, A)
Clear(Fl)
On(A, Fl) Clear(B)
Start
T
On(B, Fl)
Clear(C)
Graphical Representation of
Goal State
Nil
finish
On(A, B)
On(B, C)
On(C, Fl)
Clear(A)
Clear(Fl)
Block World - Operator
Move(x, y, z)
Move block x that is above y to above z
PC: On(x,y), Clear(x), Clear(z)
D: Clear(z), On(x, y)
A: On(x,z), Clear(y), Clear(Fl)
Graphical Representation of
Operator
A
Clear(y)
Operator
PC
On(x, y)
On(x, z)
Clear(Fl)
Move(x, y, z)
Clear(x)
Clear(z)
Forward Chaining
Search from the initial state
Expand the search tree by finding the set of all
applicable operators from the current state
applicable means the precondition of the operator is a
subset of current state
Update current state
For every state that is reached, record the shortest
path (or path with lowest cost) from the initial state
to this state
If the goal state is reached, stop the algorithm
Forward Chaining
On(C, Fl)
Clear(Fl)
On(A, Fl)
Clear(B)
On(B, Fl)
Clear(C)
Clear(A)
Move(C, A, Fl)
On(C, A)
Clear(Fl)
On(A, Fl)
Clear(B)
On(B, Fl)
On(B, C)
Clear(C)
Clear(Fl)
Move(B, Fl, C)
On(C, A)
Clear(B)
On(A, Fl)
Backward Chaining
Search backward from the goal state
Expand the search tree by finding the set of all
applicable operators that can reach the current
state
applicable means set A of the operator is a subset of
current state
Update the state
If the initial state is reached, stop the algorithm
The solution is a partial-ordered plan
Constraints in action ordering may be violated
Backward Chaining
On(A, B)
On(B, C)
On(C, Fl)
Clear(A)
Clear(Fl)
On(B, C)
On(C, Fl)
Clear(A)
Clear(Fl)
On(A, Fl)
Clear(B)
Move(A, Fl, B)
Recent Developments
Plan Reuse
Graphplan
Problem-specific planning
Evolutionary computation approach
Plan Reuse
Reuse old plans for new planning problems
Consists of two steps
plan matching
plan modification
Research findings
generally, plan reuse is even harder than plan from
scratch
do better only when two problems are close enough
plan matching could be the bottleneck
Graphplan
Partial-order general planner
Constructing a planning graph before search
plan graph contains all possible actions that can be
taken in each time step
actions that interfere with one another can coexist in
the graph
More efficient than other general planners in
some problems
Problem-specific Planning
Heuristics combined during search
heuristics is problem dependent
cannot apply to other problems
Usually outperforms general planners in
specific problems
Example
in sliding-tile puzzle, accurate estimation of the
distance between current state and goal state can
speed up the search for a plan
Genetic Algorithms
Parallel search and optimization
algorithm
Inspired by the basic rule of natural
selection, survival-of-the-fittest
Non-deterministic algorithm
randomness is incorporated during
implementation
Solution Formation
Candidate solution is encoded as a list of
genes, called chromosome
1
0
1
1
0
1
Start with a population of randomly
generated chromosomes
Population is evolved in every generation
evaluate each chromosome
select chromosomes to the next generation
apply genetic operations
Genetic Operation
crossover
Before
crossover
After
crossover
Crossover point
1
0
1
1
0
1
0
0
0
1
1
1
1 0 1 1 1 1
0
0
0
1
0
1
Genetic Operation
Mutation
Before
mutation
1
0
1
1
0
1
After
mutation
1
0
0
1
0
1
GA Procedure
Initialize the population
Evaluate each chromosome in the population
While the stopping condition is not met
select chromosomes to the next generation
crossover, mutation
evaluate newly generated chromosomes
End while
Evolutionary Computation
Approach
Non-deterministic algorithm
Starts from a set of randomized plan
Plans are evolved during generations
In each generation
evaluate the performance of the plan
select the plans to next generation, based on
performance
crossover, mutation to create new plans
Evolutionary Computation
Approach
Solution encoding
a list of floating-point number, each number maps
to an action in the problem
number of actions can vary during search
0.8 0.5 0.3 0.2 0.5
A4 A3 A2 A1 A2
Evolutionary Computation
Approach
State-aware crossover
crossover points are selected based on the
matching of states
s1
Parent 1
s2
Parent 2
Child 1
Child 2
Match(s1, s2) = true
Evolutionary Computation
Approach
Mutation
randomly select an action and replace it with a
random floating-point number
Before
mutation
0.8 0.5 0.3 0.2 0.5
After
mutation
0.8 0.5 0.6 0.2 0.5
Evolutionary Computation
Approach
Fitness evaluation
consists of two parts: goal fitness fg and cost
fitness fc
goal fitness: how the solutions reach the goal
state
cost fitness: the total cost of the solution
overall fitness = a * fg + b * fc
Experimental Results
Towers of Hanoi
Sliding-tile puzzle
can solve the 5 and 6-disk case with high
probability
state-aware crossover does not perform well
can solve 3*3 problem case with high probability
state-aware crossover perform much better since
a looser definition of matching state is used
Performance scalability is not good
Heuristics may be helpful to improve the
References
Artificial Intelligence: A Modern Approach. Stuart
Russell and Peter Norvig. Prentice Hall.
Artificial Intelligence: A New Symthesis. Nils Nilsson.
Kaufmann.
Intelligent Planning: A Decomposition and Abstraction
Based Approach. Qiang Yang. Springer.
Internet-Based Workflow Management: Towards a
Semantic Web. Dan C. Marinescu. Wiley, 2002.
References
Plan reuse versus plan generation: a theoretical and
empirical analysis. B. Nebel, J. Koehler. Journal of Artificial
Intelligence 76 (1995), 427-454.
Fast planning through planning graph analysis. A. L. Blum,
M. L. Furst Journal of Artificial Intelligence 90 (1997), 281300.
Planning as heuristic search. B. Bonet, H. Geffner. Journal of
Artificial Intelligence 129 (2001), 5-33.
What it is, What it could be, An introduction to the Special Issue
on Planning and Scheduling. D. McDermott, J. Handler.
Journal of Artificial Intelligence 76 (1995), 1-16.
Concluding Words
Failing to plan is planning to fail
--Effie Jones