Probabilistic Reasoning over Time
Russell and Norvig: ch 15
Temporal Probabilistic Agent
[Diagram: an agent connected to its environment through sensors and actuators, operating over time steps t1, t2, t3, …]
Time and Uncertainty
The world changes; we need to track and predict it.
Examples: diabetes management, traffic monitoring
Basic idea: copy state and evidence variables for each time step
Xt – set of unobservable state variables at time t
  e.g., BloodSugart, StomachContentst
Et – set of evidence variables at time t
  e.g., MeasuredBloodSugart, PulseRatet, FoodEatent
Assumes discrete time steps
States and Observations
The process of change is viewed as a series of snapshots, each describing the state of the world at a particular time.
Each time slice involves a set of random variables indexed by t:
1. the set of unobservable state variables Xt
2. the set of observable evidence variables Et
The observation at time t is Et = et for some set of values et.
The notation Xa:b denotes the set of variables from Xa to Xb.
Stationary Process/Markov Assumption
Markov Assumption: Xt depends on only a bounded subset of the previous states X0:t-1
First-order Markov process:
  P(Xt|X0:t-1) = P(Xt|Xt-1)
kth order: depends on the previous k time steps
Sensor Markov assumption:
  P(Et|X0:t, E0:t-1) = P(Et|Xt)
Assume a stationary process: the transition model P(Xt|Xt-1) and the sensor model P(Et|Xt) are the same for all t.
In a stationary process, the changes in the world state are governed by laws that do not themselves change over time.
Which model is right is determined by the domain.
E.g., if a particle is executing a random walk along the x-axis, changing its position by +/-1 at each time step, then using the x-coordinate as the state gives a first-order Markov process.
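As a minimal illustration of the random-walk example (a sketch, not from the slides), the Python snippet below samples such a walk; each step uses only the current x-coordinate, never the earlier history, which is exactly the first-order Markov property.

```python
# A minimal sketch: sampling a random walk on the x-axis.
import random

def random_walk_step(x):
    """Transition model P(X_t | X_{t-1}): move +1 or -1 with equal probability."""
    return x + random.choice([-1, +1])

x = 0                         # X_0
trajectory = [x]
for t in range(10):           # simulate 10 time steps
    x = random_walk_step(x)   # depends only on the current state, not the history
    trajectory.append(x)
print(trajectory)
```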
Complete Joint Distribution
Given:
  Transition model: P(Xt|Xt-1)
  Sensor model: P(Et|Xt)
  Prior probability: P(X0)
Then we can specify the complete joint distribution:
  P(X0, X1, …, Xt, E1, …, Et) = P(X0) ∏_{i=1..t} P(Xi|Xi-1) P(Ei|Xi)
Example
[Network: Raint-1 → Raint → Raint+1, with Umbrellat-1, Umbrellat, Umbrellat+1 as evidence children of the corresponding Rain variables]

Transition model P(Rt|Rt-1):
  Rt-1 = T: 0.7
  Rt-1 = F: 0.3

Sensor model P(Ut|Rt):
  Rt = T: 0.9
  Rt = F: 0.2
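To make the joint-distribution formula concrete, here is a minimal Python sketch that multiplies out P(X0) ∏ P(Xi|Xi-1) P(Ei|Xi) for the umbrella model above; the prior P(R0) = <0.5, 0.5> is taken from the later example slide, and the function name is only illustrative.

```python
# A minimal sketch, assuming the umbrella model above.
P_R0    = {True: 0.5, False: 0.5}     # prior P(R0), from the later example slide
P_trans = {True: 0.7, False: 0.3}     # P(Rain_t = true | Rain_{t-1})
P_sens  = {True: 0.9, False: 0.2}     # P(Umbrella_t = true | Rain_t)

def joint(rains, umbrellas):
    """P(r0, r1..rt, u1..ut) for rains = [r0, r1, ..., rt], umbrellas = [u1, ..., ut]."""
    p = P_R0[rains[0]]
    for i in range(1, len(rains)):
        p_r = P_trans[rains[i - 1]]                  # transition probability of rain
        p *= p_r if rains[i] else (1 - p_r)          # transition term
        p_u = P_sens[rains[i]]                       # sensor probability of umbrella
        p *= p_u if umbrellas[i - 1] else (1 - p_u)  # sensor term
    return p

# e.g. probability that it rains on days 0-2 and the umbrella appears on days 1 and 2
print(joint([True, True, True], [True, True]))   # 0.5 * 0.7*0.9 * 0.7*0.9 ≈ 0.198
```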
Assumption modification
A first-order Markov process was used in the previous example.
There are two possible fixes if the approximation proves too inaccurate:
1. Increasing the order of the Markov process model.
   Use a second-order model by adding Raint-2 as a parent of Raint, which might give slightly more accurate predictions (for example, in Palo Alto it very rarely rains more than two days in a row). See the sketch after this list.
2. Increasing the set of state variables.
   For example, we could add Seasont to allow us to incorporate historical records of rainy seasons, or we could add Temperaturet, Humidityt and Pressuret to allow us to use a physical model of rainy conditions.
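As a minimal sketch of fix 1, a second-order transition model is indexed by the previous two states; the probabilities below are illustrative placeholders, not values from the text.

```python
# Illustrative second-order transition table P(Rain_t | Rain_{t-2}, Rain_{t-1}).
# The numbers are placeholders chosen only to show the structure.
P_rain_given_last_two = {
    (True,  True):  0.5,   # rained the last two days -> rarely a third day in a row
    (True,  False): 0.6,
    (False, True):  0.7,
    (False, False): 0.2,
}

def p_rain_today(rain_two_days_ago, rain_yesterday):
    """P(Rain_t = true | Rain_{t-2}, Rain_{t-1}) under the table above."""
    return P_rain_given_last_two[(rain_two_days_ago, rain_yesterday)]

print(p_rain_today(True, True))   # 0.5
```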
Note
Adding state variables might improve the system's predictive power, but it also increases the prediction requirements:
  we now have to predict the new variables as well
  we may need to add new sensors (e.g., measurements of temperature and pressure)
Example: robot wandering
Problem: tracking a robot wandering randomly on the X-Y plane
State variables: position and velocity
  the new position can be calculated by Newton's laws
  the velocity may change unpredictably
  e.g., a battery-powered robot's velocity may be affected by battery exhaustion
Example: robot wandering
Battery exhaustion depends on how much power was used by all previous maneuvers, so the Markov property is violated.
Fix: add the charge level Batteryt as a state variable
  new model to predict Batteryt from Batteryt-1 and the velocity
  new sensor for the charge level
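A minimal sketch of what such an enlarged state and transition model could look like; the dynamics, noise levels, and drain rate below are illustrative assumptions, not values from the slides.

```python
# Illustrative robot state X_t = (position, velocity, battery).
# Position follows Newton's laws; velocity gets an unpredictable perturbation;
# battery drain depends on how fast the robot moves.
import random

def transition(position, velocity, battery, dt=1.0):
    """One step of an illustrative transition model for the wandering robot."""
    new_position = (position[0] + velocity[0] * dt,      # deterministic part:
                    position[1] + velocity[1] * dt)      # Newton's laws
    new_velocity = (velocity[0] + random.gauss(0, 0.1),  # unpredictable change
                    velocity[1] + random.gauss(0, 0.1))
    speed = (velocity[0] ** 2 + velocity[1] ** 2) ** 0.5
    new_battery = max(0.0, battery - 0.01 * speed * dt)  # drain grows with speed
    return new_position, new_velocity, new_battery

state = ((0.0, 0.0), (1.0, 0.5), 1.0)                    # initial (pos, vel, charge)
for t in range(5):
    state = transition(*state)
print(state)
```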
Inference Tasks
Filtering or monitoring: P(Xt|e1,…,et)
computing current belief state, given all evidence to date
Prediction: P(Xt+k|e1,…,et)
computing prob. of some future state
Smoothing: P(Xk|e1,…,et)
computing prob. of a past state (hindsight) (0 ≤ k < t)
Most likely explanation:
arg maxx1,…,xt P(x1,…,xt|e1,…,et)
given a sequence of observations, find the sequence of states that is most likely to have generated those observations.
Examples
Filtering: What is the probability that it is raining
today, given all the umbrella observations up through
today?
Prediction: What is the probability that it will rain
the day after tomorrow, given all the umbrella
observations up through today?
Smoothing: What is the probability that it rained
yesterday, given all the umbrella observations
through today?
Most likely explanation: if the umbrella appeared
the first three days but not on the fourth, what is the
most likely weather sequence to produce these
umbrella sightings?
Filtering & Prediction
We use recursive estimation to compute P(Xt+1 |
e1:t+1) as a function of et+1 and P(Xt | e1:t)
We can write this as follows:
  P(Xt+1|e1:t+1) = P(Xt+1|e1:t, et+1)                         (dividing up the evidence)
                 = α P(et+1|Xt+1, e1:t) P(Xt+1|e1:t)           (using Bayes' rule)
                 = α P(et+1|Xt+1) P(Xt+1|e1:t)                 (by the Markov property of evidence)
                 = α P(et+1|Xt+1) Σ_{xt} P(Xt+1|xt) P(xt|e1:t) (using the Markov property)
P(et+1|Xt+1) is obtainable directly from the sensor model.
This leads to a recursive definition:
  f1:t+1 = FORWARD(f1:t, et+1)
Example: rain & umbrella p. 543
Model: P(R0) = <0.5, 0.5>
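A minimal Python sketch of the forward recursion for the umbrella world, using the CPTs from the earlier example slide and the prior P(R0) = <0.5, 0.5>; running it prints the filtered rain probabilities after the day-1 and day-2 umbrella observations (≈0.818 and ≈0.883).

```python
# A minimal sketch of filtering in the umbrella world.
P_trans = {True: 0.7, False: 0.3}      # P(Rain_t = true | Rain_{t-1})
P_sens  = {True: 0.9, False: 0.2}      # P(Umbrella_t = true | Rain_t)

def forward(f, umbrella):
    """One step f_{1:t+1} = FORWARD(f_{1:t}, e_{t+1}); f is [P(rain), P(not rain)]."""
    # prediction: sum over the current state x_t
    pred_rain     = P_trans[True] * f[0] + P_trans[False] * f[1]
    pred_not_rain = (1 - P_trans[True]) * f[0] + (1 - P_trans[False]) * f[1]
    # update with the new evidence, then normalize (the alpha in the derivation)
    up_rain     = (P_sens[True] if umbrella else 1 - P_sens[True]) * pred_rain
    up_not_rain = (P_sens[False] if umbrella else 1 - P_sens[False]) * pred_not_rain
    alpha = up_rain + up_not_rain
    return [up_rain / alpha, up_not_rain / alpha]

f = [0.5, 0.5]                         # P(R0)
for u in [True, True]:                 # umbrella observed on days 1 and 2
    f = forward(f, u)
    print(f)                           # ~[0.818, 0.182], then ~[0.883, 0.117]
```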
Smoothing
Compute P(Xk|e1:t) for 0 ≤ k < t
Using a backward message bk+1:t = P(ek+1:t|Xk), we obtain
  P(Xk|e1:t) = α f1:k × bk+1:t
The backward message can be computed using
  bk+1:t = Σ_{xk+1} P(ek+1|xk+1) P(ek+2:t|xk+1) P(xk+1|Xk)    (15.7)
This leads to a recursive definition:
  bk+1:t = BACKWARD(bk+2:t, ek+1:t)
Example from R&N: p. 545
Computing the smoothed estimate for the probability of rain at t = 1, given the umbrella observations on days 1 and 2.
The backward message b2:2 is computed according to Eq. 15.7 and then plugged into Eq. 15.8 to obtain the smoothed estimate.
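A minimal Python sketch of this smoothed estimate: combine the forward message f1:1 (computed as on the previous slides) with the backward message b2:2 from Eq. 15.7, then normalize as in the smoothing formula.

```python
# A minimal sketch of smoothing: P(R1 | u1, u2) for the umbrella world.
P_trans = {True: 0.7, False: 0.3}      # P(Rain_t = true | Rain_{t-1})
P_sens  = {True: 0.9, False: 0.2}      # P(Umbrella_t = true | Rain_t)

# forward message f_{1:1} = P(R1 | u1), as computed on the previous slides
f11 = [0.818, 0.182]

# backward message b_{2:2} = P(u2 | R1), from Eq. 15.7 with b_{3:2} = <1, 1>:
# sum over r2 of P(u2 | r2) * 1 * P(r2 | R1)
b22 = [
    P_sens[True] * P_trans[True]  + P_sens[False] * (1 - P_trans[True]),   # R1 = true
    P_sens[True] * P_trans[False] + P_sens[False] * (1 - P_trans[False]),  # R1 = false
]                                                                          # -> [0.69, 0.41]

# smoothed estimate: alpha * f_{1:1} x b_{2:2}  (pointwise product, then normalize)
unnorm = [f11[i] * b22[i] for i in range(2)]
alpha = sum(unnorm)
print([p / alpha for p in unnorm])     # ~[0.883, 0.117]
```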
Probabilistic Temporal Models
Hidden Markov Models (HMMs)
Kalman Filters
Dynamic Bayesian Networks (DBNs)
Hidden Markov Models (HMMs)
Further simplification: only one (discrete) state variable.
We can now use matrices.
Transition matrix T, with Tij = P(Xt = j | Xt-1 = i)
  Tij is the probability of a transition from state i to state j.
Transition matrix for the umbrella world:
  T = | 0.7  0.3 |
      | 0.3  0.7 |
Hidden Markov Models (HMMs)
Sensor model: diagonal matrix Ot
  the diagonal entries are given by the values P(et | Xt = i)
  the other entries are 0
Umbrella world: if the umbrella is observed on day 1,
  O1 = | 0.9  0   |
       | 0    0.2 |
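A minimal Python sketch of the matrix formulation for the umbrella world (state order: index 0 = rain, 1 = no rain); under this formulation the forward step can be written as f1:t+1 = α Ot+1 Tᵀ f1:t.

```python
# A minimal sketch of HMM filtering in matrix form (plain Python, no libraries).
T = [[0.7, 0.3],             # T[i][j] = P(X_t = j | X_{t-1} = i)
     [0.3, 0.7]]

def O(umbrella):
    """Diagonal sensor matrix O_t: O[i][i] = P(e_t | X_t = i), other entries 0."""
    if umbrella:
        return [[0.9, 0.0], [0.0, 0.2]]
    return [[0.1, 0.0], [0.0, 0.8]]

def forward_step(f, umbrella):
    """Matrix form of filtering: f_{1:t+1} = alpha * O_{t+1} * T^T * f_{1:t}."""
    pred = [sum(T[i][j] * f[i] for i in range(2)) for j in range(2)]   # T^T f
    up = [O(umbrella)[j][j] * pred[j] for j in range(2)]               # weight by evidence
    alpha = sum(up)
    return [p / alpha for p in up]                                     # normalize

f = [0.5, 0.5]                          # prior P(R0)
for u in [True, True]:                  # umbrella on days 1 and 2
    f = forward_step(f, u)
print(f)                                # ~[0.883, 0.117]
```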
Hidden Markov Models (HMMs)
Example: speech recognition