MURI Kickoff: UR
Download
Report
Transcript MURI Kickoff: UR
Recognizing Activities of
Daily Living
from Sensor Data
Henry Kautz
Department of Computer Science
University of Rochester
Activity Recognition
Much recent interest in recognizing human
activity from heterogeneous sensor data
Motion sensors
GPS
RFID
Video
Compelling applications
Military / security operations (e.g. ASSIST)
Smart homes & offices
Gathering data on indoor
activities
Interpreting RFID Data
(using Switching HMM)
Gathering Multi-view Video
Interpreting Video
Computing scene statistics
Computing object statistics
Ai = activity
Oi = object
Si = scene statistic
Di = object statistics
Ri = RFID label (for training)
Gathering data on outdoor
activities
Raw GPS
Discovering significant places
Conditional Random Field
Predicting transportation
goals
Dynamic Bayesian Network
Issue
Previous work on activity recognition has
used a wide variety of probabilistic models for
different tasks and kinds of data
Background knowledge is implicitly encoded
in the structure of the models
HMMs, DBNs, CRFs, …
E.g.: Relation between transportation goals,
plans, actions
Increasingly difficult to scale & integrate
Markov Logic
Markov logic will provide common modeling
language & inference tools, enabling
Easier integration of multiple sensors
Easier generalization
From one activity at a time to multiple ongoing
activities
From one individual to multiple individuals
Easier modification of background knowledge
Add / modify library of plans and goals
Example Scenario
John goes into his kitchen (video)
He takes out a jug from the refrigerator, and a
bowl from the cabinet (RFID)
He leaves his apartment, and walks to a
convenience store (GPS)
He returns carrying a box (video)
He pours the box into the bowl (accelerometer)
and the contents of the jug (accelerometer &
RFID)
Why did John leave the apartment? What did he do?
UR Contributions to MURI: Scenario
Development & Data Collection
Develop set of activity recognition scenarios of
increasing complexity
Enact and gather sensor data
Activities in the home
Outdoor activities
Heterogeneous: GPS, RFID, video, motion, …
Intermittent and noisy
Make dataset available to team
Including feature sequences extracted from video and
acceleration data
Ground truth
1st data set mid-Year One, then ongoing
UR Contributions to MURI: Unified ML
Model of Daily Activities
Recast our previous work on recognition using
HMMs, DBNs, CRFs in Markov Logic
Integrate and generalize earlier results
Year One:
HMM ML
Generalize to multiple ongoing activities
Handle novel observations using similarity
Representing actions, intentions, and goals
Extend ML to include “modal operators”
Distinguish beliefs of observer from beliefs of subject
Ability to model imperfect agents, whose plans are flawed
From HMMs to ML
Hidden Markov models describe the world as
probabilistic state machine
ML encoding of HMM can be relaxed to
allow subject to be in multiple states (multiple
activities) by making “unique state” constraint
soft
w : a a, i . Activity (a, i ) Activity (a, i)
From HMMs to ML
Novel observations can be handled by
applying background knowledge about
similarity
w : a, obj , obj . Uses (a, obj ) Similar (obj , obj )
Uses( a, obj )
Modal Operators
Most previous work on probabilistic activity
recognition does not distinguish
What system infers is true about the world
What the subject believes is true about the world
What the system predicts will happen
What the subject intends to happen
Modal operators relate agents to attitudes
Bel( John, contains(jug, gasoline) )
But system may know jug is empty
Goal( John, ignite(jug) )
Knowledge of subject’s goal can drive cooperative system to
help subject, or antagonistic system to block user
Semantic Inference
Modal operators do not work like ordinary predicates
or logical connectives
Modal proof theory is hard to automate
However:
Modal operators have well-understood “possible world”
semantics
Agent believes P in possible world W iff P is true in all worlds
W’ such that reachable(W,W’)
ML’s inference engine works at the semantic level (direct
search over possible worlds)
Promising approach: semantic inference for modal
constructs in ML
Explicitly model reachability relationships for each attitude
and agent
Idea
Alchemy searches over models (truth
assignments)
Structure = set of models and accessibility
relationships over the models
Modal formulas are evaluated over structures
Structures are too big to explicitly search
Modify Alchemy to search over samples drawn
from structures
Holds( Bel (agent , formula), w)
w.R(agent , w, w) Holds( formula, w)