Spatio Temporal Rule Mining For LBS
Download
Report
Transcript Spatio Temporal Rule Mining For LBS
Mining Long, Sharable Patterns in
Trajectories of Moving Objects
Győző Gidófalvi
Geomatic ApS
Center for Geoinformatik
and
Torben Bach Pedersen
Aalborg University
18-07-2015
STDBM 2006, Seoul, South Korea
1
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
2
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
3
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
4
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
5
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
6
Outline
Why mine trajectories of moving objects?
From spatio-temporal data to spatio-temporal
baskets: pivoting
From trajectories to transactions in 3 steps
Method for mining long, sharable patterns
Observations and associated steps
Experiments
Conclusions and future directions
18-07-2015
STDBM 2006, Seoul, South Korea
7
Why mine trajectories of moving objects?
Large amounts of location data from mobile devices
Locations data is interesting as a sequence
Extracting patterns in
trajectories can:
aid the management,
storage, and retrieval of
trajectories
improve tracking of
moving objects
be used to provide
customized LocationBased Services (LBS)
and Location-Based
Advertising (LBA)
18-07-2015
STDBM 2006, Seoul, South Korea
8
From spatio-temporal data to spatio-temporal
baskets: pivoting
Pivoting is the process of grouping a
set of records based on one or more
attributes (pivoting attributes) and
assigning the values of one of more
different attributes (pivoted attributes)
to groups or baskets.
Frequent itemset mining to
discover relationships between items
in the basket [AGR94]
pivoting
attribute
ObjectID LocationID
1
A
1
B
1
C
1
B
1
A
2
A
2
B
2
C
2
D
2
C
2
B
2
A
spatio-temporal item
ObjectID
1
2
18-07-2015
pivoted
attributes
Time
8:00
8:05
8:10
8:15
8:20
8:00
8:05
8:10
8:15
8:20
8:25
8:30
pivoting
Spatio-temporal basket
A_8:00, B_8:05, C_8:10, B_8:15, A_8:20
A_8:00, B_8:05, C_8:10, D_8:15, C_8:20, B_8:25, A_8:30
STDBM 2006, Seoul, South Korea
9
From trajectories to transactions in 3 steps
STEP I: Identify trips
a trip ends when the total
displacement in k consecutive GPS
readings is less than δ
STEP II: Capture periodicity of
patterns
Map date-time domain to time-ofday domain
STEP III: Eliminate the problem
of noisy GPS readings
Substitute readings with spatiotemporal regions
18-07-2015
STDBM 2006, Seoul, South Korea
10
Mining long, sharable patterns (LSP)
LBS example application: intelligent ridesharing
Find sharable routes for a set of commuters and
suggest rideshare possibilities to them
Unique requirements:
Patterns should rather be long than frequent
Patterns should be shareable
Unique challenges:
Patterns are extremely long
Interesting patterns have relatively low support
Not all sub-patterns are interesting
18-07-2015
STDBM 2006, Seoul, South Korea
11
Sample trajectory database
Task: Find all patterns that have at least length 4, are parts of at
least 4 trajectories, which belong to at least 2 distinct objects
(shareable / common)
18-07-2015
STDBM 2006, Seoul, South Korea
12
Filter infrequent itemsets
Observation: Infrequent items cannot be part of a pattern
DEF: An item is frequent if the number of transactions that contain
that item >= MinSupp (for example 4), and the number of unique
objects associated with those transactions >= n (for example 2).
STEP 1: Delete all infrequent items
infrequent items
18-07-2015
STDBM 2006, Seoul, South Korea
13
Filter short transactions
Observation: A short transaction can never support a long pattern
DEF: A transaction is short if the number of items in it is less than
or equal to MinLength (for example 4).
STEP 2: Delete all short transactions
short transactions
18-07-2015
STDBM 2006, Seoul, South Korea
14
Project DB based on item with least n-support
Observation: All patterns that a given item i participates in can be
found in the item-projected DB, T_i.
DEF: An item-projected DB, T_i, contains all the items from the
transactions containing item i.
STEP 3: Project DB based on the item with the least n-support
Projecting item with least n-support
18-07-2015
STDBM 2006, Seoul, South Korea
15
Find the single most frequent itemset in the
projected DB
Observation: There is a single most frequent itemset in a projected
DB, and its support is equal to the support of the projecting item
STEP 4: Identify single most frequent itemset
projecting item
Closed Frequent Itemset
n-support in
projected DB
18-07-2015
STDBM 2006, Seoul, South Korea
16
Delete unnecessary items from predecessor DB
Observation: Items that have the same support in the item projected
DB as the projected DB, can be deleted from the later.
STEP 5: Delete unnecessary items from predecessor DB
Unnecessary items
support in predecessor DB
support in projected DB
18-07-2015
STDBM 2006, Seoul, South Korea
17
Pseudo code for complete LCP mining
Recursive, projection-based LCP mining
1.
2.
3.
4.
5.
Filter infrequent items
Filter short transactions
Project DB based on item with least n-support
Find the single most frequent itemset in the projected DB
Recursively mine the projected DB by further projecting
on frequent items that are not part of the single most
frequent itemset
6. Delete unnecessary items from predecessor DB
18-07-2015
STDBM 2006, Seoul, South Korea
18
Extracted long sharable patterns
18-07-2015
STDBM 2006, Seoul, South Korea
19
Experimental setup & performance results
INFATI data: 20 cars driving in Aalborg for 2 month, 1.8 million GPS
readings, 3699 trips
LCP implemented on MS-SQL, running on 3.6 GHz, 2GB RAM
Minimum
length
experiments
Minimum
support
experiments
18-07-2015
STDBM 2006, Seoul, South Korea
20
Visualization of extracted patterns
Large amount of spatio-temporal data
Some but not all patterns are visible, but not
explicit
Difficult to analyse
18-07-2015
28 explicit, at least 200-long patterns that
are supported by at least 4 trips of at least 2
cars
Easy to analyse
STDBM 2006, Seoul, South Korea
21
Conclusions and future directions
Results:
Frequent itemset mining can be modified to extract long
sharable patterns from trajectories
A RDBMS provides support for an easy but effective
implementation
Challenges:
Some of the patterns are while different in terms of items, but
they are not all that different in spatio-temporal space.
How can we further compress the patterns in a way that
takes this spatial-temporal closeness into consideration?
Compression should take place during the mining
process and should be exploited to further improve
efficiency
18-07-2015
STDBM 2006, Seoul, South Korea
22