Maximum query latency Network traffic per query
Download
Report
Transcript Maximum query latency Network traffic per query
HERO: Online Real-time Vehicle
Tracking in Shanghai
Xuejia Lu
11/17/2008
SG Project Overview
ShanghaiGrid(SG) project aims to provide abundant intelligent
transportation services
•
•
•
•
•
navigation
trip planning
optimal route selection to avoid congestion
bus arrival prediction
…
Online real-time vehicle tracking is the most fundamental service
• active RFID tag is planted on the tire
• RFID readers and wireless APs are
installed on crossroads
• local node collecting data
• local node connects to Internet
SG Project Challenges
Real-time requirement
• an query must be answered within a certain bounded time
• e.g. a stolen car
Scalable
• up to millions of users and hundreds of thousands of vehicles
• huge number of simultaneous queries
Robust to node failures
• thousands of local nodes
• system maintenance is not easy
Several Existing Solutions
Centralized scheme
• centralized database
• stores all vehicles’ location information
• accepts all queries
• more than 22,000 crossroads in Shanghai, thousands of events/s
Distributed schemes
• captured vehicle information can be stored locally at distributed nodes
• BUT no hint about the enquired vehicle for a query
Several Existing Solutions (cont’d)
Distributed schemes
Flooding: flood the query across the network
• large amount of traffic
• poor scalability
• fail to satisfy real-time requirement
Random Walk: long query latency
DHTs: map objects to peers
• Chord, Tapestry, Pastry etc.
• large computation and traffic overhead for large number
of rapid updates of moving objects
Proposed Solution: HERO
HERO stands for : Hierarchical Exponential Region Organization
GOAL :
• limit the maximum query response time
• minimize network traffic
Core Idea:
update location information in a controlled way
Four Components
Overlay construction
Hierarchy initialization
Restricted location updating
Query Routing
Overlay construction
• overlay network matches underlying road network
• connection between two geographically neighboring local nodes
Hierarchy initialization
• Each region Ri has a radius ri (in hops)
• r1 r ri r k i 1
• If ri 1 r k i 1 ,then ri ri 1 2
• Every node maintains a next-insider pointer that points to a node which
is on the boundary of the immediate inner region
• First node that captures a new vehicle trigger initialization procedure:
a packet contains router field and journey field, initializes to its IP address
and one
• Other nodes will set its next-insider to router
contained in the packet
• If journey value equals to the radius of certain ri :
1) modifies router field to its IP
2) marks itself the boundary node of Ri
Restricted location updating
• Chaser: the local node that moving vehicle passes by
• Chaser perform location updating and maintains the hierarchy
• Three cases to consider depending on the chaser’s location
• The simplest case 1 : chaser is an interior node with R1
chaser floods the vehicle’s information to all other nodes within R1
Case 2: boundary node of R1
• HERO needs to re-organize R1
• node a initiates update packet: similar to initialization but with an additional
scale field to indicate the propagate area, here it is R1
• nodes in R2 will be update to have the current position of the new R1
• special case: new R1 could be truncated by R2, to ensure container relationship
Case 3: common boundary node
•
•
•
•
HERO needs to re-organize R1
the several regions need to be re-build, node b will be the new circle center
nodes in R3 will be update to have the current position of the new R2
special case: new R2 could be truncated by R3, to ensure container relationship
The maximum network traffic overhead of location updating for a
vehicle moving a distance of D (network diameter) :
( D) c(kD2 2r (r k 1) D 6r 2 ) (c is a constant coefficient)
Query Routing
• R1 will always have the latest location information of the vehicle.
• A query can be issued from any node, when a boundary node receive the query,
unless it is on the R1, it will forward the query to the inner region’s boundary node.
• It takes at most logk ( D / r ) hops for a query to be answered,
where D is the network diameter.
Performance Evaluation
• small white dots: 1,000 nodes
• red dots: location captured, tax is vacant
• dark dots: tax is delivering
• one hour extensive trace data of 100 taxies,
real GPS data, randomly generate 100,000
queries that hour
Two Metrics:
Maximum query latency
Network traffic per query
Results
• latency drops when k and r increases: logk ( D / r )
Results(cont’d)
• traffic increases when k or r increases: ( D) c(kD 2 2r (r k 1) D 6r 2 )
• extreme case: r=D or r=1, k=D
Assumptions and Limitations
Assumes all vehicles have active RFID tag planted in tires
Number of hierarchies could be up to the same as number of nodes
A node could have many next-insider pointers, query routing would be
similar to that of Gnutella
How to solve query routing for the vehicle that is not exist, could lead
to cycle
System cost could be high, because of planning one node (server) on
every crossroad
Privacy concern of tracking personal vehicle all the time