Overview - Spatial Database Group

Download Report

Transcript Overview - Spatial Database Group

Mapping and analysis for public safety: An Overview
Motivation

Crime generators and attractors
Identifying events (e.g. Bar closing,
football games) that lead to increased crime.


Predicting crime events
Identifying location and time where a serial
offender would commit his next crime.


Courtsey: www.startribune.com
Predicting the next target of a burglary offender
Identification of patrol routes

Force deployment to mitigate crime hotspots.
http://www.dublincrime.com/blog/wpcontent/MappingOurMeanStreets.jpg
Scientific Domain: Environmental Criminology
Crime pattern theory
Courtsey: http://www.popcenter.org/learning/60steps/index.cfm?stepNum=16
Routine activity theory and Crime Triangle
Courtsey: http://www.popcenter.org/learning/60steps/index.cfm?stepnum=8
 Crime Event: Motivated offender, vulnerable victim (available at an appropriate
location and time), absence of a capable guardian.
 Crime Generators : offenders and targets come together in time place, large
gatherings (e.g. Bars, Football games)
 Crime Attractors : places offering many criminal opportunities and offenders may
relocate to these areas (e.g. drug areas)
What is spatiotemporal data mining ?
Process of discovering interesting, useful and non-trivial patterns from
spatiotemporal data.
Data mining Tasks
Traditional Data Mining
Frequent patterns (e.g.
Associations, Sequential
association, frequent graphs)
Spatiotemporal data mining
(STDM)
ST Frequent patterns (e.g. ST
Co-occurrence, ST
Sequences and Cascading ST
patterns)
Clustering
Hotspot Analysis
Anomaly detection
ST Outliers
Classification/ Regression
ST Classification /
ST (auto) Regression
STDM pattern families
Co-occurrence Patterns
Hotspots
www.sentient.nl/crimeanabody.html
Spatial outliers: sensor (#9) on I-35
Location prediction: nesting sites
Nest locations
Vegetation
durability
Distance to open water
Water depth
Projects : Mapping and Analysis for public safety
 US DoJ/NIJ- Mapping and analysis for Public Safety
 CrimeStat .NET Libaries 1.0 : Modularization of CrimeStat, a tool for the analysis of crime
incidents.
 Performance tuning of Spatial analysis routines in CrimeStat
 CrimeStat 3.2a - 3.3: Addition of new modules for spatial analysis.
 US DOD/ ERDC/ AGC – Cascade models for multi scale
pattern discovery
 Designed new interest measures and formulated pattern
mining algorithms for identifying patterns from large crime
report datasets.
 US DOD –

Spatial network hotspot discovery
New algorithms to discover hotspots along street networks
CrimeStat




A Spatial statistics software to analyze crime incident locations.
It provides modules for spatial statistics, space-time analysis, finding
patterns:
 Hotspot Analysis
 Spatial Modeling
 Crime Travel Demand
Used widely by law enforcement agencies throughout the country.
Popular among Public Health agencies and research groups throughout the
country.
CrimeStat
 Used by law enforcement all over the country (e.g. Redlands Police
Department, Baltimore County)
 File down loads: Fall 2010 65,875 (Source:
http://www.icpsr.umich.edu/CrimeStat/about.html )
 6 Releases since 1999
Our Contributions
•
Crime Stat Libraries 1.0[1]
–
–
•
Set of .NET components distributed by NIJ
Credits: http://www.icpsr.umich.edu/CrimeStat/files/Documentation_for_CrimeStat_Libraries_1.0.pdf
Crime Stat v 3.2-3.3
– Statistical Simulation functions for Spatial Analysis Routines
– Credits: http://www.icpsr.umich.edu/CrimeStat/files/CrimeStat3.3updatenotesPartI.pdf
•
Scalability to Large Datasets
– Self-Join Index[2]
[1]
http://www.spatial.cs.umn.edu/projects/crimestat-pub/beta/
Pradeep Mohan, Shashi Shekhar, Ned Levine, Ronald E. Wilson, Betsy George, Mete Celik, Should SDBMS support the join index ?:
A Case Study from Crimestat. In Proc. of 16th ACM SIGSPATIAL International Conference on
Advances in Geographic Information Systems (ACM GIS 2008), California, USA,2008.
[2]
Real Crime Datasets
Lincoln, NE Dataset
Real Data
 Years 2002- 2007
 > 40 Crime types
 > 200 Sub types
 Average size of each year ~ 40000
Cascading spatio-temporal pattern (CSTP)
Time T1
Time T2 > T1
Time T3>T2
Aggregate(T1,T2,T3)
a
Bar Closing(B)
Assault(A)
Drunk Driving (C)
 Input: Crime reports with location and time.
CSTP: P1
Output: CSTP
C
 Partially ordered subsets of ST event types.
 Located together in space.
 Occur in stages over time.
B
A
Lincoln, NE crime dataset: Case study
 Is bar closing a generator for crime related CSTP ?
Bar locations in Lincoln, NE
Questions
Observation: Crime peaks around bar-closing!
 Is bar closing a crime generator ?
Bar closing
 Are there other generators (e.g.
Saturday Nights )?
Saturday Night
Increase(Larceny,vandalism, assaults)
Increase(Larceny,vandalism, assaults)
K.S Test: Saturday night significantly different than normal day bar closing (P-value = 1.249x10-7 , K =0.41)
Lincoln, NE crime dataset: Case study
{Assault}
{Bar Closing}
{Vandalism}
Spatial
Gen-CPR
Neighborhood
CPI
Max-CPR
1 Mile
0.0386
0.02283
0.0386
2.5 Miles
0.18491
0.04539
0.18491
Temporal Neighbor Size = 1 hr
Dataset Years 2002-2006
Lincoln, NE crime dataset: Case study
Probability of a Bar closing generating a
crime in Lincoln City = 0.038
Crimes considered: Assault and Vandalism
Probability of a Lincoln city downtown
Bar closing generating a crime = 0.0862
Lincoln, NE crime dataset: Case study
Probability of a Vandalism after Bar
closing in Lincoln City = 0.022
Probability of a Vandalism after a
downtown Bar closing = 0.0397
 Only bar closings that also generate assaults
 Downtown subsetting may decrease/ increase
chances.
Lincoln, NE crime dataset: Case study
Probability of an Assault after Bar closing
in Lincoln City = 0.029
Probability of an Assault after a
downtown Bar closing = 0.021
 Only bar closings that also generate Vandalism
 Downtown subsetting may decrease/ increase
chances
Spatial Network Hotspots
Geometric Hotspot
Network Hotspot