Interoperating with GIS and Statistical

Download Report

Transcript Interoperating with GIS and Statistical

Interoperating with GIS and
Statistical Environment
for an Interactive
Spatial Data Mining
Didier Josselin, THEMA, UPRESA 6049 du CNRS, Besançon, GDR CASSINI
[email protected] http://thema.univ-fcomte.fr/didier.htm
Xlisp-Stat programming :
@ D. Betz, L. Tierney, C. Brunsdon, D. Josselin, L. Guerre, B. Dancuo
French Research Group about GIS
(1990-2000 : GDR CASSINI, 2000...?)
The spatial data mining quest
Finding significant relations
between geographical objects
in order to cluster them
Examples of geographical purpose
Sub-objectives at
geographical entity scale
 1st
door : the statistical dependency
some entities have common characteristics...
 2nd
door : the spatial relation
some entities are contiguous, closed from each others…
 3rd
door : the combination of spatial and statistical relation
some entities are similar and closed...
Sub-objectives at territory and
geographical space scale

1st door : the spatial cutting out and data aggregation : a succession
of deriving ...
Analysing spatial repartition,
Identifiing gradients,
Detecting discontinuities...

2nd door : the spatial auto-correlation measure
Global and local

3rd door : the identification of geographical composite
(heterogeneous) entities
Geographical agricultural
flows analysis
Agricultural flows between French communes
Commune A
Commune B
Various flows status
LES FLUX SORTANTS EN FRANCHE-COMTÉ
Outgoing
flowsENin1988
Franche-Comté
Source : RGA 1988
What are we looking for ?
Commune aggregate with its key and boundary
Commune described by an attribute
Commune couple flow
Which softwares may be
available and convenient ?
Geographical Information
Systems
+

Various structured query languages
 Existing
tools to build clean structured databases

Graphical and mapping functionalities

generally open to other softwares
-

Poor in statistical functions

Rarely integrate Exploratory Data Analysis

Need to write queries rather execute them in a graphic
way
ESDA Environment
+

Numerous statistical functions

Numerous graphic representations

Ease to select objects on screen
 Dynamic

link between objects
generally open to development by programming
-

Poor in geographical and semiologic functionalities

Does not integrate structured databases functions

Does not include geometrical or topological models
Any solutions ?
Modifying existing softwares
First methodological choice
Adding to a statistical
environment some mapping and
relational functionalities
ARPEGE’ : a tool to Analyse
Robustly in Practice and
Explore Geographical
Environment (XlispStat)
The « visioner » in ARPEGE’
Using ARPEGE’ to analyse flows
+

Dynamic link between multiple objects

Relative fastness to support expert decision making

Facilities to implement relations and triggers between
objects

Possibility to focus on many crossed selections

Difficult to manage with multiscaling
 Users
may miss some synthetic statistical indicators
or automatic methods

Application must be quite simple (RAM limitations)

Combinatory explosion risk !
Coupling two complementary
softwares
Second methodological choice
Interoperating with a GIS and a
statistical environment software
LAVSTAT : a dynamic Link
between ArcView and XlispSTAT
Interaction
LAVSTAT principles
Services, DDE
Server
XlispStat
ArcView
+

Dynamic link between GIS and Statistical software
 The

whole functionalities access to both systems
Increases the ways to investigate spatial data

A screen is not enough to explore data

A few time loss to make interoperating the two
softwares

Not already stable (memory conflicts)
CONCLUSION
A few advices for spatial
analysis to take reliant
decisions in order
to shape the future ...
If you have some objectives to
reach with data to explore...
Choose the appropriate
methods ...
0
Keep a critical look on tools
and methods ...
1
Choose most robusts methods
to analyse your data ...
2
Check hypothesis without too
tight assumptions
3
Try to dominate time during
anaysis and to be inside
learning process ...
4
Keep in touch with all
individual data
5
Bring to light all aspects of
your problem by multiple
representations
6
Use dynamic links and
interactivity
7
Study the fringe as the trend...
8
...and model deviation,
residuals ...
7
9
… and relations between
geographical objects through
different scales...
10
… which may be well defined
(semantic,topology, structural,
functional ...)
11
Validate your results by maths
and
expertise
12
f ( x)   x.dx
And consider the
“measurement density” is not
constant
13