EcoGrid and Virtual Laboratory e-Science - National e

Download Report

Transcript EcoGrid and Virtual Laboratory e-Science - National e

Ecogrid
&
Virtual Laboratory for e-Science
from field observation to spatial knowledge
Willem Bouten, project leader
Floris Sluiter, design & implementation
Guido van Reenen, data analysis
Victor Mensing, Vlinderstichting
Dirk Zoetebier, Sovon
Aart Jan van der Linden, Talmon Comm.
CBPG
Virtual Laboratory for e-Science
Mission
To boost e-Science by creating an e-Science environment and
carrying out research on methodologies.
Essential components
- e-Science development areas
- a Virtual Laboratory
development area
- a Large-Scale Distributed
computing development area,
consisting of high performance
networking and grid parts
Virtual Laboratory for e-Science
Aim
To develop and apply a Grid and Virtual Lab technology
based information system and research environment.
Virtual Laboratory for e-Science
The VL-e project has four programme lines, most of
them containing more than one subprogramme.
P1 – e-Science in Applications
P2 – generic Virtual Laboratory methodology
P3 – large-scale distributes systems
P4 – scaling up & validating in ‘real-time’ applications
VL- e
Virtual Laboratory for e-Science
Research of the P1 line is carried out within the following
subprogrammes:
P1 – e-Science in Applications
SP1 – Data Intensive Science
SP2 – Food Informatics
SP3 – Medical Diagnosis & Imaging
SP4 – Biodiversity
SP5 – Bioinformatics ASP
SP6 – The Dutch Telescience Laboratory
VL- e
e-Science in Applications
Virtual Laboratory for e-Science
Biodiversity
Aim
To develop and apply a Grid and Virtual Lab technology
based information system and research environment
for identification, distribution, integration and analyses
of observations and model results of the dynamics of flora
and fauna
VL- e
e-Science in Applications
biodiversity
Virtual Laboratory for e-Science
Biodiversity
Partners:
CBPG – Computational Bio- and Physical Geography
(IBED – Institute for Biodiversity and Ecosystem Dynamics)
(UvA – Universiteit van Amsterdam)
RNLAF – Royal Netherlands Airforce
VOFF – Association for Research of Flora & Fauna
And co-operation with other programme lines, especially subprogrammes
of P2 (generic Virtual Laboratory methodology)
VL- e
e-Science in Applications
biodiversity
CBPG
Virtual Laboratory for e-Science
Biodiversity
Two main components:
1.
a national database for biodiversity information
 this talk (Ecogrid)
2.
a PSE for integrated analysis of observations and model results
 topic of Judy Shamoun-Baranes (BAMBAS)
VL- e
e-Science in Applications
biodiversity
EcoGrid
Ecogrid
Objectives
1.
Construct a virtual database that is
connected to geographically distributed databases which
contain information on the distributions of species, on
landscape characteristics and on weather.
Ecogrid
Objectives
1.
2.
Construct a virtual database that is
connected to geographically distributed databases which
contain information on the distributions of species, on
landscape characteristics and on weather.
Develop generic methodologies and tools for scale conversions,
to be able to integrate and interpret data that are observed at
different spatial scales.
Ecogrid
Objectives
1.
2.
3.
Construct a virtual database that is
connected to geographically distributed databases which
contain information on the distributions of species, on
landscape characteristics and on weather.
Develop generic methodologies and tools for scale conversions,
to be able to integrate and interpret data that are observed at
different spatial scales.
Use this infrastructure across the boundaries of organisations
to indentify:
- spatial wood web structures
- biodiversity hotspots
- effects of changed land-use
- ...
Example: insect diversity and Red-backed Shrikes
#
#
##
#
#
#
#
##
#
#
##
#
##
##
#
#
##
#
##
#
##
##
#
## #
##
#
##
#
#
#
##
# ##
#
#
##
##
##
##
##
##
## #
##
### #
## #
###
# #
###
##
##
# ## #
#
#
#
#
##
#
##
# ##
#
####
#
#
#
#
#
#
##
#
### #
##
# # #
# ###
##
# ##
##
#
##
##
##
#
#
#
#
#
#
#
###
#
##
#
###
##
##
##
#
#
## ##
#
# #
#
## #
##
# #
# ##
#
#
##
####
#
#
##
#
##
##
##
#
##
#
#
#
#
## #
##
#
####
## #
###
#
##
# ##
##
### # #
#
# ##
Nitrogen dep.
##
##
##
# ##
#
#
#
#
#
#
#
#
#
##
#
#
#
#
##
##
#
#
#
#
#
#
#
# #
#
# #
# ## #
#
# ##
#
##
##
##
#
#
##
## #
#
#
#
#
Ecogrid - data acquisition
Initial Situation
Vlinderstichting
FLORON
NMV
RAVON
BLWG
VZZ
ANNEMOON
EIS-NL
Tinea
SOVON
VOFF
Ecogrid - data acquisition
taxonomy
observation
locality
central
VOFF
database
SOVON
NCO databases
central databases
Data shared by all NCO’s
NCO databases
NCO-specific data
RAVON
central databases
WEB
GIS
webpages
maps
etc.
METADATA
PERSONS
TAXONOMY
portal login
NCO databases
GIS
LOCATION
ACTIVE MEMBER
PK
Person ID
SPECIES
PK
Taxon ID
km hok
x, y
route
relevé
OBSERVATOR GROUP
OBSERVATION
PK
Observation ID
Species ID
FK
Person ID
FK
Observation ID
Main Observator
EXTRA INFO
spatial databases
• Spatial functions to
query (SQL) within
and across layers:
• Equals()
• Disjoint()
• Intersects()
• Touches()
• Crosses()
• Within()
• Contains()
• Overlaps()
‘Layers’ build with these data types,
stored as tables in PostgreSQL/PostGIS.
• Special indexing
techniques (GisT)
Ecogrid system - overview
data
acquisition
portal
science
portal
central
databases
VOFF
EcoGrid
virtual
meta
database
landscape
BAMBAS
database
geostatistics
Spotfire
scale conversion
modules
data mining
modules
model
experiments
weather
processed
data
VL-eScience Research Environment
Ecogrid system - prototype
data
acquisition
portal
science
portal
SOVON
EcoGrid
virtual
meta
database
landscape
BAMBAS
database
geostatistics
Spotfire
scale conversion
modules
data mining
modules
model
experiments
weather
processed
data
VL-eScience Research Environment
Ecogrid - portal prototype
Ecogrid - portal prototype
Ecogrid - portal prototype
Ecogrid - portal prototype
Ecogrid - portal prototype
Concluding remarks
• PostgreSQL (and Postgis) are especially suited for
large scientific databases
• By the end of this year all the databases will be
populated and we will have online data
acquisition.
• Then we will shift our focus on using the data and
concentrate on Analysis and Datamining