seacoos_data_management_details
Download
Report
Transcript seacoos_data_management_details
Near Real Time
Ocean Observations
Online
the Detailed Escape of SEACOOS
(Southeastern Atlantic Coastal Ocean Observing System)
Data Management and Visualization Secrets
Software : engines
Mapserver
Open source from University of Minnesota
USC runs a mixture of versions from 3.6 to 4.2
PostgreSQL
Open source from postgresql.org
USC runs version 7.4.1
PostGIS
Open source from Refractions.net
USC runs version 0.8.1
Near Real Time Ocean Observations
Online : SEACOOS
Software : additional
PHP
Open source from php.org
USC runs version 4.3.2
Perl
Open source from perl.org
USC runs version 5.8.0
Miscellaneous
ANiS and gifsicle
Imagemagick
Near Real Time Ocean Observations
Online : SEACOOS
Hardware
data scout
In-situ
RS
application server
Apache 2.x
MapServer
Perl, PHP, misc.
database A
In-situ
RS
Near Real Time Ocean Observations
Online : SEACOOS
database B
In-situ
model output
Directory structure
Data that sits on USC filesystems
RS images
cached images
All files have strict naming convention that
includes timestamp
Near Real Time Ocean Observations
Online : SEACOOS
System administration
Databases backed up nightly
Databases cleaned up and optimized nightly
requires some downtime (~ 2 hours)
significant overhead ~ 4 hours
Some backend data massaging (mainly for model
output aggregation)
Near Real Time Ocean Observations
Online : SEACOOS
Database structure
One table category per data type
e.g. in-situ winds
e.g. QuikSCAT winds
more complicated since requires aggregation and normalization
e.g. OI SST
wind_prod contains all wind data
wind_map contains wind data appropriate for maps
table containing pointers to data files on disk
Ancillary tables for specific data types
e.g. OI SST
table containing RGB to SST lookup values (for querying purposes)
Near Real Time Ocean Observations
Online : SEACOOS
Data processing overview
Data scout (netCDF)
Perl code flattens incoming netCDF into arrays which are turned into
SQL INSERT statemtents.
Triggers update the new records suitable for normalization as well as
Mapserver display elements.
Procedures run to optimize tables for display.
RS images (HDF to PNG)
begin as HDF but are availed to USC as PNG’s
Images have standard naming convention and agreed upon extents as well
as predefined RGB to value, e.g. RGB to SST, pairs.
Incoming files cause postgreSQL table to be updated with the new file
and timestamp.
Near Real Time Ocean Observations
Online : SEACOOS