DCO`s Data Science Day - Rensselaer Polytechnic Institute

Download Report

Transcript DCO`s Data Science Day - Rensselaer Polytechnic Institute

DCO's Data Science Day
Introduction June 5, 2014, Troy NY
Peter Fox (Rensselaer Polytechnic
Institute)
Schematic for Deep Carbon Observatory Data Flows
Data Management and Data Science Guidance DCO-wide and Compatible with National and International Best Practices
Many means
of generation
Experiments
Physics/
Chemistry
Models
Sensor
streams
Standards based,
Provenance captured
Global Census, Virtual
Mineral Laboratory, ...
Efficient generation,
identifiers issued
Data
Repositories
Others
EOS
Q uery,
access and
use of existing data
Existing Community Data
infrastructure; identifiers, Catalogs
DCO data infrastructure; identifiers, Catalogs
GIDI
Sample
collection
….
Sequencing
EarthChem
Including International Geo Sample Number (IGSN) issuer
MINDAT
Emission/
Compositions
M etadata,
schema,
data
... ... ...
deepcarbon.net
Schematic for Deep Carbon Virtual Observatory and Interoperability
Integrated
Applications
Discovery
visualizations
Semantic
interoperability
Analytics
and mining
Global Census, Virtual
Mineral Laboratory, ...
Application-level mediation: vocabulary,
mapping to science and data terms
Software,
Tools & Apps
Deep Energy/
Life
Applications
Semantic
interoperability
Physics/
Chemistry
Models
Semantic query,
hypothsis and
inference
….
Res/Flux
Applications
Query,
access and
use of data
Semantic mediation: physics, chemistry, mineral, emission data - ChemML,
Data
Repositories
GVP
MINDAT
EOS
EarthChem
M etadata,
schema,
data
... ... ...
Emission/
Compositions
deepcarbon.net
DCO-DATA SCIENCE OPERATIONAL STRUCTURE
Data Science
Team with
Data Science
Advisory
Committee
data
requirements
Deep
Life
data
curation
DCO
Data Portal
data publication
& reuse
Reservoirs
& Fluxes
Physics &
Chemistry
Advancing the science of the Communities
Engagement
Team with
Engagement
Liaisons
DCO
Secretariat
Deep
Energy
deepcarbon.net
Visualized information viewing
6
All information is linked and traceable!
7
Collaboration tools
Group data
deposit and
reporting
Group Based Collaboration
Listings of
group content
Group
management
and messaging
Group
bibliography
Group shared
calendar
Group task
management
Group
membership
Group event
management
10
Research activities report dashboard:
Results updated in real time as new member registered,
new activities reported, new publication uploaded…
11
• Partnership formed between Royal Meteorological
Society and academic publishers Wiley Blackwell to
develop a mechanism for the formal publication of
data in the Open Access Geoscience Data Journal
• GDJ publishes short data articles cross-linked to,
and citing, datasets that have been deposited in
approved data centres and awarded DOIs (or other
permanent identifier).
• A data article describes a dataset, giving details of
its collection, processing, software, file formats, etc.,
without the requirement of novel analyses or ground
breaking conclusions.
• the when, how and why data was collected and
what the data-product is.
• DCO is an approved repository and is eligible for
discounts on publications
Decadal goals = Discovery science
Global community of ‘Carbon’ scientists contributing to
Deep Earth Computer (data legacy) comprising:
•
•
•
•
•
Global Earth Mineral Laboratory
Inventory of Deep Fluids
Active Volcano Gas Emissions
Global Census of Deep Microbial Life
State of High Pressure and Temperature Carbon and
Related Materials
• Inventory of Diamonds with Inclusions
Organizing Commitee
Peter Fox
• Rensselaer Polytechnic Institute, USA
• DCO Data Science Team
Xiaogang (Marshall) Ma
• Rensselaer Polytechnic Institute, USA
• DCO Data Science Team
Craig Schiffries
• Carnegie Institution for Science, USA
• DCO Secretariat
Julia Sheets
• Ohio State University, USA
• Deep Energy
Mark Ghiorso
• OFM Research Inc., USA
• Extreme Physics and Chemistry
Kerstin Lehnert
• Lamont-Doherty Earth Observatory,
Columbia University, USA
• Reservoirs and Fluxes
Mitch Sogin
• Marine Biological Laboratory, USA
• Deep Life
Frank Baker
• University of Rhode Island, USA
• DCO Engagement Team
Patrick West
• Rensselaer Polytechnic Institute, USA
• DCO Data Science Team
John Erickson
• Rensselaer Polytechnic Institute, USA
• DCO Data Science Team
Alia Awadallah
• Carnegie Institution for Science, USA
• DCO Secretariat
https://deepcarbon.net//event/deep-carbon-observatory-data-science-day