Here goes the title
EUDAT
PIDs in EUDAT
Webinar, 15 February 2013
Mark van de Sanden
EUDAT WP leader
SARA, The Netherlands
Collaborative Data Infrastructure
- A framework for the future? -
Data Generators, Users
• User functionalities, data capture & transfer, virtual research environments
• Community Support Services: data discovery & navigation, workflow generation, annotation, interpretability
• Common Data Services: persistent storage, identification, authenticity, workflow execution, mining
Trust, Data Curation
Consortium
Five research communities on board
• EPOS: European Plate Observing System
• CLARIN: Common Language Resources and Technology Infrastructure
• ENES: Service for Climate Modelling in Europe
• LifeWatch: Biodiversity Data and Observatories
• VPH: The Virtual Physiological Human
• All share common challenges:
– Reference models and architectures
– Persistent data identifiers
– Metadata management
– Distributed data sources
– Data interoperability
Communities ↔ Data Centers
First EUDAT Services
• Metadata Catalogue: aggregated EUDAT metadata domain, data inventory
• AAI: network of trust among authentication and authorization actors
• Data Staging: dynamic replication to HPC workspace for processing
• Safe Replication: data curation and access optimization
• Simple Store: researcher data store (simple upload, share and access)
Metadata, Data, PID
Scientific Data Pyramid
Long tail of data
• These types of services target small research groups, homeless and citizen scientists
• Register data so that it can be referenced
Data Life Cycle
(Diagram labels: temporary data, data acquisition, generation, enrichment, processing, reduction, analysis, description, registration, global registration, preservation, referable data, citable publication, domain of globally referable data)
Use Case: CLARIN – Safe Replication
EPIC PID registry
EPIC Consortium
• Make data referenceable and findable
• Provide a sustainable service for storing and maintaining large volumes of PIDs
• Consists of 4 partners: GWDG, DKRZ, CSC and SARA
• Service based on the Handle service with an EPIC extension for easy management
• Handle service provides replication of PIDs and global search across PID domains (Handle, DataCite and EPIC)
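To illustrate how a Handle-based PID can be turned into something a browser or script can follow, here is a minimal Python sketch. It assumes the public Handle proxy at hdl.handle.net as the resolver; EPIC partners may run their own resolvers, and the function names `split_handle` and `resolver_url` are hypothetical helpers, not part of any EPIC or Handle library. No network request is made.

```python
from typing import Tuple
from urllib.parse import quote

# Assumption: the public Handle proxy; a site-specific resolver could be used instead.
PROXY = "https://hdl.handle.net"


def split_handle(handle: str) -> Tuple[str, str]:
    """Split 'hdl:PREFIX/SUFFIX' (the 'hdl:' scheme is optional) into (prefix, suffix)."""
    name = handle.strip()
    if name.lower().startswith("hdl:"):
        name = name[4:]
    # The first '/' separates the naming-authority prefix from the local suffix.
    prefix, sep, suffix = name.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a handle: {handle!r}")
    return prefix, suffix


def resolver_url(handle: str, proxy: str = PROXY) -> str:
    """Build the proxy URL at which the handle can be resolved."""
    prefix, suffix = split_handle(handle)
    # Percent-encode anything that is not URL-safe in either part.
    return f"{proxy}/{quote(prefix, safe='')}/{quote(suffix, safe='')}"


if __name__ == "__main__":
    print(resolver_url("hdl:11100/00335742-6026-11e2-9724-e41f13eb41b2"))
```

The prefix identifies the naming authority and the suffix is local to it, so the same helper works for any Handle-based PID regardless of which partner issued it.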
EPIC Service
Data User: Find hdl:11100/00335742-6026-11e2-9724-e41f13eb41b2
Data Manager: Manage 11100/xxx-……..-xxxx
• EPIC stores more than 5.5M DOs
• EPOS communities registered >500k DOs in the last few months
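The example handle on this slide has a suffix that parses as a time-based (version 1) UUID. As a rough sketch of how a data manager might generate such names locally, the Python below mints a handle name of that shape and checks the format. This is an assumption made for illustration: the slides do not say how EPIC generates suffixes, `mint_handle` and `is_uuid_handle` are hypothetical helpers, and actually registering the name with the EPIC service is not shown.

```python
import uuid

PREFIX = "11100"  # prefix taken from the example handle on the slide


def mint_handle(prefix: str = PREFIX) -> str:
    """Return a new handle name whose suffix is a time-based (version 1) UUID.

    Assumption: UUID suffixes, as suggested by the slide's example handle.
    This only builds the name; it does not register anything.
    """
    return f"{prefix}/{uuid.uuid1()}"


def is_uuid_handle(handle: str) -> bool:
    """Return True if the handle has the form PREFIX/UUID."""
    prefix, sep, suffix = handle.partition("/")
    if not sep or not prefix:
        return False
    try:
        uuid.UUID(suffix)
    except ValueError:
        return False
    return True


if __name__ == "__main__":
    new_handle = mint_handle()
    print(new_handle, is_uuid_handle(new_handle))
```

A UUID suffix avoids coordination between data managers, since each one can generate names without consulting a central counter.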
Questions