eBank UK - linking research data, scholarly communications

Download Report

Transcript eBank UK - linking research data, scholarly communications

Archiving research data and research
publications.
Dr Leslie Carr, Intelligence, Agents Multimedia, University of
Southampton
Dr Simon Coles, School of Chemistry, University of Southampton
Dr Liz Lyon, UKOLN, University of Bath
RCUK, Octiber 2004
1
Overview
• In an Open Access environment
–
–
–
–
scientific outputs are openly available
described by appropriate metadata
in Institutional Repositories
harvestable by OAI protocols
• Scientists can use the same infrastructure
– (here eprints.org software and an existing scientific portal
service)
– to provide maximal open access
– to all their data, as well as their published articles
• raw data, intermediate calculations, final results
• in a searchable, accessible form
• BUT this is subject to ongoing investigation.
RCUK, Octiber 2004
2
Current chemistry publishing protocols
Ideas and interpretations
Hooks into the literature
Raw data!
Results &
derived
data
RCUK, Octiber 2004
3
Presentation services: subject, media-specific, data, commercial portals
Data creation /
capture /
gathering:
laboratory
experiments,
Grids,
fieldwork,
surveys, media
Resource
discovery, linking,
embedding
Data analysis,
transformation,
mining, modelling
Searching ,
harvesting,
embedding
Aggregator
services: national,
commercial
Resource
discovery,
linking,
embedding
Learning object
creation, re-use
Harvesting
metadata
Research &
e-Science
workflows
Validation
Deposit / selfarchiving
Learning &
Teaching
workflows
Repositories :
institutional,
e-prints, subject,
data, learning objects
Validation
Publication
Resource
discovery, linking,
embedding
Linking
Data curation:
databases & databanks
Deposit / selfarchiving
Institutional
presentation
services: portals,
Learning
Management
Systems, u/g, p/g
courses, modules
Peer-reviewed
publications: journals,
conference proceedings
RCUK, Octiber 2004
Validation
Quality
assurance
bodies
4
The data deluge
Data
Overload!
EPSRC National
Crystallography
Service
How do we
disseminate?
RCUK, Octiber 2004
5
CombeChem: An EPSRC pilot project
Simulation
Video
Diffractometer
Properties
Analysis
Structures
Database
Properties
e-Lab
X-Ray
e-Lab
Grid Middleware
RCUK, Octiber 2004
6
Crystallography workflow
• Initialisation: mount new sample on diffractometer &
set up data collection
• Collection: collect data
• Processing: process and correct images
• Solution: solve structures
• Refinement: refine structure
• CIF: produce CIF (Crystallographic Information File
format)
• Report: generate Crystal Structure Report
RAW DATA
DERIVED DATA
RESULTS DATA
RCUK, Octiber 2004
7
Deposition into the archive
RCUK, Octiber 2004
8
An Archive entry
ecrystals.chem.soton.ac.uk
RCUK, Octiber 2004
9
All the way back to the underlying data…
RCUK, Octiber 2004
10
Dataset
Searching,
linking and
embedding
Data flow in eBank
Dataset
Dataset
dcterms:references
Harvesting
OAI-PMH
oai_dc
Crystal structure
(data holding)
Linking
ebank_dc
record (XML)
dc:identifier
dc:type=“CrystalStructure”
and/or “Collection”
Institutional
repository
Crystal structure
report (HTML)
Searching,
linking and
embedding
Harvesting
OAI-PMH
PSIgate
portal
ebank_dc
eBank UK
aggregator
service
dcterms:isReferencedBy
Eprint
“jump-off”
page
(HTML)
Eprint
manifestation
(e.g. PDF)
Deposit
ePrint UK
aggregator
service
Harvesting
OAI-PMH
oai_dc
dc:identifier
Linking
Model input Andy Powell, UKOLN.
Eprint oai_dc
record (XML)
dc:type=“Eprint”
and/or ”Text”
RCUK, Octiber 2004
Subject service
Searching,
linking and
embedding
11
Harvesting: OAIster
RCUK, Octiber 2004
12
Linking and aggregating: Search & discover
RCUK, Octiber 2004
13
Linking and aggregating: Hit browsing
RCUK, Octiber 2004
14
And finally…
eBank embedded in a science portal
RCUK, Octiber 2004
15