R. Kiley - The Dryad data repository wiki

Download Report

Transcript R. Kiley - The Dryad data repository wiki

UKPMC and Dryad
Dryad-UK meeting: 28th April 2010
Robert Kiley,
Head Digital Services, Wellcome Library
Email: [email protected]
Overview
• Brief look at UKPMC – and the
developments which have come online
over the past few months
• Consider how UKPMC handles
Supplementary material
• Look at usage of this material at
UKPMC and PMC
• UKPMC and Dryad-UK: thoughts,
issues and questions
– Here to listen – rather than provide any
answers
UKPMC - Headlines
• Repository contains around 1.7 million full text documents
• Significant increase in use of UKPMC – (Fig 1) – though still low
compared with PMC
• Compliance with Trust OA mandate increasing (Fig 2)
• R&D Developments now coming on-stream
UKPMC usage: 2007-2010
% of papers in UKPMC
900000
50.00
800000
45.00
764308
40.00
600000
500000
Unique sessions
Downloads
400000
316226
300000
213774
4310 13310
9289
March 2007
March 2008
30.00
25.00
% of papers in UKPMC
20.00
15.00
10.00
5.00
132582
0.00
65566
100000
35.00
0
March 2009
Usage in a single month
Figure 1 – repository usage
March 2010
No
v0
Fe 6
b0
M 7
ay
-0
Au 7
g0
No 7
v0
Fe 7
b0
M 8
ay
-0
Au 8
g0
No 8
v0
Fe 8
b0
M 9
ay
-0
Au 9
g09
200000
Compliance (%)
700000
Month
Figure 2 – compliance with Trust mandate
New services at UKPMC – Single access point
View abstracts or
full text.
Single search across PubMed,
Patents, Clinical Guidelines and
UK Theses databases.
Results faceted by source.
New services at UKPMC – Text mining
Results of
text mining
the full-text
New services at UKPMC – Citations tab
New services at UKPMC – Grant Reporting
“My Impact Report”
“My Grant Report”
New services at UKPMC – Funder analyses
“Cross-funder report”
“Journal Report”
UKPMC and Supplementary material
• UKPMC holds around 1.7 million full text articles
• Around 90,000 documents have supplementary material
– Equates to over 277GB of data
– Around 200,000 files, predominantly PDF’s and MS Word files, but also
13,000 MOV files, 18,000 TIFF files
• Currently supplementary information is not searchable, nor curated
– i.e. no characterisation of files; no preservation strategy
• MIMAS doing a small-scale study to look at whether a tool like
Autonomy could provide some form of access to these objects
– Vic Lyte will talk to this
Use of Supplementary Material
• 01 January through 31st March 2010
– 696,221 unique user sessions at UKPMC, who downloaded 2,218,743
papers
– 130 downloads of “Supplementary Material”, from 47 documents.
• PMC2612414 – SI data downloaded 19 times (Figures and tables – PDF
format))
• PMC2262892 – SI data downloaded 15 times (Molecule structures – PDF
format)
– Low usage may, in part, be a reflection of the fact that these objects are
not indexed and searchable
• Higher usage at PMC
– On the DOCSUM page – around Click Through Rate (CTR) 0.5%
– On Full text views – CTR around 8% (55000 full text documents viewed
4200 views to Suppl. Material
Questions/clarifications/Issues
• Overall, keen to listen to, and participate in, these discussions
• If Dryad-UK does go ahead, we would fully support establishment of
appropriate links with UKPMC to maximise access to data
– NCBI have been asked to consider offering a service to allow NIH
grantees to upload supplementary data. This would be accessible
through the PubMed record (and PMC if the full-text was in this
repository)
– NCBI would not curate the data
• Costs?
– UKPMC costs WT around £500k a year – equal to around 0.1% of
research spend
– Any idea what Dryad-UK might cost?