Transcript - Arrow@DIT
Breeda Herlihy, IR Manager, UCC Library
UCC selected DSpace in 2008
Software selection group
Staff from Library IT, Computer Centre, Special Collections,
Archives & Repository Services
Shortlist of 3 platforms
DSpace – Demo site provided by Enovation Solutions
Eprints – DemoPrints, demo site provided by Eprints
http://demoprints3.eprints.org/
Digital Commons – Sample sites, web conference call,
questionnaire
Evaluation matrix
Recommendation for DSpace to Library Strategy Group
Evaluating IR platforms
Evaluation matrix based on
Open Society Institute- A Guide to Institutional Repository
Software, 2004 http://www.soros.org/openaccess/software/index.shtml
Technical Evaluation of selected Open Source Repository
Solutions on behalf of CPIT Version 1.3 approved, 2006.
Commissioned by OARINZ
https://eduforge.org/docman/view.php/131/1062/Repository%20Evaluation%20Document.pdf
System documentation available from various websites
Literature Review
Repositories Support Project, UK
*New* evaluation March 2009 - http://www.rsp.ac.uk/software/surveyresults
Reasons for recommendation
Open source repository solution
No annual licence fees
Initial investment in set up and configuration required
Development of staff skill set in longer term
Integration with new Research Support System
DSpace Foundation and Fedora Commons announced plans to
combine strengths in July 2008
Have since combined their organizations to create DuraSpace
Active and open development community
Wide adoption nationally and internationally
DSpace technical specification
Operating system : Linux / UNIX / MacOSx/ Windows/ Solaris
UCC use RedHat Linux
Programming Language : Java
Database: Oracle / PostgreSQL
UCC use PostgreSQL
Web server: Any, ships with Apache Tomcat
UCC use Tomcat
Web User Interface : JSP or XML
UCC use JSP
Internal search engine: Lucene
DSpace set up in UCC
Enovation Solutions
set up and configured DSpace v 1.5.1 for UCC
provided advice on hardware specification
annual maintenance contract
maintain ‘master copy’ of CORA source code in SVN
Library IT
System administration
Hardware maintenance ….and general hand holding!
CORA - Cork Open Research Archive http://cora.ucc.ie
live 31 March 2009
CORA server architecture
User Interface
JSP
Web application server
Apache Tomcat
PostgreSQL database
Database Server Dell PowerEdge 1950
•Metadata
•Organization of content
•Information about e-people
•Authorization
•Workflows
Web Server
Dell PowerEdge R300
•Asset store – deposited items
CORA user interface… very close to default installation
but customization is possible….web ui
Localization – multilingual support
More customization …statistics
Default DSpace statistics…pretty basic but can be public or private
Google Analytics
Google Analytics allow a richer and more detailed suite of
statistics such as:
Time visitors spent on the site
Where they came from
Terms they used in search engines to find items
The geographic location of visitors
How many pages they looked at
Which pages they started and ended their visit on
JavaScript that needs inserting in the footer of all your DSpace
pages
Statistics Add on – University of Minho
Used by Research Online @ UCD
So what does DSpace do?
An open source solution to
Capture – mediated and self archiving
Store – bitstream, licences, descriptive & technical metadata
Index – metadata and full text indexing
Distribute – OAI-PMH
Preserve – various file formats
scholarly works in any digital format.
CORA capture of content
Mediated archiving by IR Manager
Self archiving by researchers -UCC Research Support System
Integrated with CORA via SWORD
Workflows can be customised
No workflow – one person performs all steps
Accept/Reject
Accept / Reject / Edit Metadata
Edit Metadata
Different workflows per collection
Change steps in workflow e.g. In ULIR license is step 2
Store - Hierarchy
Store - Hierarchy
Index
1. Descriptive Metadata
2. Full text indexing
Distribute
Records are exposed through OAI-PMH
http://cora.ucc.ie/oai/request?verb=ListRecords&metadataPrefix=oai_dc
http://en.scientificcommons.org/
Preserve
Data files, also called bitstreams, are organized together into
related sets. Each bitstream has a technical format and other
technical information. This technical information is kept with
bitstreams to assist with preservation over time.
DSpace is committed to going beyond reliable file preservation
to offer functional preservation where files are kept
accessible as technology formats, media, and paradigms
evolve over time for as many types of files as possible.
DSpace Roadmap
Version 1.6 expected early 2009. New features to include
Statistics
Embargo facility
Batch metadata editing
Tidying up of metadata (e.g. spell check)
Restructuring of metadata (move elements from one field to another)
Global find and replace
Add new items (metadata only) without having to create SIPs that conform to
the DSpace batch import format
Bulk move items between collections
Bulk ‘map’ items into new collections
New documentation improvements
DSpace training and resources
Online course in CADAIR Aberystwyth University IR
The DSpace course http://hdl.handle.net/2160/615
DSpace Live CD http://hdl.handle.net/2160/641
DSpace wiki http://wiki.dspace.org/index.php/Main_Page
DSpace mailing lists http://wiki.dspace.org/index.php/DSpaceResources#Mailing_Lists
General / Technical / Developer
IRC (internet relay chat) channel
Alternative service providers
DSpace.org http://www.dspace.org/service-providers/Service-Providers.html