Transcript - Arrow@DIT

Breeda Herlihy, IR Manager, UCC Library
UCC selected DSpace in 2008
 Software selection group
 Staff from Library IT, Computer Centre, Special Collections,
Archives & Repository Services
 Shortlist of 3 platforms
 DSpace – Demo site provided by Enovation Solutions
 Eprints – DemoPrints, demo site provided by Eprints
http://demoprints3.eprints.org/
 Digital Commons – Sample sites, web conference call,
questionnaire
 Evaluation matrix
 Recommendation for DSpace to Library Strategy Group
Evaluating IR platforms
 Evaluation matrix based on
 Open Society Institute- A Guide to Institutional Repository
Software, 2004 http://www.soros.org/openaccess/software/index.shtml
 Technical Evaluation of selected Open Source Repository
Solutions on behalf of CPIT Version 1.3 approved, 2006.
Commissioned by OARINZ
https://eduforge.org/docman/view.php/131/1062/Repository%20Evaluation%20Document.pdf
 System documentation available from various websites
 Literature Review
 Repositories Support Project, UK
*New* evaluation March 2009 - http://www.rsp.ac.uk/software/surveyresults
Reasons for recommendation
 Open source repository solution



No annual licence fees
Initial investment in set up and configuration required
Development of staff skill set in longer term
 Integration with new Research Support System
 DSpace Foundation and Fedora Commons announced plans to
combine strengths in July 2008

Have since combined their organizations to create DuraSpace
 Active and open development community
 Wide adoption nationally and internationally
DSpace technical specification
 Operating system : Linux / UNIX / MacOSx/ Windows/ Solaris
 UCC use RedHat Linux
 Programming Language : Java
 Database: Oracle / PostgreSQL
 UCC use PostgreSQL
 Web server: Any, ships with Apache Tomcat
 UCC use Tomcat
 Web User Interface : JSP or XML
 UCC use JSP
 Internal search engine: Lucene
DSpace set up in UCC
 Enovation Solutions
 set up and configured DSpace v 1.5.1 for UCC
 provided advice on hardware specification
 annual maintenance contract
 maintain ‘master copy’ of CORA source code in SVN
 Library IT
 System administration
 Hardware maintenance ….and general hand holding!
 CORA - Cork Open Research Archive http://cora.ucc.ie
 live 31 March 2009
CORA server architecture
User Interface
JSP
Web application server
Apache Tomcat
PostgreSQL database
Database Server Dell PowerEdge 1950
•Metadata
•Organization of content
•Information about e-people
•Authorization
•Workflows
Web Server
Dell PowerEdge R300
•Asset store – deposited items
CORA user interface… very close to default installation
but customization is possible….web ui
Localization – multilingual support
More customization …statistics
 Default DSpace statistics…pretty basic but can be public or private
Google Analytics
 Google Analytics allow a richer and more detailed suite of
statistics such as:
 Time visitors spent on the site
 Where they came from
 Terms they used in search engines to find items
 The geographic location of visitors
 How many pages they looked at
 Which pages they started and ended their visit on
 JavaScript that needs inserting in the footer of all your DSpace
pages
Statistics Add on – University of Minho
Used by Research Online @ UCD
So what does DSpace do?
An open source solution to
 Capture – mediated and self archiving
 Store – bitstream, licences, descriptive & technical metadata
 Index – metadata and full text indexing
 Distribute – OAI-PMH
 Preserve – various file formats
scholarly works in any digital format.
CORA capture of content
 Mediated archiving by IR Manager
 Self archiving by researchers -UCC Research Support System
 Integrated with CORA via SWORD
Workflows can be customised
 No workflow – one person performs all steps
 Accept/Reject
 Accept / Reject / Edit Metadata
 Edit Metadata
 Different workflows per collection
 Change steps in workflow e.g. In ULIR license is step 2
Store - Hierarchy
Store - Hierarchy
Index
1. Descriptive Metadata
2. Full text indexing
Distribute
 Records are exposed through OAI-PMH
http://cora.ucc.ie/oai/request?verb=ListRecords&metadataPrefix=oai_dc
http://en.scientificcommons.org/
Preserve

Data files, also called bitstreams, are organized together into
related sets. Each bitstream has a technical format and other
technical information. This technical information is kept with
bitstreams to assist with preservation over time.

DSpace is committed to going beyond reliable file preservation
to offer functional preservation where files are kept
accessible as technology formats, media, and paradigms
evolve over time for as many types of files as possible.
DSpace Roadmap
Version 1.6 expected early 2009. New features to include
 Statistics
 Embargo facility
 Batch metadata editing




Tidying up of metadata (e.g. spell check)
Restructuring of metadata (move elements from one field to another)
Global find and replace
Add new items (metadata only) without having to create SIPs that conform to
the DSpace batch import format
 Bulk move items between collections
 Bulk ‘map’ items into new collections
 New documentation improvements
DSpace training and resources
 Online course in CADAIR Aberystwyth University IR
 The DSpace course http://hdl.handle.net/2160/615
 DSpace Live CD http://hdl.handle.net/2160/641
 DSpace wiki http://wiki.dspace.org/index.php/Main_Page
 DSpace mailing lists http://wiki.dspace.org/index.php/DSpaceResources#Mailing_Lists
 General / Technical / Developer
 IRC (internet relay chat) channel
 Alternative service providers

DSpace.org http://www.dspace.org/service-providers/Service-Providers.html