Transcript Linked Data

Linked Data:
Emblematic applications on Legacy Data in Libraries
Overview of Presentation
• Linked Data: Introduction
– Evolution of Syntactic Web to Linked Web
– Linked Data Working Principles
• Current Trends & Implementations
• Linking Legacy Data
– Annotating Text Data & Publishing Legacy Data as Linked Data
• Conclusion
4/1/2016
2
Evolution of Syntactic Web to Linked Web
Linked Web
Meaningful Links between related Data sets,
Making Web like a single global database
Web of Data
Semantic Web
Syntactic Web
4/1/2016
Web 3.0
Machine Processable & Understandable,
Data sets Isolated, unconnected
Web of Documents,
Human clickable hyperlinks,
Web 1.0, Web 2.0,
Content not well structured
Can’t ask expressive queries
Can’t process content within applications
3
Need for Linked Data
• To enhance re-usability, findability & visibility of our data
• Publish structured data directly on the Web in addition to
classic HTML pages.
• Different Approaches
– Microformats
– Web APIs & Mashups
» APIs slice the Web into separate data silos
– Linked Data
Christian Bizer: How to Publish Linked Data
on the Web - Introduction (10/27/2008)
4/1/2016
4
The Web of Linked Data
Use Semantic Web technologies to
publish structured data on the Web,
set links between data from one data source to data within other data
sources
4/1/2016
5
Linked Data Working Principles
• Use URIs as names for things.
• Use URIs, so that people can look up those names.
• When someone looks up a URI, provide useful information,
using the standards (RDF).
• Include links to other URIs, so that they can discover more
things.
Tim Berners-Lee 2007
http://www.w3.org/DesignIssues/LinkedData.html
4/1/2016
6
URIs to identify people & relationships
http://drtc.isibang.ac.in/DRTC/students/sayon
http://drtc.isibang.ac.in/DRTC/students/vikas
http://xmlns.com/foaf/0.1/knows
http://drtc.isibang.ac.in/DRTC/students/debashis
http://live.dbpedia.org/page/Bangalore
4/1/2016
7
Implementation in OCLC Worldcat
4/1/2016
8
Facts about few others
Name
Software
Events
Reasoning
SPARQL
Endpoint
LCSH
MarkLogic
& 4Store
Yes
Yes
Yes
Eurovoc
Sesame
No
No
Yes
Wordnet 3.0
3Store
Yes
Yes
Yes
4/1/2016
Triples
License
More than
300
Open
million
Not
N/A
Open
More than
Open
2 million
9
OpenRefine(ex-Google Refine)
•Free, open source tool used for
Cleaning messy data & Linking it to data sets like Freebase,
DBpedia, Europeana etc.
•Survey result of 2014 reflects
Librarians as largest
user base.
4/1/2016 Image Source : http://en.wikipedia.org/wiki/File:Google-refine-logo.svg
10
Linked Media Framework
• Server application that bundles central Semantic Web
technologies such as
LMF Core :Linked Data Server that allows to expose data
following the Linked Data Principles.
LMF Modules :Used to extend the functionality of the
Linked Media Server.
• LMF Usage Scenarios :Publishing Legacy Data as Linked Data,
Building Semantic Search over Data
4/1/2016
Image Source: http://semanticweb.org/wiki/File:Weis_mit_bunter_schrift.png
11
Hostel Library Records
4/1/2016
Data Entry by :http://drtc.isibang.ac.in/DRTC/students
12
Reconciliation against DBpedia writer Ontology
4/1/2016
13
Configuring Triples
4/1/2016
Editing RDF Skeleton Continued…
14
RDF Skeleton of the Project
4/1/2016
Editing RDF Skeleton Continued…
15
4/1/2016
16
RDF/XML using OpenRefine & csv2rdf
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/author/1">
<rdf:type rdf:resource="http://xmlns.com/foaf/0.1/person"/>
<foaf:name>Robert Silverberg</foaf:name>
<owl:sameAs
rdf:resource="http://dbpedia.org/resource/Robert_Silverberg"/>
</rdf:Description>
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/book/1">
<dc:title>Passengers (short story)</dc:title>
<owl:sameAs
rdf:resource="http://dbpedia.org/resource/Passengers_(short_story)"/>
</rdf:Description>
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/author/1">
<dc:creator rdf:resource="http://localhost:8080/LMF/resource/book/1"/>
</rdf:Description>
OpenRefine output
4/1/2016
<rdf:Description rdf:about="#entry00001">
<rdf:type rdf:resource="http://data-gov.tw.rpi.edu/2009/data-govtwc.rdf#DataEntry"/>
<id1>1</id1>
<authors>Robert Silverberg</authors>
<id2>1</id2>
<book_title>Passengers (short story)</book_title>
<author_id>1</author_id>
<book_id>1</book_id>
</rdf:Description>
csv2rdf output
17
Conclusion
•
•
•
•
•
Linked Data converts the whole web into a single database
Designed for machines and humans.
Links between concept-concept and concept-and its description.
Encourages reuse, reduces repetition.
Enables to add more value to data over the network.
• People can more easily find library resources on the web.
• More creative applications based on library metadata
4/1/2016
18
Thank you