cmpe 588 engineering the semantic web information system

Download Report

Transcript cmpe 588 engineering the semantic web information system

EASTERN MEDITERRANEAN UNIVERSITY
COMPUTER ENGINEERING DEPARTMENT
CMPE 588 ENGINEERING THE SEMANTIC WEB
INFORMATION SYSTEM
ONTOLOGY-DRIVEN SEMANTIC
MARK UP OF UNSTRUCTURED
TEXTS
Faaya H. Tiwuya
Instructor: Assoc. Prof.
Dr. Atilla ELCI
OUTLINE
•
•
•
•
•
•
•
•
Introduction
Defination of terms
Need to use ontologies
Annotation tools
Project Tools
Ontology driven semantic mark up/annotation process
Sample Semantic Mark up
Querying knowledge base
OUTLINE
• Conclusion
• References
• Questions
INTRODUCTION
• A recent estimates by Merrill Lynch states that
more than 85% of all business information
exists as unstructured data
• In order to realise the goal of the Semantic Web
which is to allow machines share and exploit
knowledge without central authority, there is a
need to mark up this bulk of data in a way that
machines will be able to read them
INTRODUCTION
• Blogs are a typical example of information that
exists in an unstructured format.
• There is a need therefore to mark-up blog sites
within the context of an “ontology” so that that
meaningful information can be extracted from
them for the population of a knowledge base
for retrieval at a later time by different
applications.
DEFINATION OF TERMS
• What then is a blog, ontology and semantic
annotation?
– Blogs: A blog is a website where entries are written
in a chronological order and displayed in reverse
chronological order. Blogs provide commentary or
news on a particular subject such as food, politics, or
local news.
DEFINATION OF TERMS
• Ontology: An ontology is a data model that represents
a set of concepts within a domain and the relationships
between those concepts.
– It is used to reason about the objects within that domain.
• Annotation: Annotation is the process of marking up
or adding notes/comments to something. Within the
context of the semantic web, it is the process of
marking up documents with meaningful tags
NEED TO USE ONTOLOGIES
• Why do we need to use ontologies?
– To share common understanding of the structure of
information among people or software agents
– To enable reuse of domain knowledge
– To make domain assumptions explicit
– To separate domain knowledge from the operational
knowledge
– To analyze domain knowledge
ANNOTATION TOOLS
•
•
•
•
•
•
SHOE Knowledge Annotator
Annotea
SMORE
Semantic Word
KIM Semantic Annotation Platform
MnM
– MnM is an annotation tool which provides both automated
and semi-automated support for annotating web pages with
semantic contents. MnM integrates a web browser with an
ontology editor and provides open APIs to link to ontology
servers and for integrating information extraction tools.
PROJECT TOOLS
• MnM
• TinyXml
• C++ BUILDER 5.0
ONTOLOGY DRIVEN SEMANTIC
ANNOTATION PROCESS
A CELEBRITY IS A PERSON
• A celebrity “is a” person who is famous ( not for
something infamous ) and attracts a lot of media
attention. E.g David Beckham, Angelina Jolie,
Zinedane Zidane.
• Sites exists where people post comments about
these sort of people almost o a daily basis.
• Some of these sites contain important
information.
A CELEBRITY IS A PERSON
SAMPLE ONTOLOGY –DRIVEN SEMANTIC
MARKUP ON A DAVID BECKHAM BLOGUSING
MnM
XML FILE ON BECKHAM MARK-UP
SAMPLE QUERY FOR INFO ON
BECKHAM USING CELEBQRY
SAMPLE QUERY FOR INFO ON
BECKHAM USING CELEBQRY
CONCLUSION
• Blogs contain a lot of valuable information
about different topics
• An otology driven semantic annotation of these
blogs will create a unified means to share the
wealth of information these blogs possess.
REFERENCES
1.
2.
3.
4.
5.
6.
7.
8.
9.
Prof. Dr. Atilla Aelci, Lecture slides CMPE 588 Engineering Semantic Web
information Systems.
Ontology Development 101: A Guide to Creating Your First Ontology
Natalya F. Noy and Deborah L. McGuinness Stanford University, Stanford,
CA, 94305 [email protected] and [email protected]
http://en.wikipedia.org/wiki/Unstructured_data
http://semanticweb2001.aifb.uni-karlsruhe.de/
http://en.wikipedia.org/wiki/Blog
http://en.wikipedia.org/wiki/Ontology_(computer_science)
http://annotation.semanticweb.org/tools/
M.Vargas-Vera, E. Motta , J. Domingue M. Lanzoni , A. Stutt, F.
Ciravegna, MnM: Ontology-Driven Tool for Semantic Markup
TinyXml, http://www.grinninglizard.com/tinyxmldocs/index.html
• THANK YOU FOR YOUR ATTENTION
• QUESTIONS