Digital Enterprise Research Intitute
Download
Report
Transcript Digital Enterprise Research Intitute
Digital Enterprise Research Institute
Report on internship @ DERI,
Galway
Jan Zemanek (DIKE, UEP; SmILE, DERI)
[email protected]
Copyright 2007 Digital Enterprise Research Institute. All rights reserved.
www.deri.ie
DERI, Galway in a nutshell
Digital Enterprise Research Institute
http://www.deri.ie & http://blog.deri.ie/
DERI – Digital Enterprise Research Intitute at National
University of Ireland
„Making Semantic Web real.“
Research areas
Semantic Web (cluster)
Semantic Web Services (cluster)
eLearning (cluster)
Director: Prof. Dr. Stefan Decker
Vice director: Prof. Dr. Manfred Hauswirth
around 100 members
2 stable Czech members: Dr. Tomas Vitvar, Vit Novacek
– Tomas has launched his own weblog lately, you can find it at
http://www.vitvar.com/blog/
www.deri.ie
SmILE subcluster
Digital Enterprise Research Institute
www.deri.ie
http://smile.deri.ie/ & http://smile.deri.ie/blog/
SmILE stands for Semantic Information Systems and
Language Engineering Group
Group leader: Dr. Siegfried Handschuh
„focused around the application of Semantic Web and
Language Engineering techniques to support knowledge
acquisition and re-use in different settings“
leading project: NEPOMUK (Networked Environment for
Personal Ontology-based Management of Unified Knowledge)
NEPOMUK aims to build a Social Semantic Desktop which will
present information in a well defined manner, which will be
processible by computer, and which will connect and exchange
data with other desktops
DINO ontology lifecycle scenario
and framework
Digital Enterprise Research Institute
www.deri.ie
DINO stands for „Dynamics, INtegration and
Ontology“ or „Data and INtensive Ontology“
is a scenario and framework for practical handling
of dynamic and large data-sets in an ontology
lifecycle, focusing particularly on dynamic
integration of learned knowledge into
collaboratively developed ontologies
Ontology development
Digital Enterprise Research Institute
www.deri.ie
ontologies are very likely subject to change given
the dynamic nature of domain knowledge
ontology construction is usually the result of
collaboration
it is not always feasible to process all the relevant
data and extract the knowledge from them
manually
this implies a need for (partial) automation of ontology
extraction and management processes in dynamic and
data-intensive environments
– this can only be achieved by ontology learning
DINO ontology integration
Digital Enterprise Research Institute
based on
Dynamic Integration of Medical Ontologies in Large
Scale, Novacek, V.; Laera, L.; Handschuh, S.; article
much more details in
D2.3.8v1 Report and Prototype of Dynamics in the
Ontology Lifecycle
www.deri.ie
DINO ontology integration
Digital Enterprise Research Institute
scheme of the integration process
phases of the integration
providing a master ontology
providing an extending ontology
alignment/negotiation
reasoning/management
ontology diff
triple sorter
mapping triples to natural language suggestions
www.deri.ie
DINO integration scheme
Digital Enterprise Research Institute
www.deri.ie
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
providing a master ontology
providing an extending ontology
ontology learning
– machine learning and NLP methods are used for a processing
of relevant resources and extracting knowledge from them
– is realised using Text2Onto
any “external” ontology can be provided
– we can integrate e.g. different ontologies from the same
domain or specialised/general ontologies
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
alignment/negotiation
provided ontologies need to be reconciled since they cover
the same domain, but might be structured differently
contsists of mappings between the concepts, properties,
and relationships in provided ontologies
uses ontology alignment API developed by INRIA RhoneAlpes
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
reasoning/management
used for merging of the provided ontologies according to
statements in an „alignment ontology“
– the „alignment ontology“ consists of axioms merging classes,
individuals and properties
handles inconsistencies like sub-class hierarchy cycles,
disjointness-subsumption, disjointness-instantiation
resulting ontology is passed to an ontology diff
uses Jena 2 Ontology API
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
ontology diff
possible ontology extensions are equal to the additions
that the merged ontology brings into the master ontology
the addition triples form a base to eventual ontology
extension suggestions
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
triple sorter
applies an ordering taking a relevance measure of possible
suggestions into account (based on preferred and
unwanted terms)
DINO phases of integration
Digital Enterprise Research Institute
www.deri.ie
mapping triples to natural language suggestions
produced suggestions are in a form of very simple natural
language statements which are obtained directly from the
sorted triples
DINO integration manager
Digital Enterprise Research Institute
original plans
DINO should have been a part of MarcOnt portal initially
– MarcOnt portal (http://www.marcont.org/)
– an environment for collaborative ontology development being
developed at DERI, Galway
DINO as a bunch of cooperating Protege(-OWL) plugins
– Semantic Version Manager plugin
– Protege plugin built upon SemVersion
– Collaborative Protege
– problems with 3rd party libraries used in Text2Onto and
GATE
reality
DINO as a stand-alone Java application
www.deri.ie
DINO integration manager
Digital Enterprise Research Institute
www.deri.ie
DINO integration manager
Digital Enterprise Research Institute
Demo
www.deri.ie
Semantic web for Java developers
Digital Enterprise Research Institute
interesting Java tools handling Semantic web
technologies I encountered or had to deal with
directly
SemVersion
RDF2Go
RDFReactor
www.deri.ie
SemVersion
Digital Enterprise Research Institute
http://wiki.ontoworld.org/wiki/SemVersion
developed mainly by Max Voelkel
a versioning system for RDF and RDF ontologies
backed by Sesame 2 (since v1.0.0 alpha)
enables to
version RDF models
commit and merge RDF models
Semantic Version Manager
an implementation of SemVersion as a Protege plugin
www.deri.ie
Semantic Version Manager
Digital Enterprise Research Institute
www.deri.ie
Semantic Version Manager
Digital Enterprise Research Institute
www.deri.ie
RDF2Go
Digital Enterprise Research Institute
http://wiki.ontoworld.org/wiki/RDF2Go
an abstraction over triple (and quad) stores
allows a programmer to code against RDF2Go
interface and thus to stay independent of the
underlying RDF store
supported RDF stores
Jena 2.4
Sesame 2.0 beta 6 (the latest release)
used in
SemVersion
Aperture
www.deri.ie
RDF2Go
Digital Enterprise Research Institute
RDF2Go example code:
www.deri.ie
RDFReactor
Digital Enterprise Research Institute
http://wiki.ontoworld.org/wiki/RDFReactor
a view of RDF data through object-oriented Java
proxies making using RDF natural for Java
developers
„Think in objects, not statements.“
all state information is in a RDF model in a RDF
store at all times
www.deri.ie
RDFReactor Java proxies are stateless
Java proxies are generated automatically from RDF
Schema
RDFReactor
Digital Enterprise Research Institute
example code:
www.deri.ie
The very last slide
Digital Enterprise Research Institute
Any (other) questions?
Thank you for your attention!
www.deri.ie