Transcript Chapter 8
Chapter 8
Conclusion and Outlook
Grigoris Antoniou
Frank van Harmelen
1
Chapter 8
A Semantic Web Primer
Lecture Outline
1.
2.
3.
4.
2
Which Semantic Web?
Four Popular Fallacies
Current Status
Selected Key Research Challenges
Chapter 8
A Semantic Web Primer
Interpretation 1: The Semantic Web as
the Web of Data
3
The main aim of the Semantic Web is to enable
the integration of structured and semistructured
data sources over the Web
The main recipe is to expose datasets on the
Web in RDF format and to use RDF Schema to
express the intended semantics of these
datasets
A typical use is the combination of geodata with
a set of consumer ratings for restaurants in order
to provide an enriched information source
Chapter 8
A Semantic Web Primer
Interpretation 2: The Semantic Web as
Enrichment of the Current Web
The aim of the Semantic Web is to improve
the current World Wide Web
Typical uses are :
–
–
–
4
Improved search engines
Dynamic personalization of Web sites
Semantic enrichment of existing Web pages
Chapter 8
A Semantic Web Primer
Interpretation 2: The Semantic Web as
Enrichment of the Current Web (2)
The sources of the required semantic metadata
are mostly claimed to be automated sources
–
–
–
5
Concept extraction
Named-entity recognition
Automatic classification
More recently the insight is gaining ground that
the required semantic markup can also be
produced by social mechanisms in communities
that provide large-scale human-produced
markup
Chapter 8
A Semantic Web Primer
Lecture Outline
1.
2.
3.
4.
6
Which Semantic Web?
Four Popular Fallacies
Current Status
Selected Key Research Challenges
Chapter 8
A Semantic Web Primer
1. The semantic Web tries to enforce
meaning from the top
7
This fallacy claims that the Semantic Web
enforces meaning on users through its
standards OWL and RDFS
The only meaning that OWL and RDFS enforce
is the meaning of the connectives in a language
in which users can express their own meanings
Users are free to choose their own vocabularies
and to describe whatever domains the choose
Chapter 8
A Semantic Web Primer
1. The semantic Web tries to enforce
meaning from the top (2)
The situation is comparable to HTML:
–
–
–
8
HTML does not enforce the lay-out of web-pages
“from the top”
All HTML enforces is the language that people
can use to describe their own lay-out
HTML has shown that such an agreement on the
use of a standardized language is a necessary
ingredient for world-wide interoperability
Chapter 8
A Semantic Web Primer
2. Everybody must subscribe to a single
predefined meaning for the terms they use
9
The meaning of terms cannot be predefined for
global use in addition meaning is fluid and contextual
The motto of the Semantic Web is not the
enforcement of a single ontology but rather “let a
thousand ontologies blossom”
That is the reason that the construction of mappings
between ontologies is such a core topic in the
Semantic Web community
Such mappings are expected to be partial, imperfect
and context-dependent
Chapter 8
A Semantic Web Primer
3. Users must understand the complicated details of
formalized knowledge representation
10
Some of the core technology of the Semantic
Web relies on intricate details of formalized
knowledge representation
The semantics of RDF Schema and OWL
and the layering of the subspecies of OWL
are difficult formal matters
The design of good ontologies is a
specialized area of Knowledge Engineering
Chapter 8
A Semantic Web Primer
3. Users must understand the complicated details of
formalized knowledge representation (2)
11
For most users of Semantic Web
applications, such details will remain entirely
behind the scene
Navigation or personalization engines can be
powered by underlying ontologies, expressed
in RDF Schema or OWL, without users ever
being confronted with the ontologies, let
alone their representation languages
Chapter 8
A Semantic Web Primer
4. The semantic Web requires the manual
markup of all existing Web pages
12
It is hard enough for most Web site owners to
maintain the human-readable content of their sites
They will certainly not maintain parallel machineaccessible versions of the same information in RDF
or OWL
If that were necessary, it would indeed spell bad
news for the Semantic Web
Instead, Semantic Web application rely on largescale automation for the extraction of such semantic
markup from the sources themselves
Chapter 8
A Semantic Web Primer
Lecture Outline
1.
2.
3.
4.
13
Which Semantic Web?
Four Popular Fallacies
Current Status
Selected Key Research Challenges
Chapter 8
A Semantic Web Primer
Four Main Questions
14
Where do the metadata come from?
Where do the ontologies come from?
What should be done with the many
ontologies?
Where’s the "Web " in Semantic Web?
Chapter 8
A Semantic Web Primer
Question 1: Where do the metadata
come from?
15
Much of the semantic metadata come from
Natural Language Processing and Machine
Learning technology
It is now possible with off-the-shelf
technology to produce semantic markup for
very large corpuses of Web pages by
annotating them with terms from very large
ontologies with sufficient precision and recall
to drive semantic navigation interfaces
Chapter 8
A Semantic Web Primer
Question 1: Where do the metadata
come from? (2)
16
More recent is the capability of social
communities to provide large amounts of
human-generated markup
Millions of images with hundreds of million of
manually provided metadata tags are found
on some of the most popular Web 2.0 sites
Chapter 8
A Semantic Web Primer
Question 2: Where do the ontologies
come from?
The term ontology as used by the Semantic Web
community now covers a wide array of semantic
structures, from lightweight hierarchies such as
MeSH to heavily axiomatized ontologies such as
GALEN
The world is full of such “ontologies”:
–
–
–
17
Companies have product catalogs
Organizations have internal glossaries
Scientific communities have their public metadata
schemata
Chapter 8
A Semantic Web Primer
Question 2: Where do the ontologies
come from? (2)
18
These have been constructed for other
purposes, most often predating Semantic Web
There are also significant advances in the area
of ontology learning, although results there
remain mixed
Obtaining the concepts of an ontology is feasible
given the appropriate circumstances, but placing
them in the appropriate hierarchy with the right
mutual relationships remains a topic of active
research
Chapter 8
A Semantic Web Primer
Question 3: What should be done with
the many ontologies?
19
The Semantic Web crucially relies on the
possibility of integrating multiple ontologies
This is known as the problem of ontology
alignment, ontology mapping, or ontology
integration
Chapter 8
A Semantic Web Primer
Question 3: What should be done with
the many ontologies? (2)
A wide array of techniques is deployed for
solving this problem
–
–
–
–
–
20
Ontology-mapping techniques based on natural
language technology
Machine learning
Theorem proving
Graph theory
Statistics
Chapter 8
A Semantic Web Primer
Question 3: What should be done with
the many ontologies? (3)
21
Although there are encouraging results, this
problem is by no means solved
Automatically obtained results are not yet
good enough in terms of recall and precision
to drive many of the intended Semantic Web
use cases
Ontology mapping is seen by many as the
Achilles’ heel of the Semantic Web
Chapter 8
A Semantic Web Primer
Question 4: Where’s the "Web " in
Semantic Web?
22
Semantic Web has sometimes been criticized as
being too much about “semantic” and not
enough about “Web”
This was perhaps true in the early days of
Semantic Web development, when there was a
focus on applications in rather circumscribed
domains like intranets
The main advantage of company intranets is that
the ontology-mapping problem can be avoided
Chapter 8
A Semantic Web Primer
Question 4: Where’s the "Web " in
Semantic Web? (2)
23
Recent years have seen a resurgence in the
Web aspects of Semantic Web applications
A prime example is the deployment of FOAF
technology, and of semantically organized
P2P systems
The Web is more than just a collection of
textual documents
Chapter 8
A Semantic Web Primer
Question 4: Where’s the "Web " in
Semantic Web? (3)
24
Nontextual media such as images and
videos are an integral part of the Web
For the application of Semantic Web
technology to such nontextual media we
currently rely on human-generated semantic
markup
Deriving annotations through intelligent
content analysis in images and videos is
under way
Chapter 8
A Semantic Web Primer
Main Application Areas
Looking at industrial events, either dedicated
events or co-organized with the major
international scientific Semantic Web
conferences, we observe that a healthy uptake
of Semantic Web technologies is beginning to
take shape in the following areas :
–
–
–
–
25
Knowledge management, mostly in intranets of large
corporations
Data integration (Boeing, Verizon, and other)
E-science, in particular the life sciences
Convergence with Semantic Grid
Chapter 8
A Semantic Web Primer
Main Application Areas (2)
If we look at the profiles of companies active in this area, we
see a transition from small start-up companies such as
–
–
–
–
To large vendors such as
–
–
–
–
26
Aduna
Ontoprise
Network Inference
Top Quadrant
IBM (Snobase ontology Management System)
HP (Jena RDF platform)
Adobe (RDF-based XMP metadata framework)
Oracle (support for RDF storage and querying in their datavase
product)
Chapter 8
A Semantic Web Primer
Main Application Areas (3)
However, there is a noticeable lack of uptake
in some other areas. In particular, the
promise of the Semantic Web for
Pesonalization
– Large-scale semantic search (on the scale of the
World Wide Web)
– Mobility and context-awareness
is largely unfulfilled
–
27
Chapter 8
A Semantic Web Primer
Main Application Areas (4)
28
A difference that seems to emerge between
the successful and unsuccessful application
areas is that the successful are all aimed at
closed communities, whereas the
applications aimed at the general public are
still in the laboratory phase at best
The underlying reason for this could well be
the difficulty of the ontology mapping
Chapter 8
A Semantic Web Primer
Lecture Outline
1.
2.
3.
4.
29
Which Semantic Web?
Four Popular Fallacies
Current Status
Selected Key Research Challenges
Chapter 8
A Semantic Web Primer
Selected Key Research Challenges (1)
Several challenges that were outlined in
2002 article by van Harmelen have become
active areas of research:
–
–
–
30
Scale inference and storage technology, now
scaling to the order of billions of RDF triples
Ontology evolution and change
Ontology mapping
Chapter 8
A Semantic Web Primer
Selected Key Research Challenges (2)
A number of items on the research agenda, though
hardly developed, have had a crucial impact on the
feasibility of the Semantic Web vision:
–
–
–
31
Interaction between machine-processable representations
and the dynamics of social networks of human users
Mechanisms to deal with trust, reputation, integrity and
provenance in a semiautomated way
Inference and query facilities that are sufficiently robust to
work in the face of limited resources (computational time,
network latency, memory, or storage space) and that can
make intelligent trade-off decisions between resource use
and output quality
Chapter 8
A Semantic Web Primer