BRD Prototype Data Warehouse

Download Report

Transcript BRD Prototype Data Warehouse

22 March 2006

Kobe, Japan
From Here to There:
Case Studies on a Path to
Semantic Web 2.0
Gail Hodge
Information International Associates, Inc.
US Geological Survey, Consultant
Joel Sachs
Ebiquity Lab, University of Maryland Baltimore County
Outline
• Describe the National Biological
Information Infrastructure (NBII)
• Highlight challenges in biodiversity- and
eco- informatics
• Describe the “to be” scenario
• Present several NBII-related Semantic
Web projects in biodiversity and
ecosystem domains
NBII Node Structure


Regional
Thematic

Infrastructure
Part of Multi-Sectored Approach
GLOBAL
REGIONAL
World Data Centers (WDC)
Global Biodiversity Information Facility (GBIF)
Clearinghouse Mechanism (CHM)
Pacific Biodiversity Information Forum (PBIF)
The Inter-American Biodiversity Information Network
(IABIN)
NATIONAL
NBII (US), CBIN (Canada)
ERIN (Australia)
LOCAL
State Heritage Programs
GAP Analysis
County Park Information
Information Management Challenges
Linking levels of:
–Biological organization
–Spatial organization
–Temporal organization
Linking people across disciplines and organizations
Challenges Across Disciplines &
Organizations
• Accessing data from diverse sources
NGOs
• Discipline - based practices
• Different terminologies and
representations for concepts
MUSEUMS
• Sensitive data
GOVERNMENTS
UNIVERSITIES
• Conflicts of interest
“As-Is” Situation
• Interaction between information resources is
often “hard coded”
• While partnerships are important they take
time to develop
• Difficult to respond quickly to new research
areas or practical needs
• Does not promote discovery and “new
connections”
“To-Be” Situation
• Fluid and flexible
• Connections (“partnerships”) made on the fly
• Local, regional, national and global information
able to be integrated
• Desired situation requires understanding of
content including:
– Assessment of provenance and trustworthiness
– Understanding semantics (across disciplines,
languages and cultures)
– Understanding the resource’s behavior (what you can
do with it)
The Vision for the Future
What Is Needed?
• Semantic Web approaches
• Distributed web services available through
a registry
• Metadata to describe who, why and how
• Semantics to improve understanding and
reuse
NBII Semantic Web Activities
• Semantic Prototypes in Research
Ecoinformatics (SPIRE)
• NBII Terminology Web Services
• Involvement in Ecoterm
Semantic Prototypes in Research
Ecoinformatics
• Background on the SPIRE Project
• Two SPIRE prototypes:
• ELVIS
• Swoogle
• Demo of SPARQL queries against
integrated semantic web documents
UMBC Prototype Applications
Ebiquity Information Retrieval
Agents
Spire
UMD
MINDSWAP
Semantic Web Tools
Infrastructure
NASA
GSFC
UC Davis
ICE
Semantic CAIN Dissemination
Ontology Development
NBII
Semantic Prototypes In
Ecoinformatics
Invasive Species Forecasting
System/Remote Sensing Data
RMBL
Peace
Food Webs
Ecological Interaction Ontologies
ELVIS
(The Ecosystem Location Visualization and Information System)
ELVIS is a suite of tools
motivated by the belief that food
web structure plays a role in the
success or failure of potential
species invasions.
?
Answer the question “what are
likely prey and predator species
of the invader in the new
environment?”
ELVIS Components
• Species List Constructor
– Click a location, get a species list
– Data integrated from NatureServe; Gap Analysis; Park
Inventories; etc.
• Food Web Constructor
– Input a species list, get a food web
– Uses a database of several hundred published food webs to
predict likely trophic interactions
• Evidence Provider
– Drill down on predicted trophic links to see the evidence for
the prediction
Swoogle: Motivation
• (Google + Web) has made us all smarter
• Something similar is needed by people and software
agents for finding information on the semantic web
Swoogle
• Allows users to search for
both ontologies and
instance data in a number
of ways.
• OntologyRank algorithm
returns Semantic Web
documents according to
their “importance” to the
semantic web.
• A “triple shop” allows a
user to select amongst
returned documents.
Pulling it Together: Triple Shop Demo
• The SPIRE Triple Shop allows a user to specify the URLs of
arbitrary semantic web documents, and to issue SPARQL queries
against the union of those documents.
– It is alpha-version, and is not robust in the general case. However …
• We have expressed each of our 259 food webs in OWL, using the
SpireEcoConcepts ontology.
• We have expressed a number of species accounts from the
Animal Diversity Web in OWL, using the ETHAN ontology.
• For efficiency, we have precomputed all triples entailed by the
original OWL files identified for this demo.
– We are experimenting with ways to do the reasoning in real time.
• The user can issue SPARQL queries over the integrated data.
Slide with Relevant URLs
What kind of food do herons eat?
http://spire.umbc.edu/ont/sparql_demo/query.php?demo=1
What kind of food do herons eat?
http://spire.umbc.edu/ont/sparql_demo/query.php?demo=2
What kind of pond-living or
marsh-living fish do herons eat?
http://spire.umbc.edu/ont/sparql_demo/query.php?demo=3
What kind of pond-living or
marsh-living fish do herons eat?
Show known behavioral
characteristics of those kinds of
fish
http://spire.umbc.edu/ont/sparql_demo/query.php?demo=4
http://thesaurus.nbii.gov/SearchNBIIThesaurus/
What kind of food do herons eat?
Carassius-auratus = Goldfish
Bufo-americanus = American Toad
What kinds of fish do herons eat?
Cyprinus-carpio = Common Carp
What kinds of pond-living or marsh-living fish do herons eat?
Cyprinus-carpio = Common Carp
Pimephales-notatus = Bluntnosed Minnow
What kind of pond-living or marsh-living fish do herons eat? Show
known behavioral characteristics of those kinds of fish.
NBII Web Services
• Web service for the Biocomplexity
Thesaurus
• Prototype developed between
Biocomplexity Thesaurus and GEMET
• Web service for the Integrated Taxonomic
Information System – authority file of
biological organisms and their taxonomies
NBII Services Overview
www.NBII.gov
My.NBII.gov
PORTAL
Integrated View
Distributed
Services
ITIS
Content Management
Collaboration Services
Integrated/Federated Search
Database and Web
Services
Geospatial Services
Model Services
Thesaurus
DIGR
Catalog
Mapping
Georeferencing
Geoparsing
Catalog
Discovery
Operations
Catalog
Resource
Catalog
Resource and
Service Catalogs
Distributed
Resources
Dublin Core(plus)
Geospatial
Services
Catalog
Geospatial
text
Dataset
Resource
Clearinghouse
Database and
Web Services
Catalog
Model
Services
Catalog
OGC/ISO
FGDC/ISO
UDDI/ WSDL
??
Describe and Discover
Distributed Applications
Consume
Databases
Websites
Tools and Models
Biocomplexity Thesaurus Web Services
http://thesaurus.nbii.gov/SearchNBIIThesaurus/
“endangered species”
http://thesaurus.nbii.gov/SearchNBIIThesaurus/
“endangered species”
http://thesaurus.nbii.gov/SearchNBIIThesaurus/
Involvement in Ecoterm
– Subgroup of the Interagency/International
Collaboration on Ecoinformatics Technical Working
Group
– Multilingual issues
– Multi-discipline/domain issues
– Prototypes in terminology registries, metadata and
exchange formats
– Using NBII Web Service Registry to describe web
services across the Ecoterm organizations
– May 2006 meeting will focus on identification and
definition of standard environmental relationships
SKOS Example
Looking Forward:
Microformats and the Semantic Web
• Microformats are a mechanism for embedding
semantics in XHTML documents, using existing
XHTML elements and attributes.
– Much easier to modify existing authoring applications to
incorporate microformats than RDF.
– Already a larger user base (primarily bloggers) than for
RDF.
• SPIRE is experimenting with microformats to
express ecological field data.
Example of Microformat Markup
<span class="vevent">
<a class="url" href="http://www.tiu.ac.jp/org/openforum2006/">
<span class="summary">Open Forum 2006</span>
<abbr class="dtstart" title="2006-03-20">March 20</abbr><abbr class="dtend" title="2006-02-22">22</abbr>,
at the <span class="location">International Conference Center
Kobe, Port Island , Kobe City, Japan</span>
</a>
</span>
Where have we been? Where are we now?…
& where are we
planning to go?
System manuals
11179 E3
Data dictionaries
11179 E1
XML & related standards
XMDR Project
11179 E2
Complex semantics management
Data engineering/XML Data
Data Standards/Data Administration
Semantics management for data
Contact Information
Gail Hodge
Information International Associates, Inc.
312 Walnut Place
Havertown, PA 19083 USA
Phone: +1 865-742-5430
E-mail: [email protected] or [email protected]
Joel Sachs
University of Maryland Baltimore County
Toronto, Canada
Phone: +1 613-447-8653
E-mail: [email protected]
38