the Exposure Ontology (ExO)

Download Report

Transcript the Exposure Ontology (ExO)

Providing the Missing Link:
The Exposure Ontology ExO
RASS
December 11, 2013
Elaine Cohen Hubal
Chemical Safety for Sustainability Research Program
Office of Research and Development
National Center for Computational Toxicology
Disclaimer. Although this work was reviewed by EPA and
approved for presentation, it may not necessarily reflect official
Agency policy.
May 18, 2011
What Is Exposure Science?
• The bridge between the sources of chemical, physical and
biological agents and human health
– Provides crucial information to estimate real-life risks to health
and to identify the most effective ways to prevent and reduce
these risks.
www.isesweb.org
Office of Research and Development
National Center for Computational Toxicology
1
Exposure for Risk Evaluation: Approaches
•
•
•
•
•
Questionnaire based metrics (epidemiology)
Surrogate exposure metrics (ambient measures)
Exposure measurement (direct or point-of-contact)
Biomonitoring (NHANES)
Modeled estimates (indirect or scenario evaluation)
TRANSPORT, TRANSFORMATION, and
FATE PROCESS MODELS
SOURCE / STRESSOR
FORMATION
ADVERSE
OUTCOME
EXPOSURE
MODELS
TRANSPORT/
TRANSFORMATION
DOSE
PBPK
MODELS
ENVIRONMENTAL
CHARACTERIZATION
EXPOSURE
ACTIVITY
PATTERN
•Individual
•Community
•Population
Office of Research and Development
National Center for Computational Toxicology
2
2
Systems Biology: Exposure at All Levels of Biological Organization
Stressor
Perturbation
Biological
Receptor
Perturbation
Outcome
Environmental
Source
Ambient
Exposure
Environmental
Source
Personal
Exposure
Population
Individual
Internal Exposure
(Tissue Dose)
Tissue
Dose to Cell
Cell
Dose of Stressor
Molecules
Office of Research and Development
National Center for Computational Toxicology
Biological
Molecules
Disease
Incidence/Prevalence
Disease State
(Changes to Health Status)
Dynamic Tissue Changes
(Tissue Injury)
Dynamic Cell Changes
(Alteration in Cell Division,
Cell Death)
Dynamic Changes in
Intracellular Processes
Cohen Hubal, JESEE, 2008
3
Exposure for Translation
Susceptibility
(Genetic Variants / Epigentic Modifications)
Biological Insight
(Toxicity Pathways)
Environmental Factors
(Exposure)
Improved Measures of Individual Etiological
Processes and Individual Exposures
Key Perturbations
Key Targets
Biomarkers
Indicators Metrics
Personal Risk Profile
Information
Extrapolation for
Risk Assessment
Education
Personal Risk
Management
Office of Research and Development Public Health Policy
National Center for Computational Toxicology Prevention
Cohen Hubal, et al. JTEH, 2010
Knowledge Systems – Enabling Hypothesis Development
• Computational Techniques – Two Branches
A combination of discovery and engineering (mechanistic)-based
modeling approaches required for hypothesis development and testing
• Knowledge-discovery
– Data-collection, mining, and analysis
– Required to extract information from extant data on critical
exposure determinants, link exposure information with toxicity
data, and identify limitations and gaps in exposure data.
• Mechanistic (dynamic) simulation
– Mathematical modeling at various levels of detail
– Required to model the human-environment system and to test our
understanding of this system.
Office of Research and Development
National Center for Computational Toxicology
5
Exposure-Hazard Knowledge System
• Translation of HTP hazard information requires holistic risk
assessment knowledge system
– Include ontologies, databases, linkages
– Facilitate computerized collection, organization, and retrieval of
exposure, hazard, and susceptibility information
– Define relationships, allow automated reasoning, facilitate meta
analyses
• Standardized exposure ontologies required to
– Develop biologically-relevant exposure metrics
– Design and interpret in vitro toxicity tests
– Incorporate information on susceptibility and background
exposures to assess individual and population-level risks
Office of Research and Development
National Center for Computational Toxicology
6
Schematic of ontologies, databases and ontology/database
linkages needed for the efficient development of a
Foods-for-Health Knowledge System
MC Lange, et al. (2007) A multi-ontology framework to
Office of Research and Development
guide National
agriculture
food Toxicology
towards diet and health.
Center for and
Computational
J Sci Food and Ag 87(8)1427-34.
7
Exposure Data Sources
Production/
Import
Volumes
Source/Stressor
Formation
EPA IUR
Environmental
Concentration
Exposure
EPA TRI
Chemicals in
Consumer
Products
DOE GHG
EU ESIS
Product
Usage
Information
Transport/Fate
Environmental
Releases
EPA HPVIS
EPA Pesticide
Usage Data
Production/
Process
Information
UK Pharmaceutical Usage
Indoor Air
Monitoring
Data
Human
Exposure
Monitoring
Household
Products DB
Cosmetic
Voluntary
Reg. DB
Environmental
Transformation
EPA HPVIS
ATSDR Tox
Profiles
ECOTOX DB
DEA NFLIS
DOE IndoorAir
NHEXAS
CTEPP
NHEXAS
EUROPA PestDiet
Outdoor Air
Monitoring
Data
Activity
Patterns
Information
Environmental
Fate Simulator
CESAR
EPA NATA
EPA AIRS/AFS
Exposure
Limits
UN IPCC GHG
EPA NHAPS
EPA CHAD
Human
Biological
Monitoring
Data
NIOSH NOES
AIHA WEEL
OSHA PEL
CDC NHANES
NHEXAS
CTEPP
Peter Egeghy, NERL
Office of Research and Development
National Center for Computational Toxicology
8
http://actor.epa.gov
Exposure Data Landscape
Office of Research and Development
National Center for Computational Toxicology
9
Network of exposure taxonomy used in ACToR; Egeghy et al, 2011
Exposure Data Landscape
Number of Unique Chemicals
10000
1000
100
10
1
Production
Volume
Use
Category
Food Use
Chemical
Release
Water
Conc.
Soil Conc.
Food Conc.
Air Conc.
Biomarker
Conc.
Data Type
Office of Research and Development
National Center for Computational Toxicology
10
Number of unique chemicals by data type in ACToR; Egeghy et al, 2011
Exposure Data Collection and Access: ACToR,
Aggregated Computational Toxicology Resource
http://actor.epa.gov/
ACToR API
Chemical
ID,
Structure
Chemical
ACToR Core
ToxRefDB
Tabular Data,
Links to Web
Resources
In Vivo Study
Data - OPP
Internet
Office ofSearches
Research and Development
DSSTox
ToxCastDB
ToxCast Data –
NCCT, ORD,
Collaborators
Chemical ID &
Structure QC,
Inventory
Tracking
ExpoCastDB
Exposure Data –
NERL, NCCT
National Center for Computational Toxicology
11
Exposure Data Collection and Access :
ExpoCastDB Goals
• Consolidate observational human exposure data, improve access
and provide links to health related data
• House measurements from human exposure studies
• Encourage standardized reporting of observational exposure information
• Provide separate interface with inner workings of ACToR
• Facilitate linkages with toxicity data, environmental fate data, chemical
manufacture information
• Provide basic user functions
http://actor.epa.gov/
• Visualization (e.g., scatterplots, probability plots, goodness-of-fit)
• Obtain summary statistics and estimate distributional parameters
• Download customized datasets
Office of Research and Development
National Center for Computational Toxicology
Generic_chemical
table in ACToR
Exposure Data Collection and Access: ExpoCastDB
1
1
N
Laboratory method
N
Measure
(Chemical)
N
Technique /
sampling method
N
Location_CV
(cont vocab.)
• Four initial studies
from National
Exposure Research
Laboratory
• Full raw data sets
available for
download
N
1
N
N
1
N
1
N
Sample
Medium
(serum, air, soil, etc.)
N
• Browse data
capability
N
• Descriptive
statistics capabilities
N
Location
N
N
N
N
N
N
N
N
1
N
Subject
N
N
N
N
N
Exposure
Taxonomy
(Assay_
category_
CV table
in ACToR)
N
N
Study
Sources
N
N
N
Study_CV
cont. vocab
1
Office of Research and Development
National Center for Computational Toxicology
N
N
13
ExO: An Ontology for Exposure Science
Office of Research and Development
National Center for Computational Toxicology
14
Exposure Data Collection and Access:
Design and Evaluation of the Exposure Ontology, ExO
Background:
• Significant progress has been made in collecting and improving access
to genomic, toxicology, and health data
• These information resources lack exposure data required to
– translate molecular insights
– elucidate environmental contributions to diseases
– assess human health risks at the individual and population levels
Aim:
• Facilitate centralization and integration of exposure data to inform
understanding of environmental health
• Bridge gap between exposure science and other environmental health
disciplines
Vehicle:
• Carolyn Mattingly, Mount Desert Island Biological Laboratory
• LRI seed funding, followed by NIEHS RO1
Office of Research and Development
National Center for Computational Toxicology
15
Ontologies
• An ontology is a formal representation of knowledge within a domain and
typically consists of classes, the properties of those classes, and the
relationships between these
(Gruber, Int. J. Human-Computer Studies, 1995)
• Many fields are developing ontologies to
– Organizing and analyzing large amounts of complex information from
multiple scientific disciplines
– Provide unprecedented perspective
– Enable more informed hypothesis development
(http://www.obofoundry.org/)
Office of Research and Development
National Center for Computational Toxicology
16
Design and Evaluation of the Exposure Ontology: ExO
• Develop an exposure ontology consistent with those being used
in toxicology and other health sciences
• Facilitate centralization and integration of exposure data to inform
understanding of environmental health
• Bridge gap between exposure science and other environmental
health disciplines
– Initially focus development on human exposure to chemicals
– Ultimately, provide domains that can be extended to
encompass exposure data for the full range of receptors and
stressors
Office of Research and Development
National Center for Computational Toxicology
17
Exposure Ontology Working Group
Working Group Member
Institution
Role/expertise
Carolyn Mattingly, PhD
Judith Blake, PhD.
Mount Desert Island
Biological Laboratory
The Jackson Laboratory
Michael Callahan, PhM.
MDB, Inc.
Robin Dodson, ScD.
Silent Spring Institute (SSI)
Facilitator/curated database
development
Facilitator/ontology
development
Core Member/
exposure assessment
Core Member/
exposure research
Member/exposure research
Peter Egeghy, PhD.
US EPA, NERL
Member/exposure research
Jane Hoppin, ScD.
NIEHS
Member/epidemiology
Thomas McKone, PhD.
Lawrence Berkeley National
Laboratory (LBNL)
Core Member/
exposure research
Ruthann Rudel, MS.
Silent Spring Institute (SSI)
Member/exposure research
Elaine Cohen Hubal, PhD US EPA; NCCT
Office of Research and Development
National Center for Computational Toxicology
18
Phases of Exposure Ontology Development
Phase II
Phase I
Phase III
Phase IV
Disseminate
exposure
ontology for
public feedback
Initial pilot curation
to identify major
concepts
Model relationships
among data concepts
Full working group
Expand test data set to evaluation of draft
evaluate extensibility of
ontology
conceptual model and cross
-reference existing ontologies
Iterate data model
refinement and curation
Office of Research and Development
National Center for Computational Toxicology
Definitions of Central Concepts
• Exposure Stressor - An agent, stimulus, activity, or event that causes
stress or tension on an organism and interacts with an exposure receptor
during an exposure event.
• Exposure Receptor - An entity (e.g., a human, human population, or a
human organ) that interacts with an exposure stressor during an
exposure event.
• Exposure Event - An interaction between an exposure stressor and an
exposure receptor.
• Exposure Outcome - Entity that results from the interaction between an
exposure receptor and an exposure stressor during an exposure event.
Office of Research and Development
National Center for Computational Toxicology
Mattingly et al, submitted
20
Biolog.
Agent
Relational View of Selected ExO Domains
Public
Policy
Chem.
Agent
•Source
•Location
•Process
•Transport Path
Biomech.
Agent
Exposure
Stressor
Individual
Anthrosphere
•Location
•Genetic Background
•Lifestage
•Health Status
•Socioeconomic Status
Office•Occupation
of Research and Development
National Center for Computational Toxicology
Intervention
Biolog.
Response
Phys.
Agent
Psychosoc.
Agent
Human
Pop.
Exposure
Outcome
Exposure
Receptor
Disease
Symptom
Exposure
Event
Molecular
Response
•Location
•Temporal Pattern
•Intensity
•Route
•Assay
•Medium
•Method
•Location
Mattingly et al, submitted
High-level schematic of Exposure Ontology (ExO) integration
within a broader biological context.
Encode
Annotated with
Chemical
(e.g., MeSH)
Is a
Genes
Interacts with
(e.g., CTD)
Exposure Receptor
(e.g., ExO)
Interacts with
(e.g., CTD)
Is a
Office of Research and Development
National Center for Computational Toxicology
Interact via
Occur within
Biological System
(e.g., Functional model of
anatomy)
Assessed by
Via an
Exposure Event
(e.g., ExO,
ExpoCastDB)
Gene
Products
Pathways, Networks
Reactions
(e.g., KEGG, Reactome)
Exposure Stressor
(e.g., ExO)
Interacts with
Biological Process
Molecular Function
Cellular Component
(e.g., Gene Ontology
Results in an
Exposure Outcome
Phenotype
(e.g., OMIM, MeSH)
Mattingly et al, submitted
Next Steps
• Open source approach
• With input from the scientific community, further specify branches
• Leverage existing ontologies (e.g., CHEBI and MeSH for “Chemical
agent” Stressors; DO, OMIM and MeSH for “Disease” Outcomes).
• Cross-referencing will underscore where ExO fits into a broader
knowledge space and where it may add value to existing ontologies.
Office of Research and Development
National Center for Computational Toxicology
23
Beyond EPA:
Pilot Curation of Exposure Data into CTD
Chemicals
Exposure
Data (curated
and public
sources)
chemical-gene
interactions
Genes
chemical-disease
relationships
gene-disease
relationships
Diseases
functional annotations
Carolyn Mattingly
Office of Research and Development
National Center for Computational Toxicology
pathway data
24
Acknowledgements
• ExO–
• Carolyn Mattingly,
• Tom McKone,
• Judy Blake,
ExpoCastDB -Richard Judson,
Peter Egeghy,
Sumit Gangwal
• Mike Callahan
Office of Research and Development
National Center for Computational Toxicology
25