Presentation
Download
Report
Transcript Presentation
OBO Foundry Update: April 2010
Barry Smith
OBO Foundry: The Underlying Idea
We have lots and lots of ontologies
Most of them are stand alone
Yet the biology is inherently interconnected
Across granular levels (molecules, cells, organisms, ...)
Across different basic types (objects, processes,
functions ...)
Rigorously enforced modularity
RELATION
TO TIME
CONTINUANT
INDEPENDENT
OCCURRENT
DEPENDENT
GRANULARITY
ORGAN AND
ORGANISM
CELL AND
CELLULAR
COMPONENT
MOLECULE
Organism
(NCBI
Taxonomy)
Cell
(CL)
Anatomical
Organ
Entity
Function
(FMA,
(FMP, CPRO) Phenotypic
CARO)
Quality
(PaTO)
Cellular
Component
(FMA, GO)
Molecule
(ChEBI, SO,
RNAO, PRO)
Biological
Process
(GO)
Cellular
Function
(GO)
Molecular Function
(GO)
Molecular Process
(GO)
3
OBO Foundry
Open Biological/Biomedical Ontologies Foundry
International open initiative since 2006 (Nat Biotech
2007)
Coordinating editors: Michael Ashburner, Chris
Mungall, Suzanna Lewis, Alan Ruttenberg, Richard H.
Scheuermann, Barry Smith
http://obofoundry.org
Create a suite of orthogonal and interoperable
ontologies
Maintain a framework for governance and testing of
best practices
Initiate a formal, peer review process for wellstructured biological accurate ontologies
Why OBO Foundry ontology
standards?
Operational efficiencies
Eliminate duplication of effort, code once, retrieve and
reuse efficiently
Unbiased, neutral perspective
Considering biology/biomedical domain as a whole
Homesteading by experts
who have an incentive to ensure adequate
terminology resources for their respective disciplines
OBO Foundry Principles
Revised Proposals
February 2010
6
OBO Foundry Principles
open (Creative Commons 3.0 CC-by license or equivalent)
common formal language (OBO Format, OWL 2, Common
Logic)
commitment to collaboration
maintenance in light of scientific advance
unique identifier space (URIs)
naming conventions (Susanna / EBI)
versioning (metadata for changes)
7
OBO Foundry Principles
ontologies should be conceivable as the result of
populating downwards from some fragment of (BFO2.0)
clearly delineated content (coherent natural language
definitions of top-level term(s) incorporating crossproduct links to other OBO Foundry ontologies)
the ontology is well-documented (e.g. in a published
paper describing the ontology or in manuals for
developers and users)
plurality of mutually independent users (documented
e.g. via pointers in external URIs, use in cross-products)
8
OBO Foundry Principles
single locus of authority, tracker (SOP), responsive
help desk
textual definitions (SOP) for all terms, plus
equivalent formal definitions (for at least a
substantial number of terms)
all definitions of the genus-species form, utilizing
(some) cross-products
single is_a inheritance (= each ontology should be
conceived as consisting of a core of asserted single
inheritance links, with further is_a relations
inferred)
9
Results of peer review process
2nd OBO Foundry meeting in Hinxton, UK in June 2009
Special mention:
CL (Cell Ontology)
FMA (Foundational Model of Anatomy Ontology)
EnvO: Environment Ontology
HPO: Human Phenotype Ontology
OBI: Ontology for Biomedical Investigations
SO: Sequence Ontology
10
Approved for full membership
CHEBI: Chemical Entities of Biological Interest
GO: Gene Ontology
PATO: Phenotypic Quality Ontology
PRO: Protein Ontology
XAO: Xenopus Anatomy Ontology
ZFA: Zebrafish Anatomy Ontology
11
Goals of the peer review process
identify problems in ontologies
identify, nurture and recommend best practices
education: provide illustrations of good practice
and flagposts for those new to the domain
motivation
of ontology authors
of ontology reviewers
of ontology users, publishers, vendors
analogous to scientific journal peer review
12
Linking ontologies via cross
products
Compound terms are formed out of simpler
constituents
{cysteine, leucine, collagen,..}
X {biosynthesis, metabolism,..}
LEGO examples: GO+CL
ChEBI PRO
CC
cell
anat
envo
PATO
MF
BP
ChEBI
PRO
16
GO-CC
14
Cell
174
8
70
13
Gross anat
19
Envo
phen
MP
273
WP
115
TO
127
disease
GO-MF
GO-BP
2407
10
GO:germ cell nucleus
654
3300
=def?
?
45
432
GO:nucleus
2
355
19
that is part_of
CL:germ cell
511
1085
28
5815
1040
1450
686
1
55
LEGO examples: CL+PATO
ChEBI PRO
CC
cell
anat
envo
PATO
MF
BP
ChEBI
PRO
16
GO-CC
Cell
174
14
8
70
19
CL:spiny neuron
654
3300
=def
45
432
CL:neuron
2
355
that has_quality
PATO:spiny
28
13
Gross anat
Envo
phen
MP
273
WP
115
TO
127
disease
GO-MF
GO-BP
2407
10
511
1085
5815
1040
1450
686
19?
1
55
LEGO examples: CL+ChEBI
ChEBI PRO
CC
cell
anat
envo
PATO
MF
BP
ChEBI
PRO
16
GO-CC
14
Cell
174
8
70
13
Gross anat
19
Envo
phen
MP
273
WP
115
TO
127
disease
GO-MF
GO-BP
2407
CL:estradiol secreting cell
10
654
3300
=def
45
432
CL:secretory
cell355
2
19?
that has_output
CHEBI:estradiol
511
1085
28
5815
1040
1450
686
1
55
LEGO examples: EnvO+ChEBI
ChEBI PRO
CC
cell
anat
envo
PATO
MF
BP
ChEBI
PRO
16
GO-CC
14
Cell
174
8
70
13
Gross anat
19
Envo
phen
MP
273
WP
115
TO
127
disease
GO-MF
GO-BP
2407
ENVO:xylene contaminated28soil
10
654
3300
5815
=def
45
432
1450
ENVO:soil2
355
19?
that has_contaminant
CHEBI:xylene
511
1085
55
1040
686
1
LEGO examples: GO+Anatomy
ChEBI PRO
CC
cell
anat
envo
PATO
MF
BP
ChEBI
19
GO:heart development
=def
16
GO:developmental
process 70
8
that results_in_development_of
FMA:heart
28
MP
273
10
WP
TO
PRO
GO-CC
14
Cell
Gross anat
Envo
phen
GO-BP
13
654
3300
5815
1040
115
45
432
1450
686
127
2
355
511
1085
19?
1
disease
GO-MF
174
2407
55
LEGO examples: Diseases
circulating
glucose
ChEBI MP:abnormal
PRO
CC
cell
anat
envo
PATO
ChEBI
PRO
GO-CC
14
Cell
Gross anat
19
Envo
phen
MP
273
WP
TO
=def
PATO:concentration
that inheres_in
16
MA:blood
8
towards
CHEBI:glucose
10
GO-BP
BP
174
70
13
gluconeogenesis
28
diabetes
654
3300
5815
1040
115
45
432
1450
686
127
2
355
511
1085
19
1
disease
GO-MF
MF
2407
55
Sustainable standards policy
requires
Openness
Readily available to access and open to participation
Separation of Duties
Ontology is vetted by multiple groups
Generational Compliance
Initially support backward compatibility so no one is
stranded
Enforce a more rigorous set of criteria in successive
generation of products
Varieties of application ontology
•
•
•
•
cross-border national parks
slims
cross-product ontologies
template ontologies:
– Infectious Disease Ontology
21
RELATION
TO TIME
CONTINUANT
INDEPENDENT
OCCURRENT
DEPENDENT
GRANULARITY
ORGAN AND
ORGANISM
Organism
(NCBI
Taxonomy)
CELL AND
CELLULAR
COMPONENT
Cell
(CL)
MOLECULE
Anatomical
Organ
Entity
Function
(FMA,
(FMP, CPRO) Phenotypic
CARO)
Quality
(PaTO)
Cellular
Cellular
Component Function
(FMA, GO)
(GO)
Molecule
(ChEBI, SO,
RnaO, PrO)
Molecular Function
(GO)
Biological
Process
(GO)
Molecular Process
(GO)
cross-border national parks: an ontology for studying
the effects of viral infection on cell function22in shrimp
http://www.infectiousdiseaseontology.org/
23