Interpretation of microarray expression data using

Download Report

Transcript Interpretation of microarray expression data using

Interpretation of microarray
expression data using
ontology browsing
Meeting the needs of a new era
Anders Bresell
17/07/2015
Presentation of Master Thesis
1
Motivation
After we have preformed a microarray
analysis we have a set of interesting genes
that we want to know more about.
The existing knowledge are found in
articles and databases. But we need a lot
of time to skim through the information to
get a hunch of what the knowledge is
about.
17/07/2015
Presentation of Master Thesis
2
Motivation
In order to get a first glance of what
knowledge we have about a set of genes.
We need to find a way to represent the
knowledge in a more condensed way.
For this we used Ontologies.
17/07/2015
Presentation of Master Thesis
3
What is an Ontology?
Definition:
An explicit specification of
conceptualisation [Gruber 93]
A controlled vocabulary that has
specified relationships between the
terms.
17/07/2015
Presentation of Master Thesis
4
Why is an ontology so nice?
• It provides a common language of a
domain for human agents.
• It gives a inter-operability between
computer systems.
• It enables an improvement of the
search and knowledge acquisition
process.
17/07/2015
Presentation of Master Thesis
5
Example of an Ontology
Biological Process
Cell communication
Response to external stimulus
DNA protection
Cell growth and maintenance
Metabolism
Nucleotide metabolism
DNA metabolism
DNA protection
Stress response
DNA protection
17/07/2015
Presentation of Master Thesis
6
Ontology 4
Existing Ontologies
MeSH
All published litterature is indexed by
the terms from MeSH.
Gene Ontology
Organism independent ontology
describing Molecular function, Biological
process and Cellular component.
17/07/2015
Presentation of Master Thesis
7
Information
17/07/2015
Presentation of Master Thesis
8
Annotation
Article (in Medline)
GENE
Term 1
17/07/2015
Term 2
Presentation of Master Thesis
Term 3
9
Solution
• Instead of having the information in the
form of articles and databases we
represent it as a set of keywords or
terms.
• By using ontologies we could specify or
generalise the terms.
17/07/2015
Presentation of Master Thesis
10
Ontology Annotation Tree browser (OAT)
• We built a tool
– Database (holding the annotations and the
tree structure of the ontology)
– Tree Browser tool (Visualises the
ontology)
17/07/2015
Presentation of Master Thesis
11
The OAT-tool
Checkbox for terms
which are to be
included in the
report
Significance of the
Annotation
Number of genes
annotated with this
term
Number
of genes
below
Number of annotations below
Link to MeSH description
17/07/2015
Presentation of Master Thesis
12
Report feature
17/07/2015
Presentation of Master Thesis
13
Summary of OAT 1
• Only terms that are relevant for the
genes are displayed.
• This tool only aims to give some brief
information about a group of genes.
• The set of genes must undergo further
studies. The report page directs us to
the articles we used as source when we
made the annotations.
17/07/2015
Presentation of Master Thesis
14
Limitations of OAT
• The OAT-tool is totally dependent on the
annotations. If we have no connections
between a gene and a term, the tool
would not tell us anything about the
gene.
• We will import more annotations during
this summer. Then the tool will be a big
help in the digestion of information.
17/07/2015
Presentation of Master Thesis
15
More Annotations
• GeneOntology
– GOA (Swissprot, TrEMBL, Interpro etc.)
High quality
– Compugene
Unsupervised!!
• MeSH
– Human Protein (Swissprot, OMIM)
Text Mining by Ian Dix et al.
17/07/2015
Presentation of Master Thesis
16
Modifications
• Change of DBMS from Berkeley DB to
MySQL.
– To many annotations.
– To process demanding calculation due to
the inflexible querying of data.
– Other systems could use the data.
– Makes the updating of data easy.
17/07/2015
Presentation of Master Thesis
17
Feedback
Give me feedback…
17/07/2015
Presentation of Master Thesis
18
Acknowledgement
•
•
•
•
•
Patrick Lambrix
Bo Servenius
Per Broberg
Thomas Breslin
Tobias Fändriks
17/07/2015
Presentation of Master Thesis
19