Strategies for functional modeling
Download
Report
Transcript Strategies for functional modeling
Strategies for functional
modeling
TAMU GO
Workshop
17 May 2010
Types of data sets and modeling
Commercial array data – more likely to have ID
mapping to support functional modeling.
Custom/USDA array data – may need to do your
own ID mapping: see examples on workshop
page.
Proteomics data
RNA-Seq data sets – computational pipelines to
assign GO (GOanna is limited; contact AgBase).
Real-time data or quantitative proteomics data –
hypothesis testing.
Overview of Functional Modeling Strategy
Microarray Ids
ArrayIDer
Protein/Gene
identifiers
Pathways and
network analysis
GO Enrichment
analysis
GORetriever
Ingenuity Pathways Analysis (IPA)
Pathway Studio
Cytoscape
DAVID
Ingenuity Pathways Analysis (IPA)
Pathway Studio
Cytoscape
DAVID
EasyGO/AgriGO
Onto-Express
Onto-Express-to-go (OE2GO)
GO annotations
Genes/Proteins with
no GO annotations
GOanna
GOSlimViewer
Yellow boxes represent AgBase tools
Green/Purple boxes are non-AgBase resources
Functional Modeling Considerations
Should I add my own GO?
Should I do GO analysis and pathway analysis and
network analysis?
use GOSlimViewer to see how much GO is available for your
species
use GORetriever to see how much GO is available for your
dataset
different functional modeling methods show different aspects
about your data (complementary)
is this type of data available for your species (or a close
ortholog)?
What tools should I use?
which tools have data for your species of interest?
what type of accessions are accepted?
availability (commercial and freely available)
Converting accessions
Depending on your data set & the tools you use,
you are likely to need to convert between
database accessions to do your functional
modeling.
UniProt database – ID mapping tab
Ensembl BioMart
Online analysis tools:
DAVID
g:profiler
GORetriever
ArrayIDer – converts EST accessions
Converting accessions (cont’d)
Commercial arrays
EST arrays
Commercial ID
mapping eg. NetAffy
Ensembl BioMart
Online tools
(g:convert, DAVID)
ArrayIDer
Custom arrays
Proteomics
UniProt ID Conversion
RNA-Seq data
Working on your own data or examples:
1.
Your own data set
2.
New to GO
3.
retrieve existing GO (accession conversion?) &
group using slim sets
try functional grouping (DAVID, AgriGO, etc)
GO browser tutorials to familiarize yourself with
GO
work on some example data sets
Example data sets
Your own data
Start by retrieving existing GO
(GORetriver)
may need to do accession conversion
GOanna – for sequence data sets
If you haven’t had results returned from
GOanna, sample results are available in the
example data sets
Try functional analysis using DAVID,
AgriGO or etc
For help with hypothesis modeling etc, see
me.
GO Browsers
search for gene products
search for GO terms
retrieve batch GO
some analysis tools (slim sets, enrichment
analysis, etc)
QuickGO at EBI
http://www.ebi.ac.uk/QuickGO/
AmiGO at GO Consortium
http://amigo.geneontology.org
Does not include IEA annotations
Example Dataset 1
Chicken Affymetrix Array
1.
2.
3.
4.
5.
Converting Accessions
Retrieving GO annotations
Grouping using GOSlimViewer
GO term enrichment analysis using DAVID
GO term enrichment analysis using AgriGO
Example Dataset 2
EST Array and adding your own GO
1.
2.
3.
4.
Converting Accessions
Retrieving GO annotations
Adding GO annotations
GO enrichment analysis using additional GO
annotations
What other
information
should we add
to your
workshop
website??