Strategies for functional modeling

Download Report

Transcript Strategies for functional modeling

Strategies for functional
modeling
TAMU GO
Workshop
17 May 2010
Types of data sets and modeling





Commercial array data – more likely to have ID
mapping to support functional modeling.
Custom/USDA array data – may need to do your
own ID mapping: see examples on workshop
page.
Proteomics data
RNA-Seq data sets – computational pipelines to
assign GO (GOanna is limited; contact AgBase).
Real-time data or quantitative proteomics data –
hypothesis testing.
Overview of Functional Modeling Strategy
Microarray Ids
ArrayIDer
Protein/Gene
identifiers
Pathways and
network analysis
GO Enrichment
analysis
GORetriever
Ingenuity Pathways Analysis (IPA)
Pathway Studio
Cytoscape
DAVID
Ingenuity Pathways Analysis (IPA)
Pathway Studio
Cytoscape
DAVID
EasyGO/AgriGO
Onto-Express
Onto-Express-to-go (OE2GO)
GO annotations
Genes/Proteins with
no GO annotations
GOanna
GOSlimViewer
Yellow boxes represent AgBase tools
Green/Purple boxes are non-AgBase resources
Functional Modeling Considerations

Should I add my own GO?



Should I do GO analysis and pathway analysis and
network analysis?



use GOSlimViewer to see how much GO is available for your
species
use GORetriever to see how much GO is available for your
dataset
different functional modeling methods show different aspects
about your data (complementary)
is this type of data available for your species (or a close
ortholog)?
What tools should I use?



which tools have data for your species of interest?
what type of accessions are accepted?
availability (commercial and freely available)
Converting accessions




Depending on your data set & the tools you use,
you are likely to need to convert between
database accessions to do your functional
modeling.
UniProt database – ID mapping tab
Ensembl BioMart
Online analysis tools:




DAVID
g:profiler
GORetriever
ArrayIDer – converts EST accessions
Converting accessions (cont’d)

Commercial arrays


EST arrays

Commercial ID
mapping eg. NetAffy
Ensembl BioMart
Online tools
(g:convert, DAVID)
ArrayIDer

Custom arrays


Proteomics

UniProt ID Conversion

RNA-Seq data

Working on your own data or examples:
1.
Your own data set


2.
New to GO


3.
retrieve existing GO (accession conversion?) &
group using slim sets
try functional grouping (DAVID, AgriGO, etc)
GO browser tutorials to familiarize yourself with
GO
work on some example data sets
Example data sets
Your own data

Start by retrieving existing GO
(GORetriver)


may need to do accession conversion
GOanna – for sequence data sets

If you haven’t had results returned from
GOanna, sample results are available in the
example data sets
Try functional analysis using DAVID,
AgriGO or etc
 For help with hypothesis modeling etc, see
me.

GO Browsers
search for gene products
 search for GO terms
 retrieve batch GO
 some analysis tools (slim sets, enrichment
analysis, etc)
 QuickGO at EBI



http://www.ebi.ac.uk/QuickGO/
AmiGO at GO Consortium


http://amigo.geneontology.org
Does not include IEA annotations
Example Dataset 1
Chicken Affymetrix Array
1.
2.
3.
4.
5.
Converting Accessions
Retrieving GO annotations
Grouping using GOSlimViewer
GO term enrichment analysis using DAVID
GO term enrichment analysis using AgriGO
Example Dataset 2
EST Array and adding your own GO
1.
2.
3.
4.
Converting Accessions
Retrieving GO annotations
Adding GO annotations
GO enrichment analysis using additional GO
annotations
What other
information
should we add
to your
workshop
website??