ISMB powerpoint presentation.
Download
Report
Transcript ISMB powerpoint presentation.
KELLER: Estimating Time Evolving
Interactions Between Genes
Le Song
[email protected]
Joint work with Mladen Kolar and Eric Xing
Transient Biological Processes
2
PPI Network
3
3
Time-Varying Interactions
4
The Big-Picture Questions
What are the interactions?
What pathways are active at a particular time
point and location?
How will biological networks respond to
stimuli (eg. heat shot)?
5
Regulation of cell response to stimuli is
paramount, but we can usually only measure
(or compute) steady-state interactions
Transcriptional
interactions
▲ Chromatin IP
▲ Microarrays
Protein—protein
interactions
▲ Protein coIP
▲ Yeast two-hybrid
Biochemical
reactions
▲ Metabolic flux
measurements
6
Current Practice
Static Networks
Microarray Time Series
t=1
2
3
T
…
Dynamic
Bayesian
Networks
7
Our Goal
Reverse engineer temporal/spatial-specific “rewiring”
gene networks
--- what are the difficulties?
t*
n=1
Time
8
Two Scenarios
Smoothly evolving networks
Abruptly changing networks
9
Scenario I (This paper)
Kernel reweighted L1-regularized logistic regression
(KELLER)
Key Idea I: reweighting observations
Key Idea II: regularized neighborhood estimation
10
Key Idea
Weight temporally adjacent observations more than
distal observations
11
Key Idea
Estimate the neighborhood of each gene separately
via L1-regularized logistic regression
L1-regularization
Kernel
Reweighting
Log-likelihood
12
Consistency
Theorem 1: Under certain verifiable conditions
(omitted here for simplicity), KELLER recovers the
true topology of the networks:
13
Synthetic data
Number of Samples
DBN and static networks do not benefit
from more observations
14
Senario II
Key idea: Temporally Smoothing
…
TESLA:
Tesla (Amr and Xing, PNAS 2009)
15
Drosophila Life Cycle
66 microarrays across
Adult
full life cycle
Embryo
588 genes related to
development
Pupa
Larva
16
biological process
molecular
function
cellular
component
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
Network Size vs. Clustering Coefficient
mid-embryonic
mid-pupal
40
Network Size vs. Clustering Coefficient
mid-embryonic stage
tight local clusters
mid-pupal stage
loose local clusters
41
Interactivity of Gene Sets
27 genes based on ontology
42
Interactivity of Gene Sets
25 genes based on ontology
43
Transient Gene Interactions
Time
Gene Pairs
Active
Inactive
msn dock
sno Dl
44
Transcriptional Factor Cascade
Summary networks 36 transcription factors
Node size
its total activity
45
TF Cascade – mid-embryonic stage
46
TF Cascade – mid-larva stage
47
TF Cascade – mid-pupal stage
48
TF Cascade – mid-adult stage
49
Transient Group Interactions
50
Conclusion
KELLER for reverse engineering “rewiring” networks
Key advantages:
Computationally efficient (scalable to 104 genes)
Global optimal solution is attainable
Theoretical guarantee
Glimpse to temporal evolution of gene networks
Many interactions are rewiring and transient
Availability: http://www.sailing.cs.cmu.edu/
51
The End
Thanks
Travel fellowship:
Office of Science (BER), U.S. Department of
Energy, Grant No. DE-FG02-06ED64270
Funding: Lane Fellowship,
Questions?
52
Interactivity of Gene Sets
30 genes based on ontology
53
Timing of Regulatory Program
Galactose
54
Challenges
Very small sample size
Experimental data are scarce and costly
Noisy measurement
More genes than microarrays
Complexity regularization needed to avoid over-
fitting
Observations no longer iid since the networks are
changing!
55