Transcript TF binding
Epigenetics
Xiaole Shirley Liu
STAT115, STAT215, BIO298, BIST520
Epigenetics
• Heritable changes in gene function that occur
without a change in the DNA sequence
– How come not all the motif sites are bound by the
factor?
– How come TF binding only regulate some of the
nearby genes?
Epigenetics
• The study of heritable (transgenerational)
changes in gene activity that are not caused by
changes in the DNA sequence
• The study of stable, long-term alterations in the
transcriptional potential of a cell that are not
necessarily heritable
• Functionally relevant changes to the genome that
do not involve a change in the nucleotide
sequence
In Human
• Nature vs nurture
• Zygotic twins: same DNA different epigenome
• North American Ice Storm of 1998
Agouti Mice and DNA Methylation
The Making of a Queen
Larvae
From Ting Wang, Wash U
Queen
Worker
Epigenetic Landscape
Conrad Hal Waddington
(1905–1975)
Developmental biologist
Paleontologist
Geneticist
Embryologist
Philosopher
Founder for systems biology
Components
• DNA-methylation
• Nucleosome
position and
histone
modifications
• Chromatin
accessibility
• Higher order
chromatin
interactions
• Analogy
DNA Methylation Distribution in
Mammalian Genomes
• In human somatic cells, 60%-80% of all CpGs
(~1% of total DNA bases) are methylated
– Most methylation is found in “repetitive”
elements
• “CpG islands”, GC-rich regions that possess a high
density of CpGs, remain methylation-free
– The promoter regions of ~70% of genes have
CpG islands
From Ting Wang, Wash U
Two classes of DNA methyltransferases
(DNMTs)
Jones and Liang, 2009
Nature Review Genetics
Inheritance of DNA Methylation
From Ting Wang, Wash U
DNA Methylation Detection
• Bisulfite sequencing
– Unmethyl C T
– High resolution, quantitative, but expensive!
From Wei Li, Baylor
BS-seq Methylation Call
• Most regions are either mostly methylated or
mostly unmethylated (dichotomy)
• Methylation level within a short distance is
consistent
ACGGGCTTACTTGCTTTCCTACGGGCTTACTTGCTTTCCTACGGGCTTACTTGCTTTCCTACGGGCTTACTTGC
CGGGTTTATTTGCTTTTTTATGGGC
TGGGTTTATTTGCTTTTTTATGGGC
TGGGTTTATTTGCTTTCCTATGGGC
CGGGCTTATTTGCTTTCCTATGGGC
CGGGCTTATTTGCTTTCCTATGGGC
3/5
60% methylated
From Ting Wang, Wash U
0/5
0% methylated
DNA Methylation Controls
Gene Expression
• Methylation at CpG islands often repress nearby
gene expression
• Many highly expressed genes have CpG
methylation on their exons
Some genes could
be imprinted, so
maternal and
paternal copies
have different
DNA methylation
From Ting Wang, Wash U
DNA Methylation in Cancer
• Prevalent misregulation of DNA methylation in
cancer: global hypomethylation and CpG island
hypermethylation
• Methylation variable regions in cancer
DNA Demethylation
• Recently, another type of DNA methylation
called hydroxyl methylation (hmC) is found
• hmC is an intermediate step between mC and C.
• TET family of proteins are important for DNA
demethylation
• Mutation in TET is linked to many cancers
Components
• DNA-methylation
• Nucleosome
position and
histone
modifications
• Chromatin
accessibility
• Higher order
chromatin
interactions
• Analogy
Nucleosome Occupancy & Histone
Modification Influence Factor Binding
TF
Histone Modifications
• Different modifications at different locations by
different enzymes
Histone Modifications in Relation to Gene Transcription
Bisulfite-Seq
H3K27ac
H3K4me1
H3K4me3
H3K36me3
H3K27me3
H3K9me3
RefSeq genes
SRPK1
SLC26A8
MAPK14
From Ting Wang, Wash U
Histone Modifications
• Gene body mark: H3K36me3, H3K79me3
• Active promoter (TSS) mark: H3K4me3
• Active enhancer (TF binding) mark: H3K4me1,
H3K27ac
• Both enhancers and promoters: H3K4me2,
H3/H4ac, H2AZ
• Repressive promoter mark: H3K27me3
• Repressive mark for DNA methylation:
H3K9me3
lncRNA Identification
• H3K4me3 active promoters
• H3K36me3 transcription elongation
Guttman et al, Nat 2009
23
24
25
Nucleosome Occupancy & Histone
Modification Influence Factor Binding
MNase digest
Antibody for
TF
Combine Tags From All ChIP-Seq
Extend Tags 3’ to 146 nt
Check Tag Count Across Genome
Take the middle 73 nt
Use H3K4me2 / H3K27ac Nucleosome
Dynamics to Infer TF Binding Events
/ac
/ac
/ac
/ac
/ac
Condition 1
Condition 2
Nucleosome Stabilization-Destabilization (NSD) Score
He et al, Nat Genet, 2010; Meyer et al, Bioinfo 2011
30
Condition-Specific Binding,
Epigenetics and Gene Expression
C1
C1
C2
C2
• Condition-specific TF bindings are associated
with epigenetic signatures
• Can we use the epigenetic profile and TF motif
analysis to simultaneous guess the binding of
many TFs together?
Genes
TF1
TF2
TF3
Epigenetics
31
Predict Driving TFs and
Bindings for Gut Differentiation
32
Identify Major TF Modules Regulating
Gut Differentiation and Function
GATA6
Cdx2
Embryonic and organ
development genes
Cdx2
HNF4
Metabolic and
digestive genes
• Nucleosome dynamics now applied to hematopoiesis and
cancer cell reprogramming
33
Verzi et al, Dev Cell, 2010
Components
• DNA-methylation
• Nucleosome
position and
histone
modifications
• Chromatin
accessibility
• Higher order
chromatin
interactions
• Analogy
DNase Hypersensitive (HS) Mapping
• DNase randomly cuts genome (more often in
open chromatin region)
• Select short fragments (two nearby cuts) to
sequence
• Map to
active
promoters
and
enhancers
Ling et al, MCB 2010
DHS Peaks
Capture Most TF
Binding Sites
• Motif occurrence in the
DHS peaks suggest TF
binding
• Quantitative signal
strength also suggest
binding stability
Thurman et al, Nat 2012
TF Network from DNase Footprint
37
DnaseI Cleavage vs Footprint
• Footprint occupancy score: FOS = (C + 1)/L + (C + 1)/R
• Smaller FOS value better footprint, for
L C R
predicting base resolution TF binding
GAT ACA
CTA TGT
38
DnaseI Cleavage vs Footprint
• Footprint occupancy score: FOS = (C + 1)/L + (C + 1)/R
• Smaller FOS value better footprint, for
L C R
predicting base resolution TF binding
• Intrinsic DNase cutting bias could
have 300-fold difference, creating fake footprints
GAT ACA
CTA TGT
CAGATA
CAGATC
…
ACTTAC
ACTTGT
0.004
0.004
1.225
1.273
39
Using DNaseI “Footprint” to
Predict TF Binding
• Using base-pair resolution cleavage pattern
(“footprint”) hurts TF binding prediction when it
is similar to intrinsic DNaseI cutting bias
40
Using DNaseI “Footprint” to
Predict Novel TF Motifs
41
He et al, Nat Meth 2013
Epigenetics and Chromatin
Transcription and Epigenetic Regulation
• Stem cell differentiation
• Aging brain
• Cancer