PAS Meeting - University of North Carolina at Chapel Hill
Download
Report
Transcript PAS Meeting - University of North Carolina at Chapel Hill
Genes
Eukaryotic Protein-Coding Gene Structure
coding
non-coding
Regulatory Region
Size: 50 > 10,000 bp
Contains multiple small DNA sequence elements (5 – 20 bp) >
bind regulatory proteins
Regulatory elements can be negative or positive acting
Regulatory regions found in 5’ flanking region, introns, and 3’
flanking regions – most common in 5’ flanking regions and
large introns
5’-Untranslated Region
Contained in mRNA
Spans from start of transcription to start of translation
Multiple functions – translational efficiency
Size varies greatly - average > 300 nt (human)
coding
non-coding
Coding Sequence
Begins with initiator methionine (AUG codon)
Sometimes multiple initiator methionines are used
Stops with termination codon (UAA, UAG, and UGA)
Sizes varies: average = 1340 nt (human); encodes ~450 aa
protein
coding
non-coding
3’ Untranslated Region
Spans translational termination codon > end of mRNA
Multiple functions: mRNA stability and localization
AAUAAA sequence signals where poly(A) is to be added
(10-35 nt upstream from cleavage/poly(A) site)
Size varies: average - 700 nt (human)
coding
non-coding
Poly(A)
Added posttranscriptionally (not encoded in gene)
Size varies (10-200 nt) depending on organism
Functions: mRNA stability and translational efficiency
Size of tract shortens with time
All mammalian mRNAs have poly(A) except histone mRNAs
Poly(A)
Exons
Genes have a modular design
Evolutionarily assembled in pieces
Functional unit > exons
# exons can vary from 1 > 178
Average # exons/gene – different organisms
Yeast
~1
Drosophila
4
Human
9
Human genes (mean sizes)
Exon size
145 bp
coding
non-coding
Introns
Introns vary greatly in size
Most ~ 50 bp but can be > 15 kb
Large genes – large introns
Small genes – small introns
Size differs between species
C. elegans
267 bp
Drosophila
487 bp
Human
3,365 bp
Human introns > exons in size
Intron 1
Intron 2
Genetics
Mutants
Wild-type – “normal” fully-active gene
Null – absence of any activity (e.g. deletion)
Hypomorph – reduced function
Hypermorph – enhanced activity
Neomorph – expressed in cells normally not expressed (transgenic
approach)
Phenotypic analysis – development, morphology, behavior, fertility, etc.
Gene regulation
Examine how mutation in Gene A influences expression of other
genes
Genetic and Molecular Genic Relationships
Organism
Genes
Lethal loci (%total genes)
Yeast
5,800
1,800 (30%)
Nematode
18,400
3,500 (20%)
Drosophila
13,600
3,600 (25%)
Mouse – similar % based on gene knockout studies
Lethal loci – loss of function mutant that results in death
Result: Only ~20-30% genes can be mutated to lethality
Genetic and Molecular Genic Relationships
Why are there genes with no apparent function?
Gene may not be doing anything
Other genes may compensate for defect (redundancy)
Double mutant analysis often provides evidence for
this explanation
Common for highly-related genes to be (at least
partially) redundant
Defect may be too subtle to detect
Proper assay not used
Need proper ecological setting and evolutionarilyrelevant time span to detect
May be conditional
CNS Midline Cell Development and Transcription Requires
Single-minded Function
Wild-type
Cell division
Cell morphology
Gene expression
sim
Ubiquitously-Expressed Sim Transforms Entire CNS into
CNS Midline Cells
Heat shock-sim
Rhomboid-lacZ
Uninduced
Induced
a-LacZ
Gene Regulation
Regulatory proteins > DNA cis-control elements
Positive and negative regulation
Combinatorial regulation > highly specific patterns of spatial,
temporal and quantitative expression
Murine transthyretin gene
Sim:Tgo Binding Sites (CNS Midline Elements - CMEs)
are Required for Midline Transcription
0.95 kb Toll-lacZ
1
2 3
4
X
X X X
CME > ACGTG
a-LacZ
Array Analysis of Gene Expression: Drosophila
Understand complete array of gene regulatory events that
underlie:
Development
Tissue and cell identity
Aging
Behavior
Circadian rhythms
Learning and memory
Example: Single-minded (Sim): Master Regulator of CNS
Midline Cell Development and Transcription
Sim protein (green) > CNS midline cells
Vnd protein (red) > lateral CNS
Array Analysis of Gene Expression
Midline gene expression program > identify all genes
expressed in midline cells
Study: function and regulation
Approaches:
Purify midline cells (GFP) > compare to other cell types
and developmental time intervals
Mutant (sim) vs. wild-type
Misexpression of sim vs. wild-type
Transgenes – express in entire CNS
Genetics – snail mutant > express in entire mesoderm
Midline and Lateral CNS GFP Lines
sim-GFP
vnd-GFP
Dissociate embryonic cells > FACS
Compare expression at different stages and to other cell types
Results: midline-specific transcripts high in midline cells when compared
to levels in other tissues
Fluorescence Activated Cell Sorter (FACS)
Allows isolation of
fluorescently-labeled
(GFP+) cells
Array Analysis of Gene Expression
Midline gene expression program > identify all genes
expressed in midline cells
Study: function and regulation
Approaches:
Purify midline cells (GFP) > compare to other cell types
and developmental time intervals
Mutant (sim) vs. wild-type
Misexpression of sim vs. wild-type
Transgenes – express in entire CNS
Genetics – snail mutant > express in entire mesoderm
Comparison of Wild-type to sim Mutant Embryos
Wild-type
sim
Results: Expect to see midline gene expression reduced in sim mutant
Array Analysis of Gene Expression
Midline gene expression program > identify all genes
expressed in midline cells
Study: function and regulation
Approaches:
Purify midline cells (GFP) > compare to other cell types
and developmental time intervals
Mutant (sim) vs. wild-type
Misexpression of sim vs. wild-type
Transgenes – express in entire CNS
Genetics – snail mutant > express in entire mesoderm
Analysis of Midline Transcription by Ectopic Sim
Expression: Transgenic Approaches
Wild-type
sca-Gal4 X UAS-sim-GFP
a-Wrapper
GFP
a-Wrapper
Result: Expect to see midline gene expression increased in sca-Gal4 X UAS-sim-GFP
Analysis of Midline Transcription by Ectopic Sim
Expression: Genetic Approaches
Wild-type
sim RNA
localization
snail
Result: Expect to see midline gene expression increased in snail mutant
Cluster Analysis of Combined Data Sets
Compare different data sets
Midline genes
Test by in situ hybridization for midline expression
Array Analysis of Mesoderm Gene Expression
Mesoderm
Somatic muscles
Visceral muscles
Fat body, hemocytes
twist gene
Encodes transcription factor required for mesodermal
gene expression
twist mutant – no mesoderm or mesodermal gene
expression
twist overexpression (Toll10B mutation) – excess
mesoderm and mesodermal gene expression
Twist Mutant and Overexpression Phenotypes
Mutant Embryo Purification
twist is embryonic lethal mutation
twi / + X twi / + only 25% embryos are mutant (twi / twi)
Use GFP-CyO chromosome and sort mutant embryos
GFP-CyO / twi
GFP-CyO / GFP-Cyo
twi / twi
Mutant Sorting
GFP-labeled organisms
Hand sort with fluorescence
microscope
Machine sort
Array Analysis: Clustering
Confirm expected expression pattern by in situ hybridization