Lecture 2: Biology Review II
Download
Report
Transcript Lecture 2: Biology Review II
Lecture 2: Biology Review II
Date: 8/29/02
Overview/Review of:
Mapping
Molecular techniques
Markers
Genetic Mapping
Definition: A genetic map is an ordering of
genes and markers in a linear arrangement
corresponding to their physical order along
the chromosome. Based on linkage.
Definition: A physical map is an ordering of
landmarks on DNA, regardless of
inheritance. Measured in base pairs.
Marker
Definition: A marker is a gene or piece of
DNA with easily identified phenotype such
that cells or individuals with different alleles
are distinguishable.
e.g. a gene with known function
e.g. a single nucleotide change in DNA
Polymorphism
Definition: A polymorphism is a detectable
and heritable variation at a locus.
Definition: A marker is polymorphic if the
most abundant allele comprises less than X%
of all alleles, usually 95%.
Definition: A mapping population is a
population used to map genes.
Natural Populations
Definition: Natural populations are those
where mating is not controlled by the
experimenter, though the experimenter can
choose who to observe.
Only phenotype observable, genotype
sometimes unknown, phase is unknown.
Knowns: allele frequencies, genotype
frequencies, amount of disequilibrium.
Hardy-Weinberg Equilibrium I
Refers to the equilibrium achieved at a single
locus.
Hardy-Weinberg Equilibrium (HWE) is
achieved when the allele frequencies and
genotype frequencies do not change from
generation to generation.
Hardy-Weinberg Equilibrium II
Let pA and pB be the frequencies of allele A and B in
the population. Let pAA be the frequency of
genotype AA. Similarly, pAB and pBB are genotype
frequencies.
Then HWE implies that
pAA = pA2
pAB = 2pApB
pBB = pB2
Measures of Polymorphism
l
H 1 p i2
P(heterozygote) =
i 1
Definition: Polymorphism Information
Content (PIC)
l
l
l
PIC 1 pi2 2 pi2 p 2j
i 1
i 1 j i 1
Uninformative Matings
AB X AB
½ are informative
1 AA : 2 AB : 1 BB
uninformative
informative
Classical Linkage Analysis
A few markers.
Must have detectable variation.
Must be substantially variable in study
population.
Controlled crosses: testcross, backcross,
double- haploid
Well-defined parental lines.
Three-Point Testcross Design
dominant
recessive
testcross
X
X
F1
A A
b b
S S X t t
Y Y
z z
A
S
Y
F2
b
b b
t X t t
z
z z
b b
t S
z z
Three-Point Testcross Results
Count the number of recombinant haplotypes
produced by F1 parent. Calculate the
recombinant fraction for each pair of genes.
1
2
3
1
--
0.19
0.03
2
--
--
0.15
3
--
--
--
Map for Three-Point Testcross
3
1
2
0.03
0.15
0.19
Backcross Design
new recombinant
self
self
F2
no more changes
Large-Scale Mapping
Many genetic markers
Steps of analysis:
pairwise linkage analysis
group into linkage groups
order markers in each linkage group
Comparative Mapping
Compare maps of different species.
Due to similarities, information can be
transferred between species.
Information about how genomes evolve.
Uses conserved loci rather than highly
variable loci.
Molecular Techniques: probes
5’ – …AAGCCTAGAGCCCTTAGCCAAAAG… – 3’
3’ – …TTCGGATCTCGGGAATCGGTTTTC… – 5’
denature
add probe 3’ – *ATCTCGGGAATC – 5’
hybridization
5’ – …AAGCCTAGAGCCCTTAGCCAAAAG… – 3’
*ATCTCGGGAATC
Molecular Techniques:
restriction enzymes
Definition: An endonuclease is an enzyme (protein that acts
as a catalyst to speed up the rate of a biochemical reaction) that
cleaves nucleic acid strands at internal sites (phosphodiester
bond).
Definition: A restriction endonuclease is an enzyme that cuts
DNA at specific sites that it recognizes.
EcoRI
5’ GAATTC 3’
3’ CTTAAG 5’
number of cut
sites = N/4b
Molecular Techniques: gel
electrophoresis
DNA is negatively charged. Proteins can
also be charged.
An electric current is passed through a
porous medium (agarose, acrylamide) and
molecules in the medium respond by moving
in electric field, but at different rates based
on size and charge.
Electrophoretic Gel
Molecular Technique: PCR I
5’
5’
Denaturation and hybridization
5’
5’
Elongation & denaturation
5’
5’
Molecular Technique: PCR II
Physical Maps
Banding patterns on chromosomes
In-situ hybridization
Denature metaphase chromosomes
Add radioactive or fluorescent probe
Visualize chromosomes
DNA fragmentation
DNA sequence: still not practical for all
organisms
DNA Fragmentation
• Larger fragments better (rare cutters; partial digestion)
• Find overlap by sequencing or hybridization.
DNA Vector I
Definition: A cloning vector is a DNA
molecule that is capable of self-replicating.
Insert the fragment of foreign DNA to make
recombinant DNA.
DNA Vector II
phage: virus that infects bacteria (5-25 kb).
cosmid: Packaged in lambda phage and infects E.
coli (35-45 kb).
yeast artificial chromosome (YAC): has telemere,
centromere, and replication origin (200-2000 kb).
bacterial artificial chromosome (BAC)
plasmid: extrachromosomal circular DNA
nonessential for cell survival.
How Many Clones?
Let N be the number of clones made.
Let NS be the number of nonoverlapping clones
needed to cover the full genome.
NS 1
Pclone not in library
NS
log 1 P
N
log 1 1
NS
N
More: M. S. Waterman
Introduction to
Computational Biology:
Maps, Sequences, and
Genomes
Genetic Mapping Still Needed
Even if the full sequence is known, mapping
is still necessary.
There must be some way to correlate a
trait/phenotype with something on the
sequence.
Physical Mapping Still Needed
Linkage maps lack resolution
Sample more people
Better statistics
Let recombination accumulate over many
generations.
Even with most precise linkage map can
identify a gene to 1 cM (1 Mb in humans).
Morphological Markers
Differences in shape, color, size, etc.
Must have one-to-one correspondence with a
controlling gene.
Protein Markers
Definition: An isozyme are proteins with
same enzymatic function but different
structural, chemical, or immunological
characteristics.
Differences: amino acid composition, size,
modifications (e.g. phosphorylation).
Differences visualized: gel electrophoresis,
mass spectrometry, etc.
DNA Marker: RFLP I
Definition: RFLP is Restriction Fragment Length
Polymorphism.
DNA digested with endonuclease.
Separate fragments by electrophoresis.
Denature strands.
Transfer single-stranded DNA to durable membrane
and immobilize (Southern blot).
Hybridize labeled probe to the blot.
Visualize probe.
DNA Marker: RFLP II
DNA polymorphisms that RFLP identifies:
mutation in the restriction site
mutation elsewhere to create restriction site
insertion/deletion of DNA
RFLP markers are codominant
Mini- and Micro-Satellite
Markers
Definition: minisatellites or VNTR (Variable
Number of Tandem Repeats) are tandem
repeates of sequences 9-100 bp long.
Detected by hybridization or PCR.
Definition: microsatellites or SSR (Simple
Sequence Repeat) are direct tandem repeated
sequences of DNA of 1-6 bp.
STS and EST
Definition: Sequence tagged sites (STS) is a
short unique fragment of DNA.
Definition: Expressed sequence tags (EST)
are subsets of STSs from cDNA clones.
Represent transcribed genes (e.g. usually
proteins).
Single-Strand Conformational
Polymorphism (SSCP)
Detects changes as small as 1 nucleotide in
more than 1000 bp.
Single-stranded DNA is electrophoresed on
gel and migrates based on size and shape.
Visualized by Southern blot with specific
fragment probe or PCR specific fragment and
visualize directly.
Random Amplified
Polymorphic DNA (RAPD)
PCR with short probes that bind randomly to
sites in the genome.
Good for genomes where little sequence
information is available.
Band-present is dominant.
Expected number of products = 2fN/16b
Amplified Fragment Length
Polymorphism (AFLP)
Cut DNA with frequent- and rare-cutting
endonuclease
Anneal adapters to the ends of the frequentcutter cut sites.
Amplify off adapters with PCR. Use various
specific primers to amplify subsets of total.
Visualize on denaturing polyacrylamide gel.
Choosing Markers
High polymorphism.
Clear interpretation.
Quick typing and easy automation.
Personal preference.