Neutral Theory
Download
Report
Transcript Neutral Theory
Lecture 20: Introduction to Neutral
Theory
November 5, 2012
Announcements
Classes related to Population
Genetics/Genomics next semester:
BIOL493S
SPTP: Next Generation Biology
CRN 18190, 1 credit, Tues 13:00-13:50
BIOL321
Genomics
2 credits
required,
Total Science Experience Lab:
Module, CRN 18084. W 13:00-15:50
(capstone) (special permission
limit 12 students)
Last Time
Mutation introduction
Mutation-reversion equilibrium
Mutation and selection
Mutation and drift
Today
Introduction to neutral theory
Molecular clock
Expectations for allele frequency
distributions under neutral theory
Classical-Balance
Fisher focused on the dynamics of allelic forms of
genes, importance of selection in determining
variation: argued that selection would quickly
homogenize populations (Classical view)
Wright focused more on processes of genetic drift
and gene flow, argued that diversity was likely to be
quite high (Balance view)
Problem: no way to accurately assess level of
genetic variation in populations! Morphological traits
hide variation, or exaggerate it.
Molecular Markers
Emergence of enzyme electrophoresis in mid 1960’s
revolutionized population genetics
Revealed unexpectedly high levels of genetic
variation in natural populations
Classical school was wrong: purifying selection does
not predominate
Initially tried to explain with Balancing Selection
Deleterious homozygotes create too much fitness
burden
i 1 s1 p s2q
2
2
i
m
for m loci
The rise of Neutral Theory
Abundant genetic variation exists, but perhaps not
driven by balancing or diversifying selection:
selectionists find a new foe: Neutralists!
Neutral Theory (1968): most genetic mutations are
neutral with respect to each other
Deleterious mutations quickly eliminated
Advantageous mutations extremely rare
Most observed variation is selectively neutral
Drift predominates when s<1/(2N)
Infinite Alleles Model (Crow and Kimura Model)
Each mutation creates a completely new allele
Alleles are lost by drift and gained by mutation: a
balance occurs
Is this realistic?
Average human protein contains about 300 amino acids
(900 nucleotides)
Number of possible mutant forms of a gene:
n4
900
7.14x10
542
If all mutations are equally probable, what is
the chance of getting same mutation twice?
Infinite Alleles Model (IAM: Crow and Kimura
Model)
Homozygosity will be a function of mutation and
probability of fixation of new mutants
1
1
2
ft
(1
) f t 1 (1 )
2Ne
2Ne
Probability of
Probability of
sampling same allele
sampling two alleles
twice
identical by descent
due to inbreeding in
ancestors
Probability neither
allele mutates
Expected Heterozygosity with Mutation-Drift
Equilibrium under IAM
1
1
ft
(1
) f t 1 (1 ) 2
2Ne
2Ne
At equilibrium ft = ft-1=feq
Previous equation reduces to:
Ignoring μ2
1 2
f eq
4 N e 1 2
Ignoring 2μ
1
f eq
4Ne 1
Remembering that H=1-f:
4Ne
He
4Ne 1
4Neμ is called the
population mutation rate,
also referred to as θ
Expected Heterozygosity with Mutation-Drift
Equilibrium under IAM
At equilibrium:
1
1
fe
4Ne 1 1
set 4Neμ = θ
Remembering that H = 1-f:
He
1
Equilibrium Heterozygosity under IAM
4N em
q
He =
=
4N em +1 q +1
Frequencies of individual
alleles are constantly
changing
Balance between loss and
gain is maintained
4Neμ>>1: mutation
predominates, new
mutants persist, H is
high
2
Fraser et al. 2004 PNAS 102: 1968
4Neμ<<1: drift
dominates: new mutants
quickly eliminated, H is
low
Stepwise Mutation Model
Do all loci conform to Infinite Alleles Model?
Are mutations from one state to another equally
probable?
Consider microsatellite loci: small insertions/deletions
more likely than large ones?
SMM:
1
He 1
(8 N e 1)
IAM:
4Ne
He
4Ne 1
Which should have higher produce He,the
Infinite Alleles Model, or the Stepwise
Mutation Model, given equal Ne and μ?
SMM:
1
He 1
(8 N e 1)
IAM:
4Ne
He
4Ne 1
Plug numbers into the equations to see how
they behave.
e.g, for Neμ = 1, He = 0.66 for SMM and 0.8 for
IAM
Expected Heterozygosity Under Neutrality
Direct assessment of
neutral theory based on
expected heterozygosity
if neutrality
predominates (based on
a given mutation model)
Allozymes show lower
heterozygosity than
expected under strict
neutrality
Why?
He
1
Observed
Avise 2004
Neutral Expectations and Microsatellite Evolution
Comparison of Neμ (Θ) for
216 microsatellites on
human X chromosome
versus 5048 autosomal loci
Only 3 X chromosomes for
every 4 autosomes in the
population
Why is Θ higher for
Ne of Xautosomes?
expected to be 25%
less than Ne of autosomes:
θX/θA=0.75
Observed ratio of ΘX/ΘA
was 0.8 for Infinite
Alleles Model and 0.71 for
Stepwise model
Autosomes
X
X chromosome
Correct model for
microsatellite evolution
is a combination of
IAM and Stepwise
Sequence Evolution
DNA or protein sequences in different taxa trace
back to a common ancestral sequence
Divergence of neutral loci is a function of the
combination of mutation and fixation by genetic
drift
Sequence differences are an index of time since
divergence
Molecular Clock
If neutrality prevails, nucleotide divergence between two sequences
should be a function entirely of mutation rate
1
k = 2N m
=m
2N
Probability of
creation of new
alleles
Probability of
fixation of new
alleles
Time since divergence should therefore be the reciprocal of the
estimated mutation rate
Expected Time Until Fixation of a New Mutation:
t
1
Since μ is number of
substitutions per unit time
Variation in Molecular Clock
If neutrality prevails, nucleotide divergence between two sequences
should be a function entirely of mutation rate
So why are rates of substitution so different for different classes
of genes?
The main power of neutral theory is it provides a
theoretical expectation for genetic variation in
the absence of selection.
Fate of Alleles in Mutation-Drift Balance
Generations from
birth to fixation
Time between
fixation events
Time to fixation of a new mutation is much longer than
time to loss
Fate of Alleles in Mutation-Drift-Selection
Balance
Purifying Selection
Which case will have the
most alleles
Highest
on H
average
at
E?
What will this depend
upon?
any given time?
Neutrality
Balancing
Selection/Overdomina
nce
Assume you take a sample of 100 alleles from a
large (but finite) population in mutation-drift
equilibrium.
What is the expected distribution of allele
frequencies in your sample under neutrality and
the Infinite Alleles Model?
Number of Alleles
A.
B.
C.
10
8
6
4
2
2
4
6
8
10
2
4
6
8
10
Number of Observations of Allele
2
4
6
8
10
Allele Frequency Distributions
Black: Predicted from Neutral
Theory
White: Observed (hypothetical)
Neutral theory allows a
prediction of frequency
distribution of alleles
through process of birth
and demise of alleles
through time
Comparison of observed to
expected distribution
provides evidence of
departure from Infinite
Alleles model
Hartl and Clark 2007
Depends on f, effective
population size, and
mutation rate
Ewens Sampling Formula
Population mutation rate: index of variability of population:
4 Ne
i
Probability the i-th sampled allele is new given i alleles already sampled:
Probability of sampling a new allele on the first sample:
0
Probability of observing a new allele after sampling one allele:
.
1
1
Probability of sampling a new allele on the third and fourth samples:
Expected number of different alleles (k) in a sample of 2N alleles is:
E (k )
2 N 1
i
1
i 0
1 2
...
2N 1
Example: Expected number of alleles in a sample of 4:
E (k )
2 N 1
i
i 0
3
i 0
i
1
1 2 3
He
2
3