Descriptive Indices - National University of Singapore
Download
Report
Transcript Descriptive Indices - National University of Singapore
Equivalent ly, P( X x, Y y) P( X x) P(Y y)
E ( X ) is also called thepopulationmean of X and
can be interprete
d as thelong- run averageof X ,
i.e., if we samplerepeatedlyand independently
accordingto thedistribution p( x) to obtain
X 1 , X 2 ,..., X n , then thesamplemean
X n 1n ( X1 ... X n ) E( X ) as n increases.
Furthermore,
4. E ( XY ) E ( X ) E (Y ) if X and Y are independent.
As in thecase of themean,
thesample variance S var(X )
as thesamplesize increases.
2
Standarddeviation: SD( X ) var(X )
Topic 7
Binomial and Poisson
Distribution
Binomial Probability Distribution
Consider a sequence of independent events with
only two possible outcomes called success (S) and
failure (F)
Example: outcome of treatment (cured/not cured)
opinion (yes/no) S=yes, F=no
gender (boy/girl) S=boy, F=girl
Let p be the probability of S
Consider n number of such independent events.
Then the total no of success out of n such events is
a random variable called Binomial r.v.
Binomial Probability Distribution
Let p = probability of having a boy
q = probability of having a girl
Assume the outcome of the first child (boy or girl)
does not affect the outcome of the second and
subsequent children i.e. events are independent ,
we can multiply the probabilities together to get
Pr(BBB) = p x p x p = Pr(3 boys)
Pr(GGG) = q x q x q = Pr(No boy)
More generally, there are
n
n!
k k!( n k )!
ways of ordering k B’s and n-k G’s. So
Pr (k boys out of n children)
n
Pr( B
...BG...G )
k
k
nk
n k
nk
p (1 p )
k
Binomial Probability
For a 3-child family,
1 boy = BGG or GBG or GGB
Pr(1 boy) = Pr(BGG)+Pr(GBG)+Pr(GGB)
= pqq + qpq +qqp
= 3pqq
Similarly,
Pr(2 boys) = Pr(BBG)+Pr(BGB)+Pr(GBB)
= 3ppq
Example 1
Choose a group of 7 old individuals
randomly from the population of 65-74
years old in US. Suppose 12.5% of the
population in that age is diabetic. The total
no. of persons out of the 7 selected who
suffers from Diabetes has a binomial
distribution.
Let X = # diabetic
Binomial( n = 7, p = 0.125)
7
P[ X 2] 0.1252 (1 0.125) 5
2
0.1683
Example 2
Past records indicate that 70% of patients responded
to treatment. What is the probability that 16 out of
the next 20 patients will respond to treatment?
# patients responding ~ Bin( n = 20, p = 0.7)
P(16 out of 20 responded)
20
0.716 (1 0.7) 4 = 0.1304
16
Poisson distribution:
Consider a binomial random variable Y with n very
large and p small and np is moderate equal to . Then the
probabilities can be approximated by what is called a
Poisson random variable.
n k
e k
p (1 p) n k
k!
k
Y ~ Poisson ( ) if
e k
P (Y k )
k!
for k 0,1,2,...
is called the rate.
Often used to model count data such as
a) # industrial accidents in a plant per month
b) # chromosome interchanges within cells
c) # insurance claims
The number of deaths attributable to typhoid
fever follows a Poisson distribution at a rate of
4.6 deaths per year
X = # deaths in 1 year ~ Poisson(4.6)
P( X 2) e 4.6
4.6 2
0.1063
2!
Y = # deaths in 6 months ~ Poisson(2.3)
P(Y 2) e 2.3
2.3 2
0.265
2!