Chi-Squared Distribution - Erwin Sitompul
Download
Report
Transcript Chi-Squared Distribution - Erwin Sitompul
Probability and Statistics
Lecture 9
Dr.-Ing. Erwin Sitompul
President University
http://zitompul.wordpress.com
2 0 1 3
President University
Erwin Sitompul
PBST 9/1
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
The probabilities associated with binomial experiments are readily
obtainable from the formula b(x;n, p) of the binomial distribution
or from the table when n is small.
For large n, making the distribution table is not practical anymore.
Nevertheless, the binomial distribution can be nicely approximated
by the normal distribution under certain circumstances.
President University
Erwin Sitompul
PBST 9/2
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
If X is a binomial random variable with mean μ = np and variance
σ2 = npq, then the limiting form of the distribution of
Z
X np
npq
as n ∞, is the standard normal distribution n(z;0, 1).
Normal approximation of b(x; 15, 0.4)
Each value of b(x; 15, 0.4) is
approximated by P(x–0.5 < X < x+0.5)
President University
Erwin Sitompul
PBST 9/3
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
P( X 4) b(4;15, 0.4)
15 C4 (0.4) 4 (0.6)11
0.1268
P( X 4) P(3.5 X 4.5)
P(1.32 Z 0.79)
0.1214
Normal approximation of
9
b(4;15, 0.4) and
b( x;15, 0.4)
x 7
P(7 X 9) b( x;15, 0.4)
x 7
0.3564
np (15)(0.4) 6
npq (15)(0.4)(0.6) 1.897
President University
9
Erwin Sitompul
P(7 X 9) P(6.5 X 9.5)
P(0.26 Z 1.85)
0.3652
PBST 9/4
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
The degree of accuracy, that is how well the normal curve fits the
binomial histogram, will increase as n increases.
If the value of n is small and p is not very close to 1/2, normal
curve will not fit the histogram well, as shown below.
b( x; 6, 0.2)
b( x;15, 0.2)
The approximation using normal curve will be excellent when n is
large or n is small with p reasonably close to 1/2.
As rule of thumb, if both np and nq are greater than or equal to 5,
the approximation will be good.
President University
Erwin Sitompul
PBST 9/5
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
Let X be a binomial random variable with parameters n and p. For
large n, X has approximately a normal distribution with μ = np and
σ2 = npq = np(1–p) and
x
P( X x) b(k ; n, p)
k 0
area under normal curve to the left of x 0.5
P ( X x 0.5)
( x 0.5)
PZ
and the approximation will be good if np and nq = n(1–p) are
greater than or equal to 5.
President University
Erwin Sitompul
PBST 9/6
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
The probability that a patient recovers from a rare blood disease is
0.4. If 100 people are known to have contracted this disease, what is
the probability that less than 30 survive?
np (100)(0.4) 40
n 100, p 0.4
29
P( X 30) b( x;100, 0.4)
npq (100)(0.4)(0.6) 4.899
x 0
P( X 30) P( X 29.5)
z
29.5 40
2.143
4.899
P ( Z 2.143)
0.01608
After interpolation
1.608%
Can you calculate the
exact solution?
President University
Erwin Sitompul
PBST 9/7
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
A multiple-choice quiz has 200 questions each with 4 possible
answers of which only 1 is the correct answer. What is the probability
that sheer guess-work yields from 25 to 30 correct answers for 80 of
the 200 problems about which the student has no knowledge?
n 80, p
1
4
np (80)( 14 ) 20
npq (80)( 14 )( 34 ) 3.873
z1
P(25 X 30)
30
b( x;80,
x 25
1
4
24.5 20
30.5 20
1.162, z2
2.711
3.873
3.873
)
P(24.5 X 30.5)
P(1.162 Z 2.711)
P( Z 2.711) P( Z 1.162)
0.9966 0.8774
0.1192
President University
Erwin Sitompul
PBST 9/8
Chapter 6.5
Normal Approximation to the Binomial
Normal Approximation to the Binomial
PU Physics entrance exam consists of 30 multiple-choice questions
each with 4 possible answers of which only 1 is the correct answer.
What is the probability that a prospective students will obtain
scholarship by correctly answering at least 80% of the questions just
by guessing?
n 30, p
1
4
np (30)( 14 ) 7.5
npq (30)( 14 )( 43 ) 2.372
P( X 24)
30
b( x;30,
x 24
1
4
)
z
23.5 7.5
6.745
2.372
1 P( X 23.5)
1 P( Z 6.745)
0
It is practically impossible to
get scholarship just by pure
luck in the entrance exam
President University
Erwin Sitompul
PBST 9/9
Chapter 6.6
Gamma and Exponential Distributions
Gamma and Exponential Distributions
There are still numerous situations that the normal distribution
cannot cover. For such situations, different types of density
functions are required.
Two such density functions are the gamma and exponential
distributions.
Both distributions find applications in queuing theory and reliability
problems.
The gamma function is defined by
( ) x 1e x dx
for α > 0.
0
President University
Erwin Sitompul
PBST 9/10
Chapter 6.6
Gamma and Exponential Distributions
Gamma and Exponential Distributions
|Gamma Distribution| The continuous random variable X has a
gamma distribution, with parameters α and β, if its density
function is given by
1
1 x
x
e
, x0
( )
f ( x)
elsewhere
0,
where α > 0 and β > 0.
|Exponential Distribution| The continuous random variable X
has an exponential distribution, with parameter β, if its density
function is given by
1 x
, x0
e
f ( x)
elsewhere
0,
where β > 0.
President University
Erwin Sitompul
PBST 9/11
Chapter 6.6
Gamma and Exponential Distributions
Gamma and Exponential Distributions
Gamma distributions for certain values of
the parameters α and β
The gamma distribution with α = 1 is called
the exponential distribution
President University
Erwin Sitompul
PBST 9/12
Chapter 6.6
Gamma and Exponential Distributions
Gamma and Exponential Distributions
The mean and variance of the gamma distribution are
and
2 2
The mean and variance of the exponential distribution are
and
President University
2 2
Erwin Sitompul
PBST 9/13
Chapter 6.7
Applications of the Gamma and Exponential Distributions
Applications of Gamma and Exponential Distributions
Suppose that a system contains a certain type of component whose
time in years to failure is given by T. The random variable T is
modeled nicely by the exponential distribution with mean time to
failure β = 5.
If 5 of these components are installed in different systems, what is
the probability that at least 2 are still functioning at the end of 8
years?
1
P(T 8) et 5 dt
58
5
P( X 2) b( x;5, 0.2)
x2
1
1 b( x;5, 0.2)
e 8 5 0.2
x 0
1 0.7373
The probability whether
the component is still
functioning at the end of 8
years
0.2627
The probability whether at
least 2 out of 5 such
component are still
functioning at the end of 8
years
President University
Erwin Sitompul
PBST 9/14
Chapter 6.7
Applications of the Gamma and Exponential Distributions
Applications of Gamma and Exponential Distributions
Suppose that telephone calls arriving at a particular switchboard
follow a Poisson process with an average of 5 calls coming per
minute.
What is the probability that up to a minute will elapse until 2 calls
have come in to the switchboard?
1 5, 2
x
P( X x )
1
2
0
xe x dx
β is the mean time of the
event of calling
α is the quantity of the
event of calling
1
P( X 1) 25 xe5 x dx 1 e5(1) (1 5) 0.96
0
President University
Erwin Sitompul
PBST 9/15
Chapter 6.7
Applications of the Gamma and Exponential Distributions
Applications of Gamma and Exponential Distributions
Based on extensive testing, it is determined that the average of time
Y before a washing machine requires a major repair is 4 years. This
time is known to be able to be modeled nicely using exponential
function. The machine is considered a bargain if it is unlikely to
require a major repair before the sixth year.
(a) Determine the probability that it can survive without major repair
until more than 6 years.
(b) What is the probability that a major repair occurs in the first
year?
(a)
1
P(Y 6) et 4 dt e6 4 0.223
46
(b)
1
P(Y 1) 1 e t 4 dt 1 e1 4 0.221
41
Only 22.3% survives until
more than 6 years without
major reparation
1
22.1% will need major
reparation after used for 1
year
1
et 4 dt
40
President University
Erwin Sitompul
PBST 9/16
Chapter 6.8
Chi-Squared Distribution
Chi-Squared Distribution
Another very important special case of the gamma distribution is
obtained by letting α = v/2 and β = 2, where v is a positive
integer.
The result is called the chi-squared distribution, with a single
parameter v called the degrees of freedom.
The chi-squared distribution plays a vital role in statistical
inference. It has considerable application in both methodology and
theory.
Many chapters ahead of us will contain important applications of
this distribution.
President University
Erwin Sitompul
PBST 9/17
Chapter 6.8
Chi-Squared Distribution
Chi-Squared Distribution
|Chi-Squared Distribution| The continuous random variable X
has a chi-squared distribution, with v degrees of freedom, if its
density function is given by
1
v 2 1 x
x
e
, x0
2v 2 (v 2)
f ( x)
elsewhere
0,
where v is a positive integer.
The mean and variance of the chi-squared distribution are
and
v
2 2v
President University
Erwin Sitompul
PBST 9/18
Chapter 6.9
Lognormal Distribution
Lognormal Distribution
The lognormal distribution is used for a wide variety of
applications.
The distribution applies in cases where a natural log
transformation results in a normal distribution.
President University
Erwin Sitompul
PBST 9/19
Chapter 6.9
Lognormal Distribution
Lognormal Distribution
|Lognormal Distribution| The continuous random variable X has
a lognormal distribution if the random variable Y = ln(X) has a
normal distribution with mean μ and standard deviation σ. The
resulting density function of X is
2
1
ln( x )
2 x e
f ( x)
0,
(2 2 )
, x0
x0
The mean and variance of the chi-squared distribution are
E( X ) e
2 2
President University
and
Var( X ) e
2 2
Erwin Sitompul
2
(e 1)
PBST 9/20
Chapter 6.9
Lognormal Distribution
Lognormal Distribution
Concentration of pollutants produced by chemical plants historically
are known to exhibit behavior that resembles a log normal
distribution. This is important when one considers issues regarding
compliance to government regulations.
Suppose it is assumed that the concentration of a certain pollutant,
in parts per million, has a lognormal distribution with parameters μ =
3.2 and σ = 1.
What is the probability that the concentration exceeds 8 parts per
million?
P( X 8) 1 P( X 8)
ln(8) 3.2
P( X 8) F
F (1.12) 0.1314
1
F denotes the cumulative distribution
function of the standard normal distribution
a. k. a. the area under the normal curve
President University
Erwin Sitompul
PBST 9/21
Probability and Statistics
Homework 8A
1. (a) Suppose that a sample of 1600 tires of the same type are obtained at
random from an ongoing production process in which 8% of all such tires
produced are defective. What is the probability that in such sample 150
or fewer tires will be defective?
(Sou18. CD6-13)
(b) If 10% of men are bald, what is the probability that more than 100 in
a random sample of 818 men are bald?
President University
Erwin Sitompul
PBST 9/22