P05b-estimate_6
Download
Report
Transcript P05b-estimate_6
General Business 704
Estimation:
Confidence Intervals
Based in part on Chapter 6
Objectives:
Estimation
Distinguish point & interval estimates
Explain interval estimates
Compute confidence interval estimates
Population mean & proportion
Population total & difference
Determine necessary sample size
Thinking Challenge
Suppose you’re
interested in the
average amount of
money that students
in this class (the
population) have in
their possession.
How would you find
out?
Statistical
Methods
Statistical
Methods
Descriptive
Statistics
Inferential
Statistics
Estimation
Hypothesis
Testing
Estimation
Process
Population
Mean, , is
unknown
Sample
Random Sample
Mean
X = 50
I am 95%
confident that
is between
40 & 60.
Population Parameter
Estimates
Estimate population
parameter...
Mean
with sample
statistic
x
Proportion
p
ps
Variance
2
Differences
2
12
s
x1 -x2
Estimation
Methods
Estimation
Point
Estimation
Interval
Estimation
Confidence
Interval
Bootstrapping
Estimation
Methods
Estimation
Point
Estimation
Interval
Estimation
Confidence
Interval
Bootstrapping
Point Estimation
Provides single value
Based on observations from 1 sample
Gives no information about how
close value is to the unknown
population parameter
Example: Sample meanX = 3 is
point estimate of unknown
population mean
Estimation
Methods
Estimation
Point
Estimation
Interval
Estimation
Confidence
Interval
Bootstrapping
Interval Estimation
Provides range of values
Gives information about closeness to
unknown population parameter
Based on observations from 1 sample
Stated in terms of probability
Example: Unknown population mean
lies between 40 & 60 with 95%
confidence
Key Elements of
Interval Estimation
A probability that the population parameter
falls somewhere within the interval.
Confidence
interval
Confidence
limit (lower)
Sample statistic
(point estimate)
Confidence
limit (upper)
Confidence Limits
for Population Mean
Parameter =
Statistic ± Error
© 1984-1994
T/Maker Co.
(1)
X Error
(2)
Error X or X
X
(3)
Z
(4)
Error Z x
(5)
X Z x
x
Error
x
Many Samples Have
Same Interval
X = ± Zx
x_
-2.58x
-1.65x
-1.96x
+1.65x
+2.58x
+1.96x
90% Samples
95% Samples
99% Samples
X
Level of Confidence
Probability that the unknown
population parameter falls within
interval
Denoted (1 -
is probability that parameter is not
within interval
Typical values are 99%, 95%, 90%
Intervals &
Level of Confidence
Sampling
Distribution /2
of Mean
x_
1 -
/2
x =
_
X
(1 - ) % of
intervals
contain .
Intervals
extend from
X - ZX to
X + ZX
% do not.
Large number of intervals
Factors Affecting
Interval Width
Data dispersion
Sample size
Measured by
Intervals extend from
X - ZX toX + ZX
X = / n
Level of confidence
(1 - )
Affects Z
© 1984-1994 T/Maker Co.
Confidence
Interval Estimates
Confidence
Intervals
Mean
Known
Proportion
Unknown
Variance
Finite
Population
Confidence
Interval Estimates
Confidence
Intervals
Mean
Known
Proportion
Unknown
Variance
Finite
Population
Confidence Interval
Mean ( Known)
Assumptions
Population standard deviation is known
Population is normally distributed
If not normal, can be approximated by
normal distribution (n 30)
Confidence interval estimate
X Z / 2
n
X Z / 2
n
Note: 99% Z=2.58, 95% Z=1.96 , 90% Z=1.65
Estimation Example
Mean ( Known)
The mean of a random sample of n = 25
isX = 50. Set up a 95% confidence
interval estimate for if = 10.
X Z / 2
X Z / 2
n
n
10
10
50 1.96
50 1.96
25
25
46.08 53.92
Thinking Challenge
You’re a Q/C inspector for
Gallo. The for 2-liter
bottles is .05 liters. A
random sample of 100
bottles showedX = 1.99
liters. What is the 90%
confidence interval
estimate of the true mean
amount in 2-liter bottles?
2
liter
© 1984-1994 T/Maker Co.
Confidence Interval
Solution for Gallo
X Z / 2
n
X Z / 2
n
.05
.05
1.99 1.645
1.99 1.645
100
100
1.982 1.998
Confidence
Interval Estimates
Confidence
Intervals
Mean
s Known
Proportion
Unknown
Variance
Finite
Population
Confidence Interval
Mean ( Unknown)
Assumptions
Population standard deviation is
unknown
Population must be normally distributed
Use Student’s t distribution
Confidence interval estimate
S
S
X t / 2, n 1
X t / 2, n 1
n
n
Student’s t Distribution
Standard
Bellnormal
shaped
Symmetric
t (df = 13)
‘Fatter’ tails
t (df = 5)
0
Z
t
Note: As d.f. approach 120, Z and t become very similar
Student’s t Table
Upper Tail Area
df
.25
.10
Assume:
n=3
df = n - 1 = 2
= .10
/2 =.05
/2
.05
1 1.000 3.078 6.314
2 0.817 1.886 2.920
.05
3 0.765 1.638 2.353
0
t values
2.920
t
Degrees of Freedom
Number of observations that are free
to vary after sample statistic has
been calculated
degrees of freedom
Example
Sum of 3 numbers is 6
X1 = 1 (or any number)
X2 = 2 (or any number)
X3 = 3 (cannot vary)
Sum = 6
= n -1
= 3 -1
=2
Estimation Example
Mean ( Unknown)
A random sample of n = 25 hasX = 50
& S = 8. Set up a 95% confidence
interval estimate for .
S
X t / 2, n 1
X t / 2, n 1
n
8
50 2.0639
50 2.0639
25
46.69 53.30
S
n
8
25
Thinking Challenge
You’re a time study analyst
in manufacturing. You’ve
recorded the following
task times (min.): 3.6, 4.2,
4.0, 3.5, 3.8, 3.1.
What is the 90%
confidence interval
estimate of the population
mean task time?
Confidence Interval
Solution for Time Study
X = 3.7
S = 3.8987
n = 6, df = n - 1 = 6 - 1 = 5
S / n = 3.8987 / 6 = 1.592
t.05,5 = 2.0150
3.7 - (2.015)(1.592) 3.7 + (2.015)(1.592)
0.492 6.908
Confidence
Interval Estimates
Confidence
Intervals
Mean
Known
Proportion
Unknown
Variance
Finite
Population
Estimation for
Finite Populations
Assumptions
Sample is large relative to population
n / N > .05
Use finite population correction factor
Confidence interval (mean, unknown)
S
Nn
S
Nn
X t / 2, n 1
X t / 2, n 1
n N 1
n N 1
Confidence
Interval Estimates
Confidence
Intervals
Mean
Known
Proportion
Unknown
Variance
Finite
Population
Confidence Interval
Proportion
Assumptions
Two categorical outcomes
Population follows binomial distribution
Normal approximation can be used
n·p 5 & n·(1 - p) 5
Confidence interval estimate
ps (1 ps )
ps (1 ps )
ps Z
p ps Z
n
n
Estimation Example
Proportion
A random sample of 400 graduates
showed 32 went to grad school. Set
up a 95% confidence interval estimate
for p.
ps (1 ps )
ps (1 ps )
ps Z / 2
p ps Z / 2
n
n
.08 (1 .08)
.08 (1 .08)
.08 1.96
p .08 1.96
400
400
.053 p .107
Thinking Challenge
You’re a production
manager for a newspaper.
You want to find the %
defective. Of 200
newspapers, 35 had
defects. What is the
90% confidence interval
estimate of the population
proportion defective?
Confidence Interval
Solution for Defects
n·p 5
n·(1 - p) 5
ps (1 ps )
ps (1 ps )
ps Z / 2
p ps Z / 2
n
n
.175 (.825)
.175 (.825)
.175 1.645
p .175 1.645
200
200
.1308 p .2192
Estimation Methods
Estimation
Point
Estimation
Interval
Estimation
Confidence
Interval
Bootstrapping
Bootstrapping Method
Used if population is not normal
Requires significant computer power
Steps
Take initial sample
Sample repeatedly from initial sample
Compute sample statistic
Form resampling distribution
Limits are values that cut off smallest &
largest /2 %
Finding Sample Sizes
For Estimating
(1)
Z
X
x
Error
x
(2)
Error Z x Z
(3)
Z
n
Error 2
2
2
I don’t want to
sample too much
or too little!
n
Sample Size Example
What sample size is needed to be 90%
confident of being correct within 5?
A pilot study suggested that the
standard deviation is 45.
a f a f 219.2 220
af
2
1645
.
45
Z
n
2
2
Error
5
2
2
2
Thinking Challenge
You work in Human
Resources at Merrill Lynch.
You plan to survey employees
to find their average medical
expenses. You want to be
95% confident that the
sample mean is within ± $50.
A pilot study showed that
was about $400. What
sample size do you use?
Sample Size Solution
Medical Expenses
Z 2 2
n
2
Error
1.96f a
400f
a
a50f
2
2
2
245.86 246
Finding Sample Sizes For
Estimating Proportions
Z p(1 p)
n
2
Error
2
I don’t want to
sample too much
or too little!
Remember
•Error is acceptable error
•Z is based on confidence level chosen
•p is the true proportion of “success”
•Never under-estimate p
•When in doubt, use p=.5
Sample Size Example
for Estimating p
What sample size is needed to be 90%
confident (Z=1.645) of being correct
within proportion of .04 when using
p=.5 (since no useful estimate of p is
available)?
Z p(1 p) 1.645 (.5)(.5)
n
422
.
82
423
2
2
Error
.04
2
2
Estimation of
Population Total
In auditing, population total is more
important than mean
Total = NX
Confidence interval (population total)
S
Nn
S
Nn
NX t
Total NX t
n N 1
n N 1
Degrees of freedom = n - 1
Estimation
of Differences
Used to estimate the magnitude of
errors
Steps
Determine sample size
Compute average difference,D
Compute standard deviation of
differences
Set up confidence interval estimate
Estimation of
Differences Equations
Mean Difference:
Standard Deviation:
n
D
Di
i 1
n
n
sD
2
D
i nD
i 1
n1
Interval Estimate:
SD N n
SD N n
ND Nt
D ND Nt
n N 1
n N 1
Objectives:
Estimation
Distinguish point & interval estimates
Explain interval estimates
Compute confidence interval estimates
Population mean & proportion
Population total & difference
Determine necessary sample size