Confidence Intervals

Download Report

Transcript Confidence Intervals

Confidence
Intervals with
Means
What is the purpose of a
confidence interval?
To estimate an unknown
population parameter
Formula:
Standard
deviation of
Critical value
statistic
Confi dence Interval :
  

x  z * 
 n
statistic
Margin of error
In a randomized comparative experiment on
the effects of calcium on blood pressure,
researchers divided 54 healthy, white males
at random into two groups, taking calcium
or placebo. The paper reports a mean
seated systolic blood pressure of 114.9 with
standard deviation of 9.3 for the placebo
group. Assume systolic blood pressure is
normally distributed.
Can you find a z-interval for this problem?
Why or why not?
Student’s t- distribution
• Developed by William Gosset
• Continuous distribution
• Unimodal, symmetrical, bell-shaped
density curve
• Above the horizontal axis
• Area under the curve equals 1
• Based on degrees of freedom
df = n - 1
Graph examples of
t- curves vs standard normal curve
How does the t-distributions
compare to the standard
normal distribution?
• Shorter & more spread out
• More area under the tails
• As n increases, t-distributions
become more like a standard
normal distribution
Formula:
Standard
deviation of
Standard error
–
Critical value
statistic
when you
substitute s for .
Confidence Interval :
 s 

x  t * 
 n 
statistic
Margin of error
How to find t*
• Use
B for
t distributions
CanTable
also use
invT
on the calculator!
• Look up confidence level at bottom &
Need
t* value with 5% is above –
df onupper
the sides
• df = n – 1 so 95% is below
invT(p,df)
Find these t*
90% confidence when n = 5
95% confidence when n = 15
t* = 2.132
t* = 2.145
Steps for doing a confidence
interval:
1) Assumptions –
2) Calculate the interval
3) Write a statement about the interval
in the context of the problem.
Statement: (memorize!!)
We are ________% confident
that the true mean context is
between ______ and ______.
Assumptions for t-inference
• Have an SRS from population (or
randomly assigned treatments)
•  unknown
• Normal (or approx. normal) distribution
– Given
– Large sample size
– Check graph of data
Use only one of
these methods to
check normality
Ex. 1) Find a 95% confidence interval for the
true mean systolic blood pressure of the
placebo group.
Assumptions:
• Have randomly assigned males to treatment
• Systolic blood pressure is normally distributed
(given).
•  is unknown
 9.3 
114.9  2.056
  (111.22, 118.58)
 27 
We are 95% confident that the true mean systolic
blood pressure is between 111.22 and 118.58.
Ex. 2) A medical researcher measured
the pulse rate of a random sample of 20
adults and found a mean pulse rate of
72.69 beats per minute with a standard
deviation of 3.86 beats per minute.
Assume pulse rate is normally
distributed. Compute a 95% confidence
interval for the true mean pulse rates of
adults.
We are 95% confident that the
true mean pulse rate of adults is
between 70.883 & 74.497.
Ex 2 continued) Another medical
researcher claims that the true mean
pulse rate for adults is 72 beats per
minute. Does the evidence support or
refute this? Explain.
The 95% confidence interval
contains the claim of 72 beats
per minute. Therefore, there is
no evidence to doubt the
claim.
Ex. 3) Consumer Reports tested 14
randomly selected brands of vanilla
yogurt and found the following numbers
of calories per serving:
160 200 220 230 120 180 140
130 170 190 80 120 100 170
Compute a 98% confidence interval for
the average calorie content per serving
of vanilla yogurt.
We are 98% confident that the true mean
calorie content per serving of vanilla
yogurt is between 126.16 calories & 189.56
calories.
Ex 3 continued) A diet guide claims that
you will get 120 calories from a serving
Note: confidence intervals tell us
of vanilla yogurt. What does this
if something is NOT EQUAL –
evidence
indicate?
never less or greater than!
Since 120 calories is not contained
within the 98% confidence interval, the
evidence suggest that the average
calories per serving does not equal
120 calories.
CI & p-values deal with area in the tails
Robust
– is the area changed greatly when
there
is
skewness
• An inference procedure is ROBUST if
the confidence level or p-value doesn’t
change much if the normality
assumption is violated.
Since there is more area in the tails in tdistributions,can
then,
a distribution
has
• t-procedures
beif used
with some
some skewness,
tail area
not
skewness,
as long the
as there
areisno
greatly affected.
outliers.
• Larger n can have more skewness.
Find a sample size:
• If a certain margin of error is wanted,
then to find the sample size necessary
for that margin of error use:
 
m  z *

 n
Always round up to the nearest person!
Ex 4) The heights of PWSH male
students is normally distributed with
 = 2.5 inches. How large a sample
is necessary to be accurate within +
.75 inches with a 95% confidence
interval?
n = 43
Some Cautions:
• The data MUST be a SRS from the
population (or randomly assigned
treatment)
• The formula is not correct for more
complex sampling designs, i.e.,
stratified, etc.
• No way to correct for bias in data
Cautions continued:
• Outliers can have a large effect on
confidence interval
• Must know  to do a z-interval –
which is unrealistic in practice