Chapters 22, 24, 25

Download Report

Transcript Chapters 22, 24, 25

Chapters 22, 24, 25
Notes Chapter 22
Comparing Two Proportions
• Use when we want to compare the
proportions from two groups of categorical
data:
• Conditions:
*Both are SRS’s from populations of interest
*n1  10% of population 1
n2  10% of population 2
*n1p1, n1(1-p1), n2p2, n2(1-p2) are all  10
For Confidence Intervals:
(we do not assume that p1 = p2 therefore we
do not pool for CI's)
( pˆ1  pˆ 2 )  z *
pˆ1 (1  pˆ1 ) pˆ 2 (1  pˆ 2 )

n1
n2
*choose the non-pooled formula from chart,
calculator will make correct choice automatically
For Confidence Intervals:
• Our confidence interval statements reflect
the true difference in the two proportions
(in context).
For Significance/Hypothesis
Testing
Ho: p1 = p2
Ha: p1  p2 or
p1 < p2
or
p1 > p2
(Since our null states that p1 = p2, we
assume this to be true until proven
otherwise, therefore we automatically pool
here)
z
pˆ1  pˆ 2
1 1
pˆ (1  pˆ )(  )
n1 n2
Where
x1 x2
pˆ 
n1 n2
*choose the pooled formula from the
chart, the calculator will
automatically pool
We rely on the same p-values and alphas to make
our conclusions.
Notes Chapter 24
Comparing Two Means
• When our data is quantitative, then we are
either looking for an interval that contains
their differences or we are comparing
them to one another.
Notes Chapter 24
Comparing Two Means
• Two-Sample t Interval:(Quantitative Data)
s s
x  x   t  
n n
2
1
1
2
2
2
1
2
*choose the non-pooled formula from the formula sheet
*df = there is a nasty formula – ick!
But …calculator gives you this 
Notes Chapter 24
Comparing Two Means
Two-Sample t-test
t
x1  x 2
s12 s 2 2

n1 n 2
*choose the non-pooled formula from the formula
sheet
*df = there is a nasty formula – ick!
But …calculator gives you this 
Conditions:
*The two groups we are comparing must be independent
of each other.
*Both n1 & n2 must be SRS from the populations of interest
*Both n1 & n2 must be < 10% of their respective populations
of interest
*Sample size restriction must be met:
If n1 + n2 < 15, do not use if outliers or severe skewness
are present
If 15 ≤ n1 + n2 < 30, use except in presence of outliers
If n1 + n2  30, sample is large enough to use regardless of
outliers or skewness by CLT
Two-Sample t Procedures:
• We do not pool t-tests.
• Pooling assuming equal σ values. Since
σ1 and σ2 are both unknown, why would
we assume they are equal?
• You must tell the calculator you do not
want to pool. “Just say No”.
• When we interpret confidence intervals we
say “the true difference between two
means” (not the mean difference).
Notes Chapter 25
Paired Samples
• Often we have samples of data that are
drawn from populations that are not
independent. We have to watch carefully
for those!! We can not treat these
samples as two independent samples, we
must consider the fact that they are
related.
• So…what do we do?
Notes Chapter 25
Paired Samples
• If we have two samples of data that are
drawn from populations that are not
independent, we use that data to create a
list of differences. That list of differences
then becomes our data and we will not
use the two individual lists of data
again.
Notes Chapter 25
Paired Samples
• Once we have the list of differences,
everything else is like 1 sample
procedures from the last unit.
• We will treat that list of differences as
our data.
• That list must meet the conditions for a
single sample t-distribution. We will use
the t-interval or the t-test on this list of
differences depending on the question.
• When we interpret our confidence interval
or make our conclusion we will be talking
about the “mean of the differences.”
Reminders for Single Sample t
Conditions:
*Data is from an SRS of size n from the population of
interest
* Sample size < 10% of Population size
*Sample is approx. normal/no outliers or large n
(see guidelines below)
• n < 15: Use t-procedures if the data is close to normal.
If severe skewness or outliers are present, do not use t.
• 15 ≤ n < 30 : The t-procedures can be used except in the
presence of outliers
• large n: The t-procedures can be used even for clearly
skewed distributions when the sample is large (n  30)
by CLT.
The One-Sample t-Procedure:
• A level C confidence interval for  is:
 s 
X t

 n
where t* is the critical value from the t
distribution based on degrees of freedom.
• To test the hypothesis: Ho:  = o and
Ha:  > o
Ha:  < o
Ha:   o
• Calculate the test statistics t and the p
value.
• We make the same conclusions based on
p-values and alpha.
• For a one-sample t statistic:
X 
t
s/ n
has the t distribution with n – 1 degrees of
freedom.
• Just remember that all the variables
represent the “mean difference” for your
populations.