Transcript t - cyclass

Two-sample Tests of Hypothesis
GOALS
1.
2.
3.
4.
11-2
Conduct a test of a hypothesis about the
difference between two independent
population means.
Conduct a test of a hypothesis about the
difference between two population proportions.
Conduct a test of a hypothesis about the mean
difference between paired or dependent
observations.
Understand the difference between dependent
and independent samples.
Comparing two populations – Some
Examples
1.
2.
3.
11-3
Is there a difference in the mean value of
residential real estate sold by male agents
and female agents in south Florida?
Is there a difference in the mean number of
days absent between young workers (under
21 years of age) and older workers (more than
60 years of age) in the fast-food industry?
Is there an increase in the production rate if
music is piped into the production area?
Z-test in SPSS
11-4

There is no (direct) Z-test in SPSS. Just use
T-test, instead. We don’t need to use Z-test
because the T approaches Z as n
approaches infinity.

(That is, the T=Z for large n). Is your sample
size enough? The rule of thumb for sample
size is 10 to 20 cases per variable.
Comparing Two Population Means of
Independent Samples



The samples are from independent populations.
The standard deviations for both populations are known
The formula for computing the value of z is:
z
X1  X 2
 12
n1

 22
n2
Note: The z-test above may also be used if the sample sizes are
both at least 30 even when the population standard deviations
are not known
11-5
Comparing Two Population Means of
Independent Samples – Example
The U-Scan facility was recently
installed at the Byrne Road Food-Town
location. The store manager would like
to know if the mean checkout time
using the standard checkout method is
longer than using the U-Scan. She
gathered the following sample
information. The time is measured
from when the customer enters the line
until their bags are in the cart. Hence
the time includes both waiting in line
and checking out.
11-6
EXAMPLE
continued
Step 1: State the null and alternate hypotheses.
H0: µS ≤ µU
H1: µS > µU
Step 2: State the level of significance.
Test using .01 significance level.
Step 3: Find the appropriate test statistic.
Because both population standard deviations are given (or both
samples are more than 30), we use z-distribution as the test
statistic.
11-7
Example continued
Step 4: State the decision rule.
Reject H0 if Z > Z
Z > 2.33
11-8
Example
continued
Step 5: Compute the value of z and make a decision
z
Xs  Xu
 s2

ns

 u2
nu
5.5  5.3
2
2
0.40 0.30

50
100
0.2

 3.13
0.064
11-9
The computed value of 3.13 is larger than the
critical value of 2.33. Our decision is to reject the
null hypothesis. The difference of .20 minutes
between the mean checkout time using the
standard method is too large to have occurred by
chance. We conclude the U-Scan method is
faster.
Two-Sample Tests about Proportions
Here are several examples.
 The vice president of human resources wishes to know whether
there is a difference in the proportion of hourly employees who
miss more than 5 days of work per year at the Bangkok and the
Salaya plants.
 A consultant to the airline industry is investigating the fear of
flying among adults. Specifically, the company wishes to know
whether there is a difference in the proportion of men versus
women who are fearful of flying.
11-10
Two Sample Tests of Proportions
11-11

We investigate whether two samples came from
populations with an equal proportion of successes.

The two samples are pooled using the following
formula.
Two Sample Tests of Proportions
continued
The value of the test statistic is computed from
the following formula.
11-12
Two Sample Tests of Proportions Example
Manelli Perfume Company recently developed a new
fragrance that it plans to market under the name Heavenly. A
number of market studies indicate that Heavenly has very
good market potential. The Sales Department at Manelli is
particularly interested in whether there is a difference in the
proportions of younger and older women who would
purchase Heavenly if it were marketed.
Two independent populations, a population consisting of the
younger women and a population consisting of the older
women, were surveyed. Each sampled woman was asked to
smell Heavenly and indicate whether she likes the fragrance
well enough to purchase a bottle. Of 100 young women, 19
liked the Heavenly fragrance well enough to purchase
Similarly, a sample of 200 older women, 62 liked the
fragrance well enough to make a purchase.
11-13
Two Sample Tests of Proportions Example
Step 1: State the null and alternate hypotheses.
H 0: 1 = 2
H 1: 1 ≠ 2
note: keyword “there is a difference in the proportion”
We designate 1 as the proportion of young women who would purchase
Heavenly and 2 as the proportion of older women who would purchase
Step 2: State the level of significance.
The .05 significance level is chosen.
Step 3: Find the appropriate test statistic.
We will use the z-distribution
11-14
Two Sample Tests of Proportions Example
Step 4: State the decision rule.
Reject H0 if Z > Z/2 or Z < - Z/2
Z > 1.96 or Z < -1.96
11-15
Two Sample Tests of Proportions Example
Step 5: Compute the value of z and make a decision
The computed value of -2.21 is in the area of rejection. Therefore, the null hypothesis is
rejected at the .05 significance level. To put it another way, we reject the null hypothesis
that the proportion of young women who would purchase Heavenly is equal to the
proportion of older women who would purchase Heavenly.
11-16
Comparing Population Means with Unknown
Population Standard Deviations (the Pooled t-test)
The t distribution is used as the test statistic if one
or more of the samples have less than 30
observations. The required assumptions are:
1. Both populations must follow the normal
distribution.
2. The populations must have equal standard
deviations.
3. The samples are from independent populations.
11-17
Small sample test of means continued
Finding the value of the test
statistic requires two
steps.
1.
2.
11-18
(n1  1) s12  (n2  1) s22
Pool the sample standard s 
n1  n2  2
deviations.
2
p
Use the pooled standard
deviation in the formula.
t
X1  X 2
2
s p 
1
1 
 
 n1 n2 
Comparing Population Means with Unknown
Population Standard Deviations (the Pooled t-test)
Owens Lawn Care, Inc., manufactures and assembles
lawnmowers that are shipped to dealers throughout the
United States and Canada. Two different procedures have
been proposed for mounting the engine on the frame of the
lawnmower. The question is: Is there a difference in the
mean time to mount the engines on the frames of the
lawnmowers? The first procedure was developed by
longtime Owens employee Herb Welles (designated as
procedure 1), and the other procedure was developed by
Owens Vice President of Engineering William Atkins
(designated as procedure 2). To evaluate the two methods, it
was decided to conduct a time and motion study.
A sample of five employees was timed using the Welles
method and six using the Atkins method. The results, in
minutes, are shown on the right.
Is there a difference in the mean mounting times? Use the
.10 significance level.
11-19
Comparing Population Means with Unknown Population
Standard Deviations (the Pooled t-test) - Example
Step 1: State the null and alternate hypotheses.
H0: µ1 = µ2
H1: µ1 ≠ µ2
Step 2: State the level of significance. The .10 significance level is
stated in the problem.
Step 3: Find the appropriate test statistic.
Because the population standard deviations are not known but are
assumed to be equal, we use the pooled t-test.
11-20
Comparing Population Means with Unknown Population
Standard Deviations (the Pooled t-test) - Example
Step 4: State the decision rule.
Reject H0 if t > t/2,n +n -2 or t < - t/2,n +n -2
1 2
1 2
t > t.05,9 or t < - t.05,9
t > 1.833 or t < - 1.833
11-21
Comparing Population Means with Unknown Population
Standard Deviations (the Pooled t-test) - Example
Step 5: Compute the value of t and make a decision
(a) Calculate the sample standard deviations
11-22
Comparing Population Means with Unknown Population
Standard Deviations (the Pooled t-test) - Example
Step 5: Compute the value of t and make a decision
The decision is not to reject the
null hypothesis, because -0.662
falls in the region between -1.833
and 1.833.
-0.662
11-23
We conclude that there is no
difference in the mean times to
mount the engine on the frame
using the two methods.
Practice

Use employee data.sav
–
–
–
–
11-24
Analyze
Compare means
independent sample t- test
Do male and female respondents have different
average current salaries?
Two-Sample Tests of Hypothesis:
Dependent Samples
Dependent samples are samples that are paired or
related in some fashion.
For example:
– If you wished to buy a car you would look at the
same car at two (or more) different dealerships
and compare the prices.
– If you wished to measure the effectiveness of a
new diet you would weigh the dieters at the start
and at the finish of the program.
11-25
Hypothesis Testing Involving
Paired Observations
Use the following test when the samples are
dependent:
d
t
sd / n
Where
d is the mean of the differences
sd is the standard deviation of the differences
n is the number of pairs (differences)
11-26
Hypothesis Testing Involving
Paired Observations - Example
Nickel Savings and Loan
wishes to compare the two
companies it uses to
appraise the value of
residential homes. Nickel
Savings selected a sample
of 10 residential properties
and scheduled both firms for
an appraisal. The results,
reported in $000, are shown
on the table (right).
At the .05 significance level, can we conclude there is a difference in
the mean appraised values of the homes?
11-27
Hypothesis Testing Involving
Paired Observations - Example
Step 1: State the null and alternate hypotheses.
H0: d = 0
H1: d ≠ 0
Step 2: State the level of significance.
The .05 significance level is stated in the problem.
Step 3: Find the appropriate test statistic.
We will use the t-test
11-28
Hypothesis Testing Involving
Paired Observations - Example
Step 4: State the decision rule.
Reject H0 if
t > t/2, n-1 or t < - t/2,n-1
t > t.025,9 or t < - t.025, 9
t > 2.262 or t < -2.262
11-29
Hypothesis Testing Involving
Paired Observations - Example
Step 5: Compute the value of t and make a decision
The computed value of t
is greater than the
higher critical value, so
our decision is to reject
the null hypothesis. We
conclude that there is a
difference in the mean
appraised values of the
homes.
11-30
Practice

Use employee data.sav
–
–
–
Analyze
Compare means
Paired sample t- test


11-31
Beginning salary
Current salary
Group assignment (Analyze and
interpret)

Use one sample mean
–
From employee data.sav


Do all employees have previous working experience of
the average of 80 months since hired?
From 1991 US general social survey.sav

Independent t-test
–
Do male and female respondents have average age
difference?
– Do male and female respondents have equal number of
Brothers and Sisters on average in their family?
– Do male and female respondents have equal umber of
Children on average in their family?
11-32
The End
11-33