Transcript The t-test

Inferential Statistics I:
The t-test
Experimental Methods and
Statistics
Department of
Cognitive Science
Michael J. Kalsher
1 of 65
Outline
Definitions
Descriptive vs. Inferential Statistics
The t-test
- One-group t-test
- Dependent-groups t-test
- Independent-groups t-test
2 of 65
The t-test:
•
Basic Concepts
Types of t-tests
- Independent Groups vs. Dependent
Groups
• Rationale for the tests
- Assumptions
•
•
•
•
Interpretation
Reporting results
Calculating an Effect Size
t-tests as GLM
3 of 65
Beer and Statistics:
A Winning Combination!
William Sealy Gosset (1876–1937)
Famous as a statistician, best known by his pen name
Student and for his work on Student's t-distribution.
4 of 65
5 of 65
6 of 65
The One Group t test
The One-group t test is used to compare a sample mean
to a specific value (e.g., a population parameter; a neutral point on a
Likert-type scale).
Examples:
1. A study investigating whether stock brokers differ from the general population on
some rating scale where the mean for the general population is known.
2. An observational study to investigate whether scores differ from some neutral point
on a Likert-type scale.
Calculation of ty :
Note: The symbol ty indicates
this is a t test for a single group
mean.
ty =
Mean Difference
Standard Error
(of the mean difference)
7 of 65
8 of 65
9 of 65
Assumptions
The one-group t test requires the following statistical
assumptions:
1. Random and Independent sampling.
2. Data are from normally distributed populations.
Note: The one-group t test is generally considered robust against violation of this
assumption once N > 30.
10 of 65
Computing the one-group
t test by hand
11 of 65
12 of 65
13 of 65
Critical Values:
One-Group t test
Note: Degrees of Freedom = N - 1
14 of 65
Computing the one-group
t test using SPSS
15 of 65
Move DV to box labeled
“Test variable(s):
Type in “3” as a proxy
for the population mean.
16 of 65
SPSS Output
17 of 65
Reporting the Results:
One Group t test
The results showed that the students’ rated level of
agreement with the statement “I feel good about myself”
(M=3.4) was not significantly different from the scale’s
neutral point (M=3.0), t(4)=.784. However, it is important
to note several important limitations with this result,
including the use of self-report measures and the small
sample size (five participants). Additional research is
needed to confirm, or refute, this initial finding.
18 of 65
19 of 65
20 of 65
21 of 65
22 of 65
23 of 65
24 of 65
25 of 65
26 of 65
27 of 65
28 of 65
29 of 65
30 of 65
31 of 65
32 of 65
33 of 65
34 of 65
35 of 65
36 of 65
11. Select both Time 1 and
Time 2, then move to the box
labeled “Paired Variables.”
12. Next, “click”, “Paste”.
37 of 65
38 of 65
The Independent Groups t test:
Between-subjects designs
Assumption:
Participants contributing to the two means come from
different groups; therefore, each person contributes only
one score to the data.
Calculation of t:
t=
Mean Difference
Standard Error
(of the mean difference)
39 of 65
Standard Error:
How well does my sample
represent the population?
6
When someone takes
a sample from a
population, they are
taking one of many
possible samples-each of which has its
own mean (and s.d.).
Sampling
Distribution
5
Frequency
4
3
We can plot the
sample means as a
frequency distribution
or sampling
distribution.
2
1
0
Sample Mean
10
40 of 65
Standard Error:
How well does my sample
represent the population?
The Standard Error, or Standard Error of the Mean, is an
estimate of the standard deviation of the sampling distribution
of means, based on the data from one or more random
samples.
•
•
•
Large values tell us that sample means can be quite different, and
therefore, a given sample may not be representative of the population.
Small values tell us that the sample is likely to be a reasonably accurate
reflection of the population.
An approximation of the standard error can be calculated by dividing the
sample standard deviation by the square root of the sample size
SE = 
N
41 of 65
Standard Error:
Applied to Differences
We can extend the concept of standard error to situations in
which we’re examining differences between means.
The standard error of the differences estimates the extent to
which we’d expect sample means to differ by chance alone-it is a measure of the unsystematic variance, or variance not
caused by the experiment.
An estimate of the standard error can be calculated by
dividing the sample standard deviation by the square root of
the sample size.
SE = 
N
42 of 65
43 of 65
Computing the independentgroups t test by hand
44 of 65
Sample Problem
Anxiety Scores
Liberal
Arts
Behavioral
Science
45
58
63
59
62
63
51
68
54
74
63
68
52
52
54
66
64
69
49
57
A college administrator reads an article
in USA Today suggesting that liberal
arts professors tend to be more
anxious than faculty members from
other disciplines within the humanities
and social sciences. To test whether
this is true at her university, she carries
out a study to determine whether
professors teaching liberal arts courses
are more anxious than professors
teaching behavioral science courses.
Sample data are gathered on two
variables: type of professor and level
of anxiety.
45 of 65
46 of 65
47 of 65
48 of 65
Critical Values:
Independent Groups t test
Note: Degrees of Freedom = N1 + N2 - 2
49 of 65
50 of 65
Reporting the Results:
Independent Groups t test
On average, the mean level of anxiety among a
sample of liberal arts professors (M = 55.7) was
significantly lower than the mean level of anxiety
among a sample of behavioral science
professors (M = 63.4), t(18) = -2.54, p < .05, r2 =
.26. The effect size estimate indicates that the
difference in anxiety level between the two
groups of professors represents a large effect.
51 of 65
Computing the independentgroups t test using SPSS
52 of 65
Sample Problem
A researcher is interested in
comparing the appetite suppression
effects of two drugs, fenfluramine
and amphetamine, in rat pups. Fiveday-old rat pups are randomly
assigned to be injected with one of
the two drugs. After injection, pups
are allowed to eat for two hours.
Percent weight gain is then
measured.
Percent Weight Gain
Fenfluramine Amphetamine
2
8
3
10
3
4
4
7
4
9
5
3
6
7
Compute the independent groups ttest using the data at right.
6
12
6
6
Is this a true experiment, quasiexperiment, or observational study?
7
8
53 of 65
54 of 65
55 of 65
56 of 65
SPSS Output: Independent-Groups t test
Group Statistics
Weightgain
DrugType
1
2
N
Mean
4.60
7.40
10
10
Std. Deviation
1.647
2.675
Std. Error
Mean
.521
.846
Independent Samples Test
Levene's Tes t for
Equality of Variances
F
Weightgain
Equal variances
ass umed
Equal variances
not as sumed
1.110
Sig.
.306
t-tes t for Equality of Means
t
df
Sig. (2-tailed)
Mean
Difference
Std. Error
Difference
95% Confidence
Interval of the
Difference
Lower
Upper
-2.819
18
.011
-2.800
.993
-4.887
-.713
-2.819
14.964
.013
-2.800
.993
-4.918
-.682
57 of 65
Calculating Effect Size:
Independent Samples t test
r=
t2
(-2.819)2
t2 + df
(-2.819)2 + 18
7.95
r =.5534
7.95 + 18
r2 = .306
Note: Degrees of
freedom calculated by
adding the two sample
sizes and then
subtracting the number
of samples:
df = 10 + 10 – 2 = 18
58 of 65
Reporting the Results:
Independent Groups t test
On average, the percent weight gain of five-dayold rat pups receiving amphetamine (M = 7.4, SE
= .85) was significantly higher than the percent
weight gain of rat pups receiving fenfluramine (M
= 4.6, SE = .52), t(18) = -2.82, p < .05, r2 = .31.
The effect size estimate indicates that the
difference in weight gain caused by the type of
drug given represents a large, and therefore
substantive, effect.
59 of 65
60 of 65
61 of 65
62 of 65
63 of 65
64 of 65
65 of 65