Hypothesis testing in the REAL world

Download Report

Transcript Hypothesis testing in the REAL world

Hypothesis testing in the REAL
world
t tests: single sample
repeated measures
independent samples
Need for the t test
• Hypothesis testing using z requires knowledge of
population parameters (mean, SD)
• Typical psychological experiments
▫ No knowledge of parameters
▫ Often don’t really care
 e.g. compare effects of IV manipulation in
experimental and control group
▫ Underlying population not critical
▫ Interested in change produced by IV
▫ Typically use ESTIMATES
Estimation
• Best predictor of population mean?
▫ Sample mean
• Best predictor of population SD?
▫ Sample SD
 BUT sample SD will systematically
underestimate true population
variability
 biased estimate
 bias to reject null hypothesis
How do we fix it?
• Adjust the formula for SD
▫ Now we’ll call it s
• n-1
• degrees of freedom
▫ Number of scores in a sample
that are free to vary
• Adjust our comparison
distribution
The t distribution
• Shape-shifter
Using the t table
• df
•p
• cutoff (critical) value
• Things to note:
▫ As df increases, critical value decreases
 Related to sample size
▫ With an infinite sized sample, what are the CVs?
▫ n=30
A sample problem: pg 232
• Conceptual and computational formuli
• SPSS
• Steps in hypothesis testing: hopefulness
following a flood
1. Null hypothesis
Alternative
M(flood) = M(neutral)
M(flood) not= M(neutral)
 Neutral = score of 4
2. Comparison distribution
use estimates from sample
estimated mean
sample mean
estimated SD (s)
s of the sampling distribution
3. Cutoff/critical value
use t table with appropriate df
4. Compute t
5. Decide
6. Tell grandma
A saner approach….
• Computational
formula
• SPSS
Types of t tests
• t test for single sample
▫ Compares sample mean to known or predicted
population mean (e.g. “neutral” hopefulness; no
ESP baseline probability)
• t test for dependent means
▫ Compares samples (from the same people)
collected before and after a given manipulation
 aka – within subjects, repeated measures, paired
samples
• t test for independent means
▫ Compares samples from two unique sets of people
with different levels of the IV for each (EXP,
control groups design)
t test for dependent means
• Difference between one test to the next
▫ Role of IV
▫ If null is true, then difference = ?
▫ pg 238 Communication scores before and after
premarital counseling
1.
2.
3.
4.
5.
6.
State null and alternative
Select appropriate t
Determine critical value
Compute
Decide
Tell your Babushka
t test for independent means
• Pooled estimate of the population SD
▫ Weighted average of s for each sample
▫ pg 279 Effects of writing on health
1.
2.
3.
4.
5.
6.
State null and alternative
Select appropriate t
Determine critical value
Compute
Decide
Tell your Nai nai
Let’s give it a whirl….
1. Siegel (1990) found that elderly people who owned dogs
were less likely to pay visits to their doctors after
upsetting events than those who did not own
pets. Similarly, consider the following data: A sample of
elderly dog owners is compared to a similar group of
elderly persons who do not own dogs. The researcher
records the number of visits to the doctor during the past
year for each person. For the following data, is there a
significant difference in the number of doctor visits
between dog owners and control subjects? Use the .05
level of significance.
CONTROL GROUP: 12, 10, 6, 9, 15, 12, 14
DOG OWNERS: 8, 5, 9, 4, 6
WHAT KIND OF TEST IS THIS???
A psychologist tests a new drug for its painkilling effects. Pain threshold is
measured for a sample of subjects by determining the intensity (in
milliamperes) of electric shock that causes discomfort. After the initial
baseline is established, subjects receive the drug and their pain thresholds
are once again measured. The data are listed below. Do these data indicate
that the drug produces a significant increase in pain tolerance? Use a one
tailed test with the .05 level of significance.
PAIN THRESHOLDS (milliamperes)
without drug
with drug
2.1
3.2
2.3
2.9
3.0
4.6
2.7
2.7
1.9
3.1
2.1
2.9
2.9
2.9
2.7
3.4
3.2
5.3
2.5
2.5
3.1
4.9
A group of students recently complained that all of the statistics
classes are offered in the morning. They claim that they "think
better" later in the day and, therefore, would do better if the course
was offered in the afternoon rather than in the morning. To test this
claim, the instructor scheduled the course at the 2:40 time slot one
semester. The afternoon class is given the exact same course as
during prior semesters, including the same final exam. The
instructor knows that, for previous students the final exam scores
had been normally distributed with a mean of 70. The afternoon
class, with 16 students enrolled, got a mean of 76 with a sample
standard deviation of 8. Did the students in the afternoon section
perform significantly better?
a. Test with p=.05 with a one tailed test.
b. Identify the dependent and independent variables