Chapter 11: Bivariate Statistics and Statistical Inference
Download
Report
Transcript Chapter 11: Bivariate Statistics and Statistical Inference
“Figures don’t lie, but liars figure.”
Chapter 11: Bivariate Statistics
and Statistical Inference
Key Concepts: Statistical
Inference
1
Making Inferences
We calculate the odds…
Crossing a busy street and not getting hit?
The movie will be good?
A child will be safe if left with parents in which
there was previous child abuse?
Probability (the odds, chances of)
The chances of an event based on the ratio of
favorable outcomes to total outcomes.
Getting heads on a coin flip 1/2 = .5 (50%)
Getting an Ace from cards: 4/52 = 7.7%
2
What are the odds (con’t.)
Average height: male – 5’9”; female 5’5”
Which groups are all male and all female?
Is the sample representative of the population?
How certain are you of your answer?
Could all 3 groups be the same sex?
Group 1: 5’8, 5’6, 5’4, 5’2, 5’4, 5’5, 5’6, 5’1
Group 2: 5’9, 6’1, 5’8, 5’7, 5’8, 5’9, 6’2, 5’6
Group 3: 5’6, 5’8, 6’0, 5’2, 5’7, 5’5, 5’7, 5’6
3
Statistical Inference
Variation – differences in behavior,
attitudes, values, characteristics, etc.
There is variation in the population
e.g., some people like chocolate, others like vanilla.
A sample is picked from the population.
We study the sample to make inferences about the
population.
To do so, the sample must reflect the population in
the characteristics under study.
If 30% prefer chocolate in the sample, we’d like to
conclude that 30% prefer chocolate in the
population.
4
Statistical Inference (con’t.)
BUT, sometimes the sample does not reflect the
population – the extent to which it doesn’t is
SAMPLE ERROR.
We can calculate the chances that the relationship
between variables in the sample is due to sample
error.
Influences on sample error
Luck – someone wins the lottery even if the odds are
40 million to 1.
Sample size – smaller samples will have more
chances of error.
Homogeneity – less variation in the population yields
smaller sample error.
5
Hypothesis Testing
Testing the relationship between two or more
variables.
Statistical tests are used to find the probability
that the relationship between variables is due
to sampling error or to chance.
Type
Example
Null hypothesis (Ho) – No relationship There is no relationship between
income and mental health.
Two-tailed hypothesis (H1) – There is
a relationship
There is a relationship between income
and mental health.
One-tailed hypothesis (H1) –
Directional relationship
The greater the income the greater the
mental health.
6
Statistical Inference (con.)
p-value
The probability that a relationship between
variables or a mean difference found in a sample
is a result of sample error.
p =.05 means there is a 5% chance that the
relationship found in the sample is a result of
sample error.
p =.05 means there is a 95% that the relationship
is NOT due to sample error, and actually reflects
the differences in the population.
Rejection level: If the p value is <.05, we reject
the null hypothesis and accept the alternative
hypothesis. (Why .05? – Convention).
7