UTOPPS—Fall 2004

Download Report

Transcript UTOPPS—Fall 2004

UTOPPS—Fall 2004
Teaching Statistics in Psychology
AP Psychology “Suggestions”

Descriptive Statistics
Measures of central tendency
 Variability
 Correlation


Experimental Design
Appropriate Sampling
 Control

Measures of Central Tendency

Mean


Median


True average
True middle of sorted data
Mode

Most frequent data value
How these measures relate

In a normal distribution…


Mean = Median = Mode
In a nonnormal distribution (one
that is skewed)…

The mean will be impacted by the
unusual values and will be pulled
toward the skew.
Examples
Which measure do you use?


If the distribution is close to
normal, they are all equal and it
makes no difference
If the distribution is skewed, the
median is the best measure of
center because it is “resistant” to
influence from outliers that
cause the distribution to be
skewed.
Variability

Variation occurs





there may be a center to the data, but each
individual value will vary around that center
Range of the data (max-min)
Quartiles (divide the data into quarters)
Interquartile Range (Q3 – Q1)
Standard Deviation



Average distance that a data point lies away
from the mean
As a rough estimate—figure out the distance
that each data value lies from the mean, and
take the average
Not an exact calculation, but it gives a rough
idea.
Practicing Calculations


1,1,2,3,4,5,6,7,8,9,10,16
Median (easiest)


5.5 (average between 5 and 6)
Mean
Add ‘em all up and divide by 12
 6


Mode

1 (occurs most frequently)
More Practice


1,1,2,3,4,5,6,7,8,9,10,16
Range


Quartiles




16 – 1 = 15
Medians of the lower and upper halves
Q1 = 2.5
Q3 = 8.5
Interquartile range

Q3 – Q1 = 6
Standard Deviation




1,1,2,3,4,5,6,7,8,9,10,16
The formula for standard deviation is
quite complicated, but we will use a
procedure to estimate this value.
The mean value is 6
The differences between each data
point and 6 are all follows:
-5, -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 10

Square those differences, add them
all up and divide by 12 and then
square root.


(25+25+16+9+4+1+1+4+9+16+100)/12
Square root your answer
When to use Standard
Deviation


Standard deviation (like the
mean) is impacted by unusual
values, so should not be used
unless the distribution is close to
normal.
If the distribution is normal,
standard deviations are an
excellent way to see how
variability within the data.
Empirical Rule
How to interpret and use
standard deviations




More than three standard deviations
away from the mean is unusual, not
impossible
If you see a result like this, we
attribute that to mean that something
unusual is occurring
We sometimes use this as evidence
against the current mean
We sometimes use a p-value to
indicate how unusual this result is

Something is considered unusual if the
p-value is small (p < .05). Look back at
the empirical rule slide…
Z-scores





Measure the number of standard
deviations that a data point lies from
the mean
If z is negative, the data point is
below the mean
If z is positive, the data point is
above the mean
Z = (data point – mean) / st. dev
If z is more than 2.5, then it is
considered unusual in most settings
AP Psych Exam 2003
A. Statistics are often used to describe and
interpret the results of intelligence
testing.
•
•
•
•
Describe the three measures of central
tendency
Describe a skewed distribution
Relate the three measures of central
tendency to a normal distribution
Relate the three measure of central
tendency to positively skewed distribution
• An intelligence test for which the scores
are normally distributed has a mean of 100
and a standard deviation of 15. Use this
info to describe how the scores are
distributed.
• In two normal distribution, the means are
100 for group I and 115 for group II. Can
an individual in group I have a higher
score than the mean score for group II.
Explain.
Correlation between two
quantitative variables



Correlation is not Causation
Correlation shows that some linear
relationship exists.
Correlation Coefficient




-1 < r < 1
-1 shows a very strong negative
relationship
1 shows a very strong positive
relationship
0 shows no correlation between the
variables
Correlation between
categorical variables



Difficult to assess
Comparing percentages can be
misleading
Bar graphs can be misleading