Educational Research

Download Report

Transcript Educational Research

Educational Research
Chapter 12
Descriptive Statistics
Gay, Mills, and Airasian
10th Edition
Topics Discussed in this Chapter


Preparing data for analysis
Types of descriptive statistics





Central tendency
Variation
Relative position
Relationships
Calculating descriptive statistics
Preparing Data for Analysis

Issues



Scoring procedures
Tabulation and coding
Use of computers
Scoring Procedures

Instructions



Standardized tests detail scoring instructions
Teacher-made tests require the delineation of
scoring criteria and specific procedures
Types of items


Selected response items - easily and objectively
scored
Open-ended items - difficult to score objectively
with a single number as the result
Tabulation and Coding

Tabulation is organizing data




Identifying all information relevant to the analysis
Separating groups and individuals within groups
Listing data in columns
Coding

Assigning names to variables



EX1 for pretest scores
SEX for gender
EX2 for posttest scores
Tabulation and Coding

Reliability


Concerns with scoring by hand and
entering data
Machine scoring

Advantages


Reliable scoring, tabulation, and analysis
Disadvantages

Use of selected response items, answering on
scantrons
Tabulation and Coding

Coding


Assigning identification numbers to
subjects
Assigning codes to the values of nonnumerical or categorical variables



Gender: 1=Female and 2=Male
Subjects: 1=English, 2=Math, 3=Science, etc.
Names: 001=John Adams, 002=Sally Andrews,
003=Susan Bolton, … 256=John Zeringue
Computerized Analysis

Need to learn how to calculate descriptive
statistics by hand



Creates a conceptual base for understanding the
nature of each statistic
Exemplifies the relationships among statistical
elements of various procedures
Use of computerized software


SPSS-Windows
Other software packages
Descriptive Statistics


Purpose – to describe or summarize
data in a manner that is both
understandable and short
Four types




Central tendency
Variability
Relative position
Relationships
Descriptive Statistics
Graphing data – a
frequency polygon


SCORE
5
Vertical axis
represents the
frequency with which
a score occurs
Horizontal axis
represents the
scores themselves
Frequency

4
3
2
1
Std. Dev = 1.63
Mean = 6.0
N = 16.00
0
3.0
SCORE
4.0
5.0
6.0
7.0
8.0
9.0
Quiz 1 Results
Central Tendency


Purpose – to represent the typical score
attained by subjects
Three common measures



Mode
Median
Mean
Central Tendency

Mode




The most frequently occurring score
Appropriate for nominal data
Look for the most frequent number
Median


The score above and below which 50% of all
scores lie (i.e., the mid-point)
Characteristics



Appropriate for ordinal scales
Doesn’t take into account the value of all scores
Look for the middle # (if 2 are in contention, get
the mean of these 2 numbers.
Central Tendency

Mean


The arithmetic average of all scores
Characteristics




Advantageous statistical properties
Affected by outlying scores
Most frequently used measure of central
tendency
Add all of the scores together and divide
by the number of Ss
Calculate for the following
data points:








S1 = 10
S2 = 12
S3 = 14
S4 = 10
S5 = 14
S6 = 12
S7 = 12
S8 = 12
??= ???



Mode
Median
Mean
You know the central score,
do you need anything else?

What is the mean of the following:


What is the mean of the following:


10, 20, 200, 10, 20
51, 52, 53, 52, 52
Is there more we want to know about
the data than just what is the middle
point?
Quiz 1: Central Tendency



Count: 25
Average/ Mean: 79.7
Median:
83.5
Variability


Purpose – to measure the extent to
which scores are spread apart
Four measures




Range
Variance
Standard deviation
(there are others, but these are the only
ones we are going to talk about)
Variability

Range


The difference between the highest and lowest
score in a data set
Characteristics



Unstable measure of variability
Rough, quick estimate
Calculate


What is the range of the following:
 10, 20, 200, 10, 20
What is the range of the following:
 51, 52, 53, 52, 52
Quiz 1





Count: 25
Average:
Median:
Maximum:
Minimum:
79.7
83.5
93.4
0.0
Variability

Variance


The average squared deviation of all scores
around the mean
Characteristics




Many important statistical properties
Difficult to interpret due to “squared” metric
Used mostly to calculate standard deviation
Formula
Variance





51
52
53
52
52
-
52
52
52
52
52
=
=
=
=
=
-1
0
1
0
0





10 - 52 = -42
20 - 52 = -32
200-52 = 148
10 - 52 = -42
20 - 52 = -32
Variance





51
52
53
52
52
-
52
52
52
52
52
=
=
=
=
=
-12 = 1
02 = 0
12 = 1
02 = 0
02 = 0





10 - 52 = -422 = 1764
20 - 52 = -322 = 1024
200-52 = 1482 =21904
10 - 52 = -422 = 1764
20 - 52 = -322 = 1024
Variance





51
52
53
52
52
-
52
52
52
52
52
=
=
=
=
=
-12 = 1
02 = 0
12 = 1
02 = 0
02 = 0
2
2/5=.4
Variance = .4





10 - 52 = -422 = 1764
20 - 52 = -322 = 1024
200-52 = 1482 =21904
10 - 52 = -422 = 1764
20 - 52 = -322 = 1024
27480
27480/5 = 5496
Variance = 5496
Variability

Standard deviation


The square root of the variance
Characteristics




Many important statistical properties
Relationship to properties of the normal curve
Easily interpreted
Formula
Standard Deviation
-12 = 1

02 = 0

12 = 1

02 = 0

02 = 0
2
2/5=.4; Variance = .4
__
 √.4 = .63 = SD

51
52
53
52
52
-
52
52
52
52
52
=
=
=
=
=
10 - 52 = -422 = 1764
2
 20 - 52 = -32 = 1024
 200-52 = 1482 =21904
 10 - 52 = -422 = 1764
 20 - 52 = -322 = 1024
27480
27480/5= 5496= Variance
____
√5496 = 74.13 = SD

So now you know middle #
and spreadoutedness


How can you use that information to
standardize all of the scores to have the
same meaning.
First set of scores has a mean of 52 and
a SD of .63; second set has a mean of
52 and a SD of 74.13. How do we
compare an individual score on first to
an individual score on second?
Quiz 1: Variance






Count: 25
Average:
79.7
Median:
83.5
Maximum: 93.4
Minimum:
0.0
Standard Deviation:
18.44
The Normal Curve


A bell shaped curve reflecting the
distribution of many variables of
interest to educators
Gives a visual way of identifying where
one person’s scores fit in with the rest
of the people.
Normal Curve
The Normal Curve

Characteristics




Fifty-percent of the scores fall above the mean
and fifty-percent fall below the mean
The mean, median, and mode are the same
values
Most participants score near the mean; the further
a score is from the mean the fewer the number of
participants who attained that score
Specific numbers or percentages of scores fall
between ±1 SD, ±2 SD, etc.
The Normal Curve

Properties

Proportions under the curve



±1 SD = 68%
±1.96 SD = 95%
±2.58 SD = 99%
Skewed Distributions



None - even
Positive –
many low
scores and few
high scores
Negative – few
low scores and
many high
scores
Skewed Distribution

Which direction are the following scores
skewed:


Step 1: Reorder from lowest to highest



12,4,5,13,4,4,1,3,1,3,1,3,1,5
1,1,1,3,3,3,4,4,4,5,5,12,13
Step 2: Graph these numbers
Step 3: Compare the graph to the pictures
we showed above (tail goes toward the
direction… tail to the right, positive; tail to
the left, negative)
Skewed Distribution Example
1
1
1
3
3
3
4
4
4
5
5
12
13
Skewed Distribution Example
3
2.5
2
1.5
1st Qtr
1
0.5
0
1
3
4
5
12
13
Measures of Relative Position


Purpose – indicates where a score is in
relation to all other scores in the
distribution
Characteristics


Clear estimates of relative positions
Possible to compare students’
performances across two or more different
tests provided the scores are based on the
same group
Measures of Relative Position

Types


Percentile ranks – the percentage of scores
that fall at or above a given score
Standard scores – a derived score based
on how far a raw score is from a reference
point in terms of standard deviation units

z score
T score

Stanine

Measures of Relative Position

z score


The deviation of a score from the mean in
standard deviation units
Characteristics




Mean = 0
Standard deviation = 1
Positive if the score is above the mean and negative if it
is below the mean
Relationship with the area under the normal curve
Measures of Relative Position

T score – a transformation of a z score

Characteristics



Mean = 50
Standard deviation = 10
No negative scores
Measures of Relative Position

Stanine – a transformation of a z score

Characteristics

Nine groups with 1 the lowest and 9 the
highest
Measures of Relationship:
Correlations


Purpose – to provide an indication of the relationship
between two variables
Characteristics of correlation coefficients



Strength or magnitude – 0 to 1
Direction – positive (+) or negative (-)
Types of correlation coefficients – dependent on the
scales of measurement of the variables


Spearman rho – ranked data
Pearson r – interval or ratio data
Measures of Relationship


Interpretation – correlation does not
mean causation
Formula see page 316 in your text
book to discuss the formula for the
Pearson r correlation coefficient.
Calculating Descriptive Statistics

Using SPSS Windows

Means, standard deviations, and standard
scores


The DESCRIPTIVE procedures
Correlations

The CORRELATION procedure
Objectives 10.1, 10.2, 10.3, & 10.4