Transcript chapter 2


FREQUENCY DISTRIBUTION TABLES

FREQUENCY DISTRIBUTION GRAPHS

Frequency Distribution: Lists each
category (label) of data and the
number of occurrences.

Sum of all = population or sample size

Relative Frequency: The proportion of
occurrences for each category
Frequency
calculated as:
Sum _ of _ all _ frequencies
Sum of all = 1.

Bar Graph: Vertical or Horizontal. X-axis
contains the categories or labels. For
Frequency Distributions the y-axis is the
number of occurrances. For Relative
Frequency Distributions the y-axis is the
proportion (values between 0 and 1).
Bars do not need to be touching.
SPECIES
ZOO A
ZOO B
Elephant
6
3
Giraffe
12
7
Impala
13
24
Zebra
1
2
Ostrich
6
1
Guinea Hens
25
12
NUMBER
ZOO A ANIMAL INVENTORY
30
25
20
15
10
5
0
E
nt
a
ph
e
l
fe
f
ira
G
la
a
p
Im
ra
b
Ze
SPECIES
s
O
h
c
i
tr
ea
n
ui
G
s
n
He
SPECIES
G
ui
n
He
ns
ffe
SPECIES
ea
h
He
ns
O
st
r ic
Ze
br
a
Im
pa
la
G
ira
ep
ha
nt
ea
h
Ze
br
a
NUMBER
NUMBER
30
25
20
15
10
5
0
El
G
ui
n
ffe
Im
pa
la
G
ira
ep
ha
nt
O
st
r ic
El
ZOO A ANIMAL INVENTORY
ZOO B ANIMAL INVENTORY
30
25
20
15
10
5
0
He
ns
ffe
SPECIES
h
Ze
br
a
O
st
r ic
ep
ha
nt
G
ira
Im
pa
la
ea
El
G
ui
n
NUMBER
ZOO A ANIMAL INVENTORY
30
25
20
15
10
5
0

CAN TREAT DISCRETE DATA LIKE
QUALITATIVE (IF ONLY SEVERAL VALUES)
OR AS WE WILL BE TREATING
CONTINUOUS DATA (IF MANY VALUES)

SEPARATE CONTINUOUS DATA INTO
CLASSES (INTERVALS) AND THEN DO
DISTRIBUTION TABLES OR GRAPHS
Frequency Distribution Table: Similar to
that for qualitative data, but each class
is for a value or an interval (range) of
values.
Histograms: Vertical bar graphs, where
the x-axis is the number line and each
bar is for a class. All bars must touch side
to side. Uses Lower Class limit on x-axis.

Cumulative Frequency Distributions: Each
class listed as before (lowest to largest), but
the frequencies are the total for that
frequency and all the lower classes.

Relative Cumulative Frequency Distribution:
Each Cumulative Frequency divided by
total of all frequencies. The last class will
have a cumulative value of 1.0
Use number of siblings
 Do as Frequency Table
 Do as Relative Frequency
 Do as Cumulative Frequency
 Do as Relative Cumulative Frequency


Class: An interval of numbers along the
number line.

Lower Class Limit (LCL): The beginning
number of the class.

Upper Class Limit (UCL): The last number
of the class.

Class Width: the difference between lower
class limits (or upper class limits), found by
taking using data set’s maximum and minimum
Maximum  Minimum
and calculating
#_ of _ Classes
rounding up to a convenient value

Midpoint of Each Class: The point in the middle
of the class, found by averaging the class
lower class limit and the next class lower class
limit.
1. Organize data in ascending order:
1.03
1.72
1.99
3.21
4.24
4.58
1.36
1.75
2.52
3.47
4.27
4.72
1.45
1.85
2.67
3.50
4.43
4.75
1.51
1.92
3.06
3.72
4.54
4.79
1.63
1.95
3.20
3.78
4.57
4.91
2. Determine the number of classes (5 –
20): For this we will use 6.
3. Find the maximum and minimum: For
this max = 4.91 and min = 1.03
4. Calculate the Class Width:
MAX  MIN  4.911.03  0.647
6
#_ of _ CLASSES
Round UP to a convenient value. We will
use 0.70.
5. Determine First Lower Class Limit: For this
we will use 1.00 (something convenient and
lower than the Minimum).
6. Determine the next 5 Lower Class Limits by
adding class width to the first and each
subsequent to get the next:
1.00+.70=1.70; 1.70+.70=2.40 …
3.10, 3.80, 4.50.
7. Determine the first Upper Class Limit by
Subtracting 1 from the last place of the
second Lower Class Limit: 1.70-.01=1.69.
8. Find the other 5 Upper Class Limits by
adding the class width to each previous
Upper Class Limits: 1.69+.7=2.39,
2.39+.7=3.09, …, 3.79, 4.49, 5.19
9. Now construct the Table ……:
CLASS
LOWER CLASS LIMIT
UPPER CLASS LIMIT
FREQUENCY
1.00
1.69
?
1.70
2.39
?
2.40
3.09
?
3.10
3.79
?
3.80
4.49
?
4.50
5.19
?
And count the frequencies in each class …:
1.03
1.72
1.99
3.21
4.24
4.58
1.36
1.75
2.52
3.47
4.27
4.72
1.45
1.85
2.67
3.50
4.43
4.75
1.51
1.92
3.06
3.72
4.54
4.79
1.63
1.95
3.20
3.78
4.57
4.91
And complete the Table:
CLASS
LOWER CLASS LIMIT
UPPER CLASS LIMIT
FREQUENCY
1.00
1.69
5
1.70
2.39
6
2.40
3.09
3
3.10
3.79
6
3.80
4.49
3
4.50
5.19
5
10. Draw the histogram:
Histogram
8
7
Frequency
6
5
4
3
2
1
0
1.00
1.70
2.40
3.10
LCL
3.80
4.50
5.3
5.8
6.4
7.1
5.5
5.9
6.6
7.1
5.6
6.2
6.6
7.3
5.7
6.3
6.7
7.6
5.7
6.3
6.8
7.9

Stem Leaf Plot: Used for recording and
showing dispersion of data. Stem can be
the integer portion of a number and the
leaves the decimal portion. Or the stem
could be the tens digit and the leaves the
ones digit.
5-3,5,6,7,7,8,9
 6-2,3,3,4,6,6,7,8
 7-1,1,3,6,9


Dot Plot: Also used to show dispersion of
data. Draw a number line and label the
horizontal scale with the numbers from
the data from lowest to highest. Then
place a dot above the numbers each
time the number occurs.
*
*
*
* * * * *
* *
|___|___|___|___|___|___|___|___|___|___|
5.3 5.4 5.5 5.6 5.7 5.8 5.9 6.0 6.1 6.2 6.3

Polygon Plot: Line graph using the
midpoints for the x-axis and frequencies
for the y-axis. Both ends of the line must
come back to the 0 on the y-axis.
POLYGON GRAPH
FREQUENCY
10
8
6
4
2
0
0
2
4
6
8
10 12 14 16 18 20 22 24
CLASS MIDPOINT

Given a Polygon Plot, construct a
Frequency Distribution Table.
› 1. Find the Class Width: Difference in Midpoints
› 2. Find first two LCL’s: Midpoint +/- ½*Class
Width
› 3. Find First Upper Class Limit: 2nd LCL – 1
› Find remainder of LCL’s & UCL’s
› Find each class’s frequency

Ogive (pronounced oh jive) Plot: Line
Graph used for displaying Cumulative
Frequency Distributions. The x-axis is the
Upper Class Limit and the y-axis is the
Cumulative Frequency. The first point is a
class width less than the first Upper Class
Limit so that the line starts with a
frequency of 0.

Ogive Plot:
22
20
18
16
14
12
10
8
6
4
2
0
1.99
2.99
3.99
4.99
5.99
6.99
7.99
8.99
9.99
10.99
11.99
12.99
13.99

Time Series Plots: Can be vertical or
horizontal bar graphs, or line graphs. Xaxis is time intervals or ages (years,
months, days) and y-axis is frequency.
NORMAL DISTRIBUTION
UNIFORM DISTRIBUTION
7
6
8
FREQUENCY
FREQUENCY
10
6
4
2
5
4
3
2
1
0
0
A
B
C
GRADE
D
F
A
B
C
GRADE
D
F
SKEWED LEFT DISTRIBUTE
12
12
10
10
FREQUENCY
FREQUENCY
SKEWED RIGHT DISTRIBUTE
8
6
4
8
6
4
2
2
0
0
A
B
C
GRADE
D
F
A
B
C
GRADE
D
F
Vertical Scale Manipulation: Not starting
the y-axis at 0. Also using a break in the
scale. Can make differences look
bigger than they really are.
 Exaggeration of Bars or Symbols: Used in
pictographs.
 Horizontal Scale Manipulation: Not all
classes or time interval are the same
width.


“Get your facts first, then you can distort
then as you please” Mark Twain

“There are lies, damn lies, and
STATISTICS” Mark Twain

“Definition of Statistics: The science of
producing unreliable facts from reliable
figures.” Evan Esar