MATH30-6 Lecture 2

Download Report

Transcript MATH30-6 Lecture 2

Data Presentation
MATH30-6
Probability and Statistics
Objectives
At the end of the lesson, the students are expected to
• Identify and learn various ways of presenting data;
• Describe data through tables, graphs, and charts;
• Describe and interpret data presented in various
charts; and
• Practice different ways or presenting data.
Types of Data Presentation
• Textual Form
- Data presentation using sentences and paragraphs in
describing data
• Tabular Form
- Data presentation that uses tables arranged in rows
and columns for various parameters
• Graphical Form
- Pictorial representation of data
Grouped and Ungrouped Data
• Ungrouped Data
- Data points are treated individually.
• Grouped Data
- Data points are treated and grouped according to
categories.
Data Presentation
Stem-and-Leaf Diagram
Steps to construct a stem-and-leaf diagram:
1. Divide each number xi into two parts: a stem,
consisting of one or more of the leading digits, and a
leaf, consisting of the remaining digit.
2. List the stem values in a vertical column.
3. Record the leaf for each observation beside its stem.
4. Write the units for stems and leaves on the display.
Stem-and-Leaf Diagram
Example:
1. Express the following data as a stem-and-leaf diagram
with the tens digit as the stems and the ones digit as the
leaves.
12, 23, 12, 11, 10, 25, 29, 39, 31, 43, 42, 54,
53, 53, 56, 57, 56, 67, 54, 65, 76, 76, 75, 74
Data Presentation
Frequency Distribution Table
Numerous data can be analyzed by grouping the data
into different classes with equal class intervals and
determining the number of observations that fall within
each class. This procedure is done to lessen work done
in treating each data individually by treating the data by
group.
Frequency Distribution Table
Class limits
- The smallest and the largest values that fall within
the class interval (class)
- Taken with equal number of significant figures as the
given data.
Class boundaries (true class limits)
- More precise expression of the class interval
- It is usually one significant digit more than the class
limit.
- Acquired as the midpoint of the upper limit of the
lower class and the lower limit of the upper class
Frequency Distribution Table
Frequency
- The number of observations falling within a particular
class.
- Counting and tallying
Class width (class size)
- Numerical difference between the upper and lower class
boundaries of a class interval.
Class mark (class midpoint)
- Middle element of the class
- It represents the entire class and it is usually
symbolized by x.
Frequency Distribution Table
Cumulative Frequency Distribution
- can be derived from the frequency distribution and can
be also obtained by simply adding the class frequencies
- Partial sums
Types of Cumulative Frequency Distribution
- Less than cumulative frequency (<cf) refers to the
distribution whose frequencies are less than or below
the upper class boundary they correspond to.
- Greater than cumulative frequency (>cf) refers to the
distribution whose frequencies are greater than or
above the lower class boundary the correspond to.
Frequency Distribution Table
Relative Frequency
- Percentage frequency of the class with respect to the
total population
- For presenting pie charts
Relative Frequency (%rf) Distribution
- The proportion in percent the frequency of each class
to the total frequency
- Obtained by dividing the class frequency by the total
frequency, and multiplying the answer by 100
Frequency Distribution Table
Class Interval
Frequency
x
LCB
UCB
<cf
>cf
%rf
Frequency Distribution Table
Steps in Constructing a Frequency Distribution Table (FDT)
1. Get the lowest and the highest value in the
distribution. We shall mark the highest and lowest
value in the distribution.
2. Get the value of the range. The range denoted by R,
refers to the difference between the highest and the
lowest value in the distribution. Thus,
R = H ─ L.
Frequency Distribution Table
3. Determine the number of classes. In the
determination of the number of classes, it should be
noted that there is no standard method to follow.
Generally, the number of classes must not be less than
5 and should not be more than 15. In some instances,
however, the number of classes can be approximated
by using the relation
𝑘 = 1 + 3.322 log 𝑛 (Sturges’ Formula),
where k = number of classes and n = sample size.
is
the ceiling operator (meaning take the closest integer
above the calculated value).
Square root principle: 𝑘 = 𝑛
Frequency Distribution Table
4. Determine the size of the class interval. The value of C
can be obtained by dividing the range by the desired
number of classes. Hence, 𝐶 = 𝑅 𝑘.
5. Construct the classes. In constructing the classes, we
first determine the lower limit of the distribution. The
value of this lower limit can be chosen arbitrarily as
long as the lowest value shall be on the first interval
and the highest value to the last interval.
Frequency Distribution Table
6. Determine the frequency of each class. The
determination of the number of frequencies is done
by counting the number of items that shall fall in each
interval.
Frequency Distribution Table
2. Using the steps discussed, construct the frequency
distribution of the following results of a test in statistics of
50 students given below.
88
62
63
88
65
85
83
76
72
63
60
46
85
71
67
75
78
87
70
43
63
90
63
60
73
55
62
62
83
79
78
43
51
56
80
90
47
48
54
77
86
55
76
52
76
43
52
72
43
60
Frequency Distribution Table
2. Using the steps discussed, construct the frequency
distribution of the following results of a test in statistics of
50 students given below.
Answer:
88 62 63 88 65
Class Interval
Frequency
85
83
76
72
63
43-49
7
60
46
85
71
67
50-56
7
75
78
87
70
43
57-63
10
63
90
63
60
73
55
62
62
83
79
64-70
3
78
43
51
56
80
71-77
9
90
47
48
54
77
78-84
6
86
55
76
52
76
85-91
8
43
52
72
43
60
Frequency Distribution Table
3. The following are the scores of 40 students in a Math
quiz. Prepare a frequency distribution for these scores
using a class size of 10.
22
31
55
76
48
49
50
85
17
38
92
62
94
88
72
65
63
25
88
88
86
75
37
41
76
64
66
58
66
76
52
40
42
76
29
72
59
42
54
62
Frequency Distribution Table
3. The following are the scores of 40 students in a Math
quiz. Prepare a frequency distribution for these scores
using a class size of 10.
Answer:
22 31 55 76 48 49 50 85 17 38
Class Interval Frequency
17-26
3
27-36
2
37-46
6
47-56
6
57-66
9
67-76
7
77-86
2
87-96
5
92
62
94
88
72
65
63
25
88
88
86
75
37
41
76
64
66
58
66
76
52
40
42
76
29
72
59
42
54
62
Frequency Distribution Table
4. The thickness of a particular metal of an optical
instrument was measured on 121 successive items as they
came off a production line under what was believed to be
normal conditions. The results are shown in Table 4.5.
Frequency Distribution Table
4. Answer
Data Presentation
Graphical Form of Frequency Distribution
Frequency Polygon
- Line graph
- The points are plotted at the midpoint of the classes.
Histogram (Frequency Histogram or Relative Frequency
Histogram)
- Bar graph
- Plotted at the exact lower limits of the classes
Data Presentation
Graphical Form of Frequency Distribution
Ogive
- Line graph
- Graphical representation of the cumulative frequency
distribution
- The < ogive represents the <cf while the > ogive
represents the >cf.
Data Presentation
5. Construct a frequency polygon, histogram, and ogives
of the given distribution.
Class Interval
Frequency
25-29
1
30-34
1
35-39
5
40-44
8
45-49
15
50-54
4
55-59
4
60-64
3
65-69
4
70-74
3
75-79
2
Data Presentation
In the preparation of a polygon, the frequency values are
always plotted on the y-axis (vertical) while the classes are
plotted on the x-axis (horizontal). Here we use the class
midpoints.
Frequency Polygon
16
15
14
13
12
Frequency (f)
11
10
9
8
7
6
5
4
3
2
1
0
17
22
27
32
37
42
47
52
57
Class Midpoint (x)
62
67
72
77
82
87
Data Presentation
The preparation of the histogram is similar to the construction
of the frequency polygon. While the frequency polygon is
plotted using the frequencies against the class midpoints, the
histogram is plotted using the frequencies against the exact limit
Frequency Histogram
of the
classes.
16
15
14
13
12
11
Frequency (f)
10
9
8
7
6
5
4
3
2
1
0
19.5
24.5
29.5
34.5
39.5
44.5
49.5
54.5
Exact Class Limit
59.5
64.5
69.5
74.5
79.5
Data Presentation
Frequency Histogram
Frequency Histogram
16
15
14
13
12
11
Frequency (f)
10
9
8
7
6
5
4
3
2
1
0
22
27
32
37
42
47
52
57
Class Midpoint (x)
62
67
72
77
82
Data Presentation
Ogives
Ogives
55
50
Cumulative Frequency (CF)
45
< ogive
> ogive
40
35
30
25
20
15
10
5
0
19.5
24.5
29.5
34.5
39.5
44.5
49.5
54.5
Class Boundary (CB)
59.5
64.5
69.5
74.5
79.5
84.5
Data Presentation
6. Construct a frequency polygon, histogram, and ogives
of the frequency distribution from problem #2.
Class Interval
Frequency
43-49
7
50-56
7
57-63
10
64-70
3
71-77
9
78-84
6
85-91
8
Summary
• Stem-and-leaf diagram is one way of data presentation
tabular form.
• Frequency distribution can be depicted in two ways:
tabular and graphical (frequency polygon, histogram,
and ogives) forms.
References
• DeCoursey. Statistics and Probability for Engineering
Applications with Microsoft® Excel © 2003
• Montgomery and Runger. Applied Statistics and
Probability for Engineers, 5th Ed. © 2011