Lecture 4 Measurement
Download
Report
Transcript Lecture 4 Measurement
Measure of Location and
Variability
Histogram
Multimodal
Mean, Median and Mode
Mean: average value
Sample mean: denoted by x-bar (English)
Population mean: Mu (Greek)
Median: value in the middle when
the data are arranged in ascending
order (smallest to largest).
odd number of observations: the middle value
even number of observations: the average of the two
middle values
Mean, Median and Mode II
Mode: the value that occurs with
greatest frequency (may be more
than one mode in a set of data).
Example:
Data: 1,1,2,2,2,2,3,3,4,4,5
Mode=2
Percentile
the pth percentile is the value such that at
least p percent of the observations are
less than or equal to this value and at
least (100-p) percent of the
observations are greater than or
equal to this value.
For example: If you scored in the 90th
percentile on the verbal part of your
SAT’s, this would mean that you scored
above 90% of all verbal scores taken for
the SAT’s at that time.
Percentile II
To calculate the pth percentile:
• arrange the data in ascending order
• compute an index i=(p/100)*n
• if i is not an integer, round up, the next
integer greater than i denotes the
position of the pth percentile
• if i is an integer the pth percentile is the
average of the values in positions i and
i+1.
Quartile
used to divide the data into 4 parts.
Q1: first quartile, 25th percentile
Q2: second quartile, 50th
percentile, median
Q3: third quartile, 75th
percentile
5 Number Summary
1.
2.
3.
4.
5.
Smallest value
first quartile (Q1)
Median (Q2)
third quartile (Q3)
Largest value
Variability
We have talked about measure of location.
• Mean, median, mode
Why do we need to look at variability?
• Data with same mean or median may have different
variability
• Example: two sets of quiz grades
11 12 14 15 17 18 20
mean=15.28571
7 11 12 18 19 19 20
mean=15.14286
The two means are similar but apparently the second
set is more spreadout greater variability
How to quantify variability
Range
Inter-Quantile Range (IQR)
Variance
• Population variance
• Sample variance
Standard Deviation (S.D., std dev)
• Population S.D.
• Sample S.D.
Coefficients of Variation
Range
Range=max — min
In our example:
• Set 1: Range=20-11=9
• Set 2: Range=20-7=13
Inter-Quantile Range
Set 1:
• 11 12 14 15 17 18 20
• Median=15, Q1=12, Q3=18
• IQR = 18-12=6
Set 2:
• 7 11 12 18 19 19 20
• Median: 18, Q1=11, Q2=19
• IQR=19-11=8
Boxplot
a graphical
summary of data
that is based on a
five-number
summary.
Example: