Pictures of data - davis.k12.ut.us

Download Report

Transcript Pictures of data - davis.k12.ut.us

Pictures of data
• Dot plot or line plot
• Histogram
• Box and Whiskers plot
Dot plot for the sugar in cereal
Common Shapes of a distribution
height of students in a class
income distribution
test scores from an easy test
The 5 number summary and the box and
whisker plot.
• A distribution can be broken up
into quarters. To do this you
need a 5 number summary
• Minimum q1 Median, q3,
Maximum
• To find the median simply
determine the number of data
points n, then add 1 and divide
𝑛+1
by 2
2
• Then count up from the
beginning.
• 3,4,6,8,9,11,12,14,
8+1
2
= 4.5
• Halfway between 8 and 9 or 8.5
Analyzing a box plot
Box plot for sugar in breakfast cereals
• Data distribution
• 1,3,4,4,4,5,5,6,7,7,7,7,8,10,10,11,11,12,13,22
Minimum
Q1
Median
Q3
Maximum
Box plot video
• http://www.learner.org/vod/vod
_window.html?pid=3139
Making a histogram
Number of times you visited a
store during Christmas break
0,0,1,1,2,2,2,3,3,4,5,6,8,8,9,10
• Determine intervals
•0≤𝑥<2
• 2≤ 𝑥 <4
•4≤𝑥<6
• 6≤ 𝑥 < 8
• 8 ≤ 𝑥 < 10
Analzying a histogram
• Shape
• Outliers
• Center
• Spread
Analysis
• Shape
• Outliers
• Center
• Spread
Histogram video
• http://www.learner.org/vod/vod
_window.html?pid=3137
Analysis
• Shape
• Outliers
• Center
• Spread
Analysis
• Shape
• Outliers
• Center
• Spread
4 things to look for in a distribution
• Shape symmetric, left skew,
right skew
• Outliers; data that lies outside
the range of most of the data
• Center; median, mean
• Spread; IQR, Standard deviation,
range How much the data is
spread out.
8.2 Measures of center
Mean
• Find the mean for the amount of
sugar in the box of a cereal from
the 8.1 data set given below.
• 1,,3,4,4,4,5,5,6,7,7,7,7,8,10,10,1
1,11,12,13,22
• Compare the mean to the
median.
• Explain the difference between
the mean and the median.
Compare the two measures of center. Page
475
Analysis mean or median
Analysis mean or median
Analysis mean or median
• What will determine if you use
the mean or the median to
represent the center of a data
distribution?
• What kind of shape does a
distribution have if the mean
and the median are the same.
• What kind of shape does a
distribution have if the mean is
bigger than the median?
• What kind of shape does a
distribution have if the mean is
smaller than the median?
Analysis practice
What kind of distribution?
Symmetric, left skewed, right skewed
• Mean
102
• Median 103
What kind of distribution?
Symmetric, left skewed, right skewed
• Mean 77
• Median 52
What kind of distribution?
Symmetric, left skewed, right skewed
• Mean 48
• Median 61
Outliers by Malcome Gladwell
• The main tenet of Outliers is that there is a logic behind why some people become
successful, and it has more to do with legacy and opportunity than high IQ. In his latest
book, New Yorker contributor Gladwell casts his inquisitive eye on those who have risen
meteorically to the top of their fields, analyzing developmental patterns and searching
for a common thread. The author asserts that there is no such thing as a self-made man,
that "the true origins of high achievement" lie instead in the circumstances and
influences of one's upbringing, combined with excellent timing. The Beatles had
Hamburg in 1960-62; Bill Gates had access to an ASR-33 Teletype in 1968. Both put in
thousands of hours-Gladwell posits that 10,000 is the magic number-on their craft at a
young age, resulting in an above-average head start.
Gladwell makes sure to note that to begin with, these individuals possessed once-in-ageneration talent in their fields. He simply makes the point that both encountered the
kind of "right place at the right time" opportunity that allowed them to capitalize on
their talent, a delineation that often separates moderate from extraordinary success.
This is also why Asians excel at mathematics-their culture demands it. If other countries
schooled their children as rigorously, the author argues, scores would even out.
8.3 IQR and outliers
Does the distribution have any outliers
outliers? Explain.
Answer questions 1-5 with the following box
plots.
8.4 Standard deviation
• Joe score’s the following points
in each of 5 basketball games
• 6,8,10,12,14
• Sally score’s the following points
in each of 5 basketball games.
• 2,6,10,14,18
• Determine the mean, median
and range.
• Standard deviation is the
average distance from the mean.
• Since it is measuring the
distance from the mean and the
mean only works for symmetric
distributions, the standard
deviation should only be used
for symmetric distributions.
Average distance
from mean data
value - mean
6-10 = -4
16
8 -10 = -2
4
10-10 = 0
0
12 -10 = 2
4
14-10 = 4
16