Section 2.1 Second Day Transforming data
Download
Report
Transcript Section 2.1 Second Day Transforming data
+
Chapter 2: Modeling Distributions of Data
Section 2.1
Describing Location in a Distribution
The Practice of Statistics, 4th edition - For AP*
STARNES, YATES, MOORE
In Chapter 1, we developed a kit of graphical and numerical
tools for describing distributions. Now, we’ll add one more step
to the strategy.
Exploring Quantitative Data
1. Always plot your data: make a graph.
2. Look for the overall pattern (shape, center, and spread) and
for striking departures such as outliers.
3. Calculate a numerical summary to briefly describe center
and spread.
4.
Sometimes the overall pattern of a large number of
observations is so regular that we can describe it by a
smooth curve.
+
Curves
Describing Location in a Distribution
Density
Curve
A density curve is a curve that
•is always on or above the horizontal axis, and
•has area exactly 1 underneath it.
A density curve describes the overall pattern of a distribution.
The area under the curve and above any interval of values on
the horizontal axis is the proportion of all observations that fall in
that interval. AREA = PROBABILITY = PROPORTION.
The overall pattern of this histogram of
the scores of all 947 seventh-grade
students in Gary, Indiana, on the
vocabulary part of the Iowa Test of
Basic Skills (ITBS) can be described
by a smooth curve drawn through the
tops of the bars.
Describing Location in a Distribution
Definition:
+
Density
+
Mean and Median of a Density
Curve
Symmetric:
Mean = Median
Skewed
Left: Mean < Median
Skewed
Right: Mean > Median
The
median of a density curve is the equal-areas
point, where ½ of the area is to the left and ½ of the
area is to the right.
The
mean of a density curve is the balance point,
where the curve would balance if it were made of
solid material.
+
Examples
For each of
these density
curves, which
line represents
the mean? The
median?
+
Quartiles
For any density curve…
How much area is to the left of the first quartile?
How much area is to the right of the first quartile?
How much area is between the first and third quartiles?
+
The uniform distribution
The
uniform distribution is a special kind of
density curve. It looks like a rectangle.
It
has a constant height (i.e. horizontal line)
over some interval of values.
So, this density curve describes a variable
whose values are distributed evenly
(UNIFORMLY) over some interval of
values.
+
Example
Accidents on a level, 3-mile bike path occur uniformly along the
length of the path.
A) Show that this density curve satisfies the two requirements
for a density curve.
B) The proportion of accidents that occur in the first mile of the
path is the area under the density curve between 0 miles and 1
mile. What is this area?
C) Sue’s property adjoins the bike path between the 0.8 mile
mark and the 1.1 mile mark. What proportion of accidents
happen in front of Sue’s property?
+
Normal Distributions
Density
curves have an area = 1 and are always
positive.
Normal curves are a special type of density curves.
T/F All density curves are normal curves.
T/F All normal curves are density curves.
Characteristics of Normal Curves
Symmetric
Single-peaked (also called
unimodal)
Bell-shaped
μ
σ
The mean, μ, is located at the center of the
curve.
The standard deviation, σ, is located at the
inflection points of the curve.
Parameters of the Normal Curve
The
same way a line is
defined by its slope
and y-intercept, a
normal curve is
defined by its mean
and standard
deviation.
+
Why Be Normal?
Normal
curves are good descriptions for lots
of real data: SAT test scores, IQ, heights,
length of cockroaches (yum!).
Normal
curves approximate random
experiments, like tossing a coin many times.
Not
all data is normal (or even approximately
normal). Income data is skewed right.
Notation
We abbreviate the Normal distribution with mean µ and
standard deviation σ as N(µ,σ).
IQ scores on the WISC-IV are distributed Normally with a
mean of 100 and a standard deviation of 15.
IQ ~ N(100,15).
Women’s heights ~ N(64.5”, 2.5”)
+
Distributions
Normal Distributions
Normal
+
Remember z-scores?
What’s the formula to standardize a value?
If a person’s IQ has a z-score of 0, what does that mean?
What does a z-score of -1 mean?
On the normal curve, we draw 3 standard deviations on either
side of the mean.