Section 9.1 Second Day
Download
Report
Transcript Section 9.1 Second Day
Heights
Put your height in inches on the front board.
We will randomly choose 5 students at a time
to look at the average of the heights in this
class. Look at averages of 20 samples from
the class. Is this a statistic or parameter?
Create a histogram of the averages you found.
What is the average height of the whole class?
Is this a statistic or parameter? How does it
compare to the center of your histogram?
Section 7.1
Second Day
Sampling Distributions
Recall…
What is the difference between a parameter
and a statistic?
Which symbol do I use for the following?
Population proportion?
Sample mean?
Population mean?
Sample standard deviation?
Sample proportion?
Population standard deviation?
Sampling Distributions
A sampling distribution of a statistic is the
distribution of values taken by the
statistic in all possible samples of the
same size from the population.
Describing Sampling
Distributions
We can use
the tools of
data analysis
to describe any
distribution,
including a
sampling
distribution.
Example
Let the population be all the numbers on
a die. Roll two dice (an SRS of two) and
find the average value for each set of
rolls. Complete this 20 times and make a
histogram. Describe the histogram.
Can we construct the ACTUAL
probability distribution? List all possible
outcomes and their x-bars. If we find the
mean of the x-bars, then this is the actual
mean.
So…
x
Bias
So far we’ve talked about the bias of a
sampling method. However, it is often
useful to talk about the bias of statistic.
When talking about a statistic, bias
concerns the center of the sampling
distribution.
The bias of a statistic
When a sampling distribution centers around
the true value of the parameter, we say it is
unbiased.
In other words, a statistic is unbiased if the
mean of its sampling distribution equals the
true value of the parameter being estimated.
There is no SYSTEMATIC tendency to under- or
overestimate the value of the parameter.
The variability of a
statistic
The variability of a statistic is described by the
spread of its sampling distribution.
LARGER SAMPLES GIVE SMALLER SPREAD.
Notice that this is saying that larger samples are
good. However, it says NOTHING about the size
of the population.
Q: Does the size of the population matter? A: No.
We usually require the population be ten times
larger than the sample. So if n = 20, population
should be at least 200.
A Note about Population
Size
Example: Suppose the Mars Company wants
to check that their M&Ms are coming out
properly (i.e. Not broken, not undersized, etc.).
It doesn’t matter if you select a random scoop
from a truckload or a large bin. (Meaning:
population size doesn’t matter)
As long as the scoop is selecting a random,
well-mixed sample, we’ll get a good picture of
the quality of M&Ms.
Bias vs. Variability
What does
the bull’s-eye
represent?
What do the
darts
represent?