Sampling Distribution
Download
Report
Transcript Sampling Distribution
Counting in probability
Permutations
The number of orderings of different events
Combinations
The number ways that outcomes can be
grouped
1
Permutations
The number of orderings of different
events.
Three cards: AKQ
Four digits: 7412
AKQ, AQK, KQA, KAQ, QAK QKA = 6
24
Five symbols: # $ % & @
120
2
Three different cards: AKQ
AKQ, AQK, KQA, KAQ, QAK QKA = 6
Complete permutations
P N!
N
N
N! ( N )(N 1)(N 2)(2)(1)
P 3* 2 *1 6
3
3
3
Partial Permutations
Four Different Digits (of 10)
In the first spot: 10 different digits possible
In the second spot: 9 possible left
In the third spot: 8 possible left
In the last spot: 7 possible left
10*9*8*7 = 5040
Pr
N
N!
( N r )!
P410
10! 10 * 9 * 8 * 7 * 6 * 5 * 4 * 3 * 2 *1
6!
6 * 5 * 4 * 3 * 2 *1
4
Combinations
The number ways that outcomes can be
grouped
Calculate the number of combinations
of N objects taken r (r <= N) at a time.
Example: 13 cards (AKQ…2) taken 5 at
a time = 1287
N!
C
r!*(N r )!
N
r
13! 13 *12 *11 *10 * 9
5!*8!
5 * 4 * 3 * 2 *1
5
Review the problem
We want to be able to say something
about the population from a single
sample that we have drawn.
This is the problem of statistical
inference
What can we infer from our sample?
6
Our logic for proceeding
If the population has certain
characteristics,
then our sample will probably include
certain outcomes and probably not
include other outcomes
If our sample has outcomes that are
unlikely to come from that population, it
probably did not come from that
population
7
Our logic (continued)
If our sample has outcomes that are
unlikely to come from that population, it
probably did not come from that
population
Our conclusion must be either that the
population is as hypothesized or it is not
as hypothesized.
We reject or fail to reject the
hypothesized population characteristic
8
Once More
We reason from population
characteristics to probability distribution
of all possible samples -- the sampling
distribution.
We calculated this for the population of
7 black and 3 red marbles
We noted that the sampling distribution
began to look like a normal distribution
9
Sampling distributions
There are a few sampling distributions
that occur very frequently
binomial
normal
student’s t statistic
chi-square statistic
F statistic
10
Binomial
This is the distribution we have already
calculated for the red (p=.3) and black
(p=.7) marbles.
It is also the distribution for coin tosses
where p=.5 for heads and p=.5 for tails
It is the distribution for all binary
outcomes.
11
Normal
This is the distribution for many
empirical distribution.
This is the distribution which many other
distributions approach when working
with large numbers: binomial, t, chisquare, etc.
This is the distribution for the sum of a
set of random variables.
12
Normal distribution
characteristics
The formula
1
e
2
1 x x
2
2
13
Normal distribution
characteristics
The shape
.4
68%
norm
.3
.2
.1
0
-5
-4
-3
-2
-1
0
x
1
2
3
4
5
14
Normal distribution
characteristics
The shape
.4
95%
norm
.3
.2
.1
0
-3
-2
-1
0
x
1
2
3
15
Normal distribution
characteristicsMean
The shape
.4
One unit is
one std. dev.
norm
.3
.2
.1
0
-3
-2
-1
0
x
1
2
3
16
Why the normal distribution?
Central Limit Theorem
Sampling distribution of the mean
approaches normal distribution
Mean of sampling distribution
approaches mean of population
sd of sampling distribution approaches
population sd / sqrt(n) as n becomes
large
17
Thus
pop mean = 5, pop sd = 1
If n=4, sampling dist mean = 5, sd = .5
If n=9, sampling dist mean = 5, sd = .333
If n=25, samp dist mean = 5, sd = .20
If n=900, samp dist mean = 5, sd = .033
18
.4
Pop
norm
.3
.2
.1
0
2
3
4
5
x
6
7
8
.4
norm
.3
N=4
.2
.1
0
2
3
4
5
xx
6
7
8
.4
norm
.3
N=9
.2
.1
0
2
3
4
5
xxx
6
7
8
.4
N=900
norm
.3
.2
.1
0
2
3
4
5
xxxx
6
7
8
19