Probability and the Sampling Distribution

Download Report

Transcript Probability and the Sampling Distribution

Probability and the
Sampling Distribution
Quantitative Methods in HPELS
440:210
Agenda
Introduction
 Distribution of Sample Means
 Probability and the Distribution of Sample
Means
 Inferential Statistics

Introduction

Recall:
 Any raw score can be converted to
 Provides location relative to µ and 
a Z-score
 Assuming NORMAL distribution:
 Proportion relative to Z-score can be determined
 Z-score relative to proportion can be determined
 Previous
examples have looked at single data points
 Reality  most research collects SAMPLES of
multiple data points

Next step  convert sample mean into a Zscore
 Why? Answer
probability questions
Introduction

Two potential problems with samples:
1.
Sampling error

2.
Variation between samples


Difference between sample and parameter
Difference between samples from same taken
from same population
How do these two problems relate?
Agenda




Introduction
Distribution of Sample Means
Probability and the Distribution of
Sample Means
Inferential Statistics
Distribution of Sample Means

Distribution of sample means = sampling
distribution is the distribution that would occur
if:



Properties:




Infinite samples were taken from same population
The µ of each sample were plotted on a FDG
Normally distributed
µM = the “mean of the means”
M = the “SD of the means”
Figure 7.1, p 202
Distribution of Sample Means
Sampling error and Variation of Samples
 Assume you took an infinite number of
samples from a population

 What
would you expect to happen?
 Example 7.1, p 203
Assume a population consists of 4 scores (2, 4, 6, 8)
Collect an infinite number of samples (n=2)
Total possible outcomes: 16
p(2) = 1/16 = 6.25%
p(3) = 2/16 = 12.5%
p(4) = 3/16 = 18.75%
p(5) = 4/16 = 25%
p(6) = 3/16 = 18.75%
p(7) = 2/16 = 12.5%
p(8) = 1/16 = 6.25%
Central Limit Theorem
any population with µ and , the
sampling distribution for any sample
size (n) will have a mean of µM and a
standard deviation of M, and will
approach a normal distribution as the
sample size (n) approaches infinity
 If it is NORMAL, it is PREDICTABLE!
 For
Central Limit Theorem

The CLT describes ANY sampling
distribution in regards to:
Shape
2. Central Tendency
3. Variability
1.
Central Limit Theorem: Shape
All sampling distributions tend to be
normal
 Sampling distributions are normal when:

 The
population is normal or,
 Sample size (n) is large (>30)
Central Limit Theorem: Central Tendency
The average value of all possible sample
means is EXACTLY EQUAL to the true
population mean
 µM = µ
 If all possible samples cannot be
collected?
µM approaches µ as the number of
samples approaches infinity

µ = 2+4+6+8 / 4
µ=5
µM = 2+3+3+4+4+4+5+5+5+5+6+6+6+7+7+8 / 16
µM = 80 / 16 = 5
Central Limit Theorem: Variability

The standard deviation of all sample means
is denoted as M
 M
= /√n
 Also known as the STANDARD
ERROR of the MEAN (SEM)
Central Limit Theorem: Variability

SEM
Measures how well statistic estimates
the parameter
The amount of sampling error between
M and µ that is reasonable to expect by
chance
Central Limit Theorem: Variability

SEM decreases when:
 decreases
 Sample size increases
 Population

M = /√n
Other properties:
 When
 As
n=1, M
=  (Table 7.2, p 209)
SEM decreases the sampling distribution
“tightens” (Figure 7.7, p 215)
Agenda
Introduction
 Distribution of Sample Means
 Probability and the Distribution of Sample
Means
 Inferential Statistics

Probability  Sampling Distribution

Recall:
 A sampling
distribution is NORMAL and
represents ALL POSSIBLE sampling
outcomes
 Therefore PROBABILITY QUESTIONS can
be answered about the sample relative to the
population
Probability  Sampling Distribution




Example 7.2, p 209
Assume the following about SAT scores:
 µ = 500
  = 100
 n = 25
 Population  normal
What is the probability that the sample mean
will be greater than 540?
Process:
1.
2.
3.
4.
Draw a sketch
Calculate SEM
Calculate Z-score
Locate probability in normal table
Step 1: Draw a sketch
Step 2: Calculate SEM
Step 3: Calculate Z-score
Step 4: Probability
SEM = M = /√n
Z = 540 – 500 / 20
Column C
SEM = 100/√25
Z = 40 / 20
p(Z = 2.0) = 0.0228
SEM = 20
Z = 2.0
Agenda
Introduction
 Distribution of Sample Means
 Probability and the Distribution of Sample
Means
 Inferential Statistics

Looking Ahead to Inferential Statistics

Review:
 Single

raw score  Z-score  probability
Body or tail
 Sample


mean  Z-score  probability
Body or tail
What’s next?
 Comparison
method
of means  experimental
Textbook Assignment
Problems: 13, 17, 25
 In your words, explain the concept of a
sampling distribution
 In your words, explain the concept of the
Central Limit Theorum
