Sampling-Fundamentals

Download Report

Transcript Sampling-Fundamentals

Sampling Fundamentals
1
Basic Concepts

Population: the entire group under study (or of interest)
Exercise: Define population for a study seeking to assess
SUU student attitudes towards a) program quality and
delivery, b) program content, and c) social environment.

Sample: subset of the population
Used to represent the population

Sample unit (elements): basic unit investigated (choose
sampling units/elements when sampling)
Individuals, households, etc.

Census: data collected from EVERYONE in population
2
Basic Concepts (continued)

AGAIN: total error = sampling error + nonsampling error
Sampling error: error due to taking a sample (+/-zs)
Nonsampling error: everything else (measurement, data
analysis, etc.)
Sample frame: list from which the sample is selected
 Sample frame error: Pop’n members not in frame, and
members in frame not in pop’n of interest

3
Reasons for Sampling
 Cost
 Too much information to handle
 Sampling can be more accurate
Nonsampling errors can overwhelm reduction in
sampling errors
– Sampling work behaviors example
– Census Bureau
 Time problem
4
Developing a sampling plana
 1. Define the population of interest.
 2. Choose a data-collection method (mail,
telephone, Internet, intercept, etc.).
 3. Identify a sampling frame.
 4. Select sampling method
 5. Determine sample size.
 6. Develop operational procedures for selecting
sampling elements/units.
 7. Execute the operational sampling plan.
5
PROBABILITY SAMPLING METHODS
Each member of population has a ‘known’
probability of being selected
 Simple Random Sampling: Each member
has an equal probability of being selected
Blind Draw Method
Table of Random Numbers
Useful for small samples, when Random Digit
Dialing (or +1) is appropriate, and computerized lists
6
PROBABILITY METHODS (Cont’d)
 Stratified Sampling: Population is
segmented (stratified), and then samples are
chosen from each strata using some other
method
Can be more efficient (smaller sampling error)
– Homogeneous within, heterogeneous without
Useful when interested in different strata (e.g., small
numbers, etc)
Disproportionate versus proportionate
7
PROBABILITY METHODS (Cont’d)
 Cluster Sampling: Population is divided into
groups, or clusters, and then clusters are
randomly chosen.
Homogenous without, heterogeneous within
Every unit in cluster examined, OR
A Random (or systematic) sample is taken from
chosen cluster (2-stage or 2-step approach)
Careful with the probabilities!
8
PROBABILITY METHODS (Cont’d)
 Systematic Sampling: Randomly choosing a
starting point and then choosing every nth member.
Example: Need 52 data points (daily sales) for a
year
– Skip interval = 365/52=7.01
– Randomly choose 1 day out of first 7, then choose every
7th one after that.
Variation: Choose every nth visitor
9
NONPROBABILITY SAMPLING
METHODS
Probability of selection not known, and hence
representativeness cannot be assessed
Technically, confidence intervals, H0 tests, etc. not
appropriate
 Convenience Samples:
Shopping mall intercepts, classes asked to fill out
questionnaires, etc.
 Judgment Samples: Someone puts together what is
believed to be a relatively representative sample
Ex.: Test markets
10
Nonprobability Sampling (Cont’d)
 Referral (or Snowball) Samples
 Quota Samples
EXAMPLE: Choose sampling units so their
representation equals their frequency in the pop’n
(e.g., 52% females, 48% males)
11
Identifying the Target Population
Determining the Sampling Frame
Reconciling the
Population, Sampling
Frame Differences
Selecting a Sampling Frame
Probability
Sampling
Non-Probability
Sampling
Determining the Relevant Sample Size
The Sampling
Process
Execute Sampling
Data Collection From Respondents
Handling the NonResponse Problem
Information for Decision-Making
12
Nonresponse Bias

Reason for nonresponse:
Refusal
Lack of ability to respond
Not at home
Inaccessible

Handling nonresponse
Improve research design
Call-backs
Estimate effects
– Sample nonrespondents; trends
13