Class_7_-_Sampling

Download Report

Transcript Class_7_-_Sampling

Sampling
Class 7
Goals of Sampling



Representation of a population
Representation of a specific phenomenon or
behavior that is infrequent in the population
Ensuring sufficient power for statistical
analysis
Types of Samples

Probability Samples





Simple Random Samples
Stratified Random Samples
Cluster Samples
Matched Samples (Case Controls)
Non-Probability Samples




Systematic Samples
Quota Samples
Purposive Samples
Theoretical Samples
Simple Random Samples


We use simple random samples when we don't know
how a phenomenon is distributed in the population, or
when we assume that the probability of an event is
equal for all persons in the population, or when we
assume that the population characteristics that may bear
on the phenomena being studies are evenly distributed
among the population (EPSEM)
Examples
−
−
−
Monitoring the Future – annual survey of high school youths
News Polls
General Social Survey
Stratified Random Samples


We use stratified random samples when we believe that these
population characteristics are not evenly distributed; in that case
a random sample would not ensure representativeness of the
population. Stratification means that we sample first by identify
specific population characteristics or groups, and then sampling
cases within each groups.
Examples





School research
Selection of stratifying variables?
Theoretical concerns
Demographic concerns
We oversample when we need sufficient cases of a population
that has a low base rate in the overall population, and when even
stratification procedures may not yield sufficient cases for
comparison of these groups

Example – Adolescent Health – oversample kids who engage in risky
behavior
Cluster Samples


Cluster samples are used when subjects are widely
dispersed spatially or socially. Thus, we identify the social
or spatial units first, take a sample of these, and then
sample specific subjects within each of the social or
spatial units. This method is called a multi-stage cluster
sampling procedure
Example: Lawyer Satisfaction Study



Stratify by type of practice and area of law, (e.g. oversample
patent lawyers),
BUT let other characteristics (e.g., demographics) vary
naturally
Question – how, in this example, should we deal with years of
practice?
Case Controls

Case Controls


Matched Samples
Matched Cases



Fair Housing Checks
Employment Discrimination Checks
Matching on other sampling units?

Schools, Job Types, Type of dwelling
Non-Probabilistic Samples

Systematic Samples


Convenient but flawed. You sample based on a consistent
parameter but with a sample whose representation to the
population is uncertain. The most well know examples is
election exit polls, or market research at a shopping mall.
Quota Sample

Ensures adequate representation of specific groups, but not
with the goal of constructing a representative population.
Generally useful when phenomena is not randomly
distributed but concentrated, or when practical issues prevent
other probability-based techniques.

Example – survey of second-generation immigrants

Purposive Samples
Useful in generalizing to a specific phenomenon
when the independent variable is not widely distributed.
For example, we may want to look at the effects of
particular occupations on job satisfaction, but these
occupations may be rare (eg., driving instructors,
stenographers). We sample by identifying these
individuals and conducting observations on as many
as are needed to make valid statistical inferences.
 Examples:




People with unusual jobs (e.g., driving instructors, stenographers)
Consumers of unusual products
Persons with rare diseases

Theoretical Samples (a.k.a., snowball samples)
Sampling on the dependent variable when it is not
widely distributed and its population parameters are
unknown (precluding other sampling techniques).
 Examples:



People engaged in rare and hard-to-find behaviors
These raise problems in inference, but there are
considerable strengths in internal validity
Technological Samples (?)



How valid are net-based surveys?
 Sample Frame
 What do such samples represent? Who is missing?
Case-Study – Knowledge Networks (www.knowledgenetworks.com )
 Random-digit dialing telephone methodology to recruit sample
 Uses know probabilities of selection associated with geographical
locations
 Confirmation by Express Mail delivery and instructions for telephone
enrollment
 Panel is about 40,000 members
 Average stay of 3 years on the panel
 Participating households get free hardware, software, Internet service,
email accounts, and technical support
 Member households get about one multimedia survey per month and
three total per month
 Commercial and academic surveys
Advantages?
 Lack of interviewer reduces bias
 Broader range of stimulus materials
Issues in Sample Construction


Sample attrition and mortality
Sample size



Over-samples to compensate for low base rates or specific
theoretical questions
Practical limitations in sampling
Sample error

The degree of error for a particular sampling design
s
PxQ
n
Where P and Q are parameters, n=sample size, and
s = standard error


http://www.dssresearch.com/toolkit/secalc/error.asp
Sample weighting
Power Considerations




Power is the ability of a test to detect relationships that exist in the
population
Statistical Power is is generally defined as the ability of a design to reject the
null hypothesis when it is false. In other words, power gives you the
probability of not making a Type II Error.
Therefore, Power = 1- β
When a study has low power, effect size estimates will be less precise (have
wider confidence intervals) and we may incorrectly conclude that the cause
and effect do not covary.
−
−
−



Type I Error – False Positive (α)
Type II Error – False Negative (β)
Power = 1- β
It’s easy to get statistical significance with a large sample, but it’s not terribly
important (theoretically) if the effect size is quite small
The (hypothetical) effect size is determined from what is practically or
theoretically important and significant. So, you want to specify a difference
between groups that is meaningful, that is worth detecting.
Most statisticians agree that a power of less than 0.80 suggests a weakness in
the sampling design of a study.