Slides - CIMPOD Conference 2016
Download
Report
Transcript Slides - CIMPOD Conference 2016
Threats to validity in observational studies
Jay S. Kaufman, PhD
McGill University, Montreal QC
25 February 2016
11:05 AM – 11:45 AM
National Academy of Sciences
2101 Constitution Ave NW,
Washington, DC 20418 USA
Statistical models are used to estimate relationships
between variables in observational data sets.
Y
Y
β1
β0
X
0
X
1
Three main inferential targets of these models:
1) Real world in the present
surveillance, descriptive study
2) Real world in the future
clinical prediction model
3) Hypothetical world in the future
causal inference, etiologic study
The inferential target determines the adjustment
strategy.
Most people here are interested in 3)
If you are trying to estimate the causal effect of a
treatment, your job is to
what would happen
in the
if you did
compared to what
would happen if you did
.
To do this from observational data, you must often
adjust statistically for factors that are associated with
the treatment and the outcome.
You observe:
Pr(Y|X=x)
You want to know:
Pr(Y|SET[X=x])
This is the intervention you want to know about, but
unfortunately you don’t really get to “SET” anything.
The adjustment tradition in statistics exists
to link these two quantities:
Z
X
Pr(Y|X=x) Pr(Y|SET[X=x])
Y
BUT!
ΣPr(Y|X=x, Z=z)Pr(Z=z) = Pr(Y|SET[X=x])
Read:
as:
Pr(Y|SET[X=x])
Pr(Y|SET[X=x1]) versus Pr(Y|SET[X=x2])
x1 and x2 are the levels at which you intervene to set
the treatment; contrast is usually a difference or ratio.
Causal inference from passively observed data requires
not just structural identification, but also:
positivity (there are sufficient data available on the
treatment and outcome in the range of interest)
consistency (the way that people came to be treated in
the data set is comparable to the way that you plan to
treat them in your intervention)
correct specification of statistical models
Three main structural threats to validity:
Confounding Bias
Z
X
Selection Bias
Information Bias
Y
Z
X
Y
X*
Y*
X
U
Y
If you didn’t get to Z by one pathway, you are more
likely to have gotten there via the other pathway:
Z
A
A
B
A
π
B
π
A
B|Z
Z
B
Hernán MA, Hernández-Díaz S, Robins JM. A structural approach
to selection bias. Epidemiology 2004 Sep;15(5):615-25.
If you didn’t get to Z by one pathway, you are more
likely to have gotten there via the other pathway:
smoking
Clinical Diagnosis
genetic mutation
smoking
Clinical Diagnosis
genetic mutation
Cole SR, et al. Illustrating bias due to conditioning on a collider.
Int J Epidemiol. 2010 Apr;39(2):417-20.
Banack HR, Kaufman JS. The "obesity paradox" explained. Epidemiology 2013; 24:
461-2.
Banack HR, Kaufman JS. The obesity paradox: understanding the effect of
obesity on mortality among individuals with CVD. Prev Med 2014; 62: 96-102.
Banack HR, Kaufman JS. Does selection bias explain the obesity paradox among
individuals with cardiovascular disease? Ann Epidemiol 2015 May;25(5):342-9.
Lajous M, Banack HR, Kaufman JS, Hernán MA. Should patients with chronic
disease be told to gain weight? Am J Med 2015;128(4):334-6.
Canto et al. JAMA 2011;306(19):2120-2127.
5 major CHD
risk factors:
Hypertension
Smoking
Dyslipidemia
Diabetes
Family hx of CHD
25% of
original
cohort
Pre-hospital MI
mortality is a collider,
making risk factors
associated with
every U among those
selected (S).
Effect estimators
will thus be
biased if any U is
not controlled.
U
Risk
Factors
Pre-Hosp
Mortality
MI
Hospitalization
S
Post
Hosp
Mortality
So, in contrast to previous descriptions, bias will exist
even if MI hospitalization is not confounded.
Flanders WD, et al. A Nearly Unavoidable Mechanism for Collider Bias with
Index-Event Studies. Epidemiology 2014 Sep;25(5):762-4.
Therefore, selection bias results in this
example from:
1) Recruiting MI hospitalized patients into the study when
there are common unmeasured causes of MI
hospitalization and mortality
2) Removal of the frailest people via pre-hospital mortality
(maybe around 30%?)
3) Removal of 75% of the hospitalized cohort with prior
CVD diagnosis or transfer within 30 days
This is easily enough to produce a paradoxical reverse
association in which the risk factors erroneously appear
protective, even if there is no individual whose risk is
lowered by the presence of one of these factors.
Some additional selection bias structures:
Treatment
Censoring
Death
Symptoms
U
Unmeasured variable U represents underlying disease
severity, and those with more severe disease have a
greater risk of death. Patients with more severe disease
are more likely to be censored because they are unwell.
Patients receiving treatment are at a greater risk of
experiencing side effects, which also lead to drop-out.
Some additional selection bias structures:
Treatment
Symptoms
U
Censoring
Death
In this variation of the previous structure, treatment
and underlying severity both affect symptoms, which
in turn affects drop-out. The censoring as a function
of symptoms, which is affected by both treatment and
U, creates the same conditional dependency.
Other mechanisms of selection bias:
• Differential loss to follow-up, also known as
“informative censoring”
• Missing data bias, nonresponse bias: Censoring can
represent missing data on the outcome for any reason,
not just as a result of loss to follow up.
• Healthy worker bias: Effect of an occupational chemical in
a factory. Unmeasured illness is predictive of death and of
missing work, but only subjects at work are recruited.
• Self-selection bias, volunteer bias
• Selection affected by treatment received before
study entry (left-truncation)
Survival Produces an Unavoidable Selection Bias:
Start out with a randomized trial so that all covariates
are balanced at time 0.
Once events occur, if you condition your estimate on
having survived to the next time point, every other
cause of disease must now be correlated with exposure.
Genetic
Variant
Treat
ment
Genetic
Variant
Death
Treat
=0
ment
time 1
Death
=?
time 2
Flanders WD, Klein M. Properties of 2 counterfactual effect definitions
of a point exposure. Epidemiology 2007; 18(4):453-60.
This is exactly why the HAZARD RATIO (the parameter
estimated by a Cox Proportional Hazards Model) should
not be used (unless the outcome is rare):
The hazard of death at time 1 is the probability of dying at
time 1. But the hazard at time 2 is the probability of dying
at time 2 among those who survived past time 1:
Treatment
Y1
Y2
U
Treated survivors of time 1 differ in their distribution of U
compared to untreated survivors of time 1, making this
conditional measure confounded by U in a way that a marginal
measure is not. This concern applies to both observational
studies and randomized experiments.
Why do we continue to base
inference on so many confusing
studies that use highly selected
samples, such as diagnosed
patients?
There is a simple design
concept to avoid this mess…
An important step in eliminating “obesity paradox” and
similar selection biases is just to ensure that the start of
exposure and the start of follow-up coincide.
That is exactly how we analyze randomized clinical trials:
Nobody would ever propose an RCT that would select
individuals free of disease 5 years after randomization and
then compare the disease incidence between arms only from
that point forward.
A simple rule of ensuring that the start of follow-up and
initiation of treatment coincide is natural in RCTs, but often
overlooked when analyzing observational studies.
For example, widespread confusion about the cardiovascular
effects of hormone therapy resulted from observational
analyses that effectively ignored the first few years of
follow-up by comparing prevalent users versus never users.
Hernán MA, Robins JM. Observational Studies Analyzed Like Randomized
Experiments: Best of Both Worlds. Epidemiology 2008;19:789-92.
Admittedly, this rule is hard to apply to exposures like
obesity that lack a clear onset, but should be very clear
for medical and pharmacological interventions.
Then estimate risk of outcome at each follow-up time,
without conditioning on survival up to that point (just
comparing to the baseline denominator)
Causal effect estimate is the difference between
covariate-standardized survival curves at time t
Hernán MA, The hazards of hazard ratios. Epidemiology 2010;21(1):13-5.
To the extent that confounding and selection bias are
due to measured covariates C, these can be handled by
inverse weighting (IPTW, IPCW)
This is especially convenient for
longitudinal data in which the
confounder C may be effected
by previous treatment Xt and
may in turn influence the next
dose of treatment Xt+1.
C
X
Y
Z
C
It is also helpful in the longitudinal
X
Y
setting where the remaining cohort
at each time t becomes increasing selected. Reweighting
the cohort by measured characteristics allows remaining
subjects to proxy for the ones that are missing.
Summary:
Models are used to parameterize associations between
treatment and response variables.
Often, we want to interpret these associations causally (i.e.
predicting the change in Y that would occur under some
specific intervention on X).
The validity of this causal interpretation is threatened
by systematic and random errors.
The systematic errors include confounding bias, which get
a lot of attention in training and practice.
Information bias and selection bias are other important
sources of systematic error, and should be considered more
frequently and thoughtfully in design and analysis.