Open slides - CTN Dissemination Library

Download Report

Transcript Open slides - CTN Dissemination Library

CTN Design & Analysis Workshop
Handling Missing Data in the
Analysis of CTN Trials:
Pitfalls and Possible Solutions
Neal Oden, PhD, DSC2-EMMES
Gaurav Sharma, PhD, DSC2-EMMES
Paul Van Veldhuisen, PhD, DSC2-EMMES
Paul Wakim, PhD, CCTN, NIDA
15 March 2011
Today’s Workshop






The problem
Prevention
Types of missing data
Analysis methods
Case study
Open discussion
Missing Data

Information within a trial that is meaningful
for analysis but not collected
 Focus
here mostly on primary outcome data,
but relevant to missing secondary outcomes
and covariates too
Missing Data

Randomization
 Balances
treatment groups for known and
unknown factors
 Lose
benefits if there is drop-out, as groups at
outcome may not have been similar at baseline

Intention-to-treat principle
 Violates
principle if not all participants
contribute to the primary analysis
Missing Data

If missing unrelated to assigned treatment
 Reduces

statistical power
If missing related to assigned treatment or
to outcome
 Biases
the estimate of the treatment effect
Causes of Missing Data



Due to discontinuation of study treatment
Outcomes undefined for some participants
 QOL measures after death
 Quantitative drug use hair analysis in
individuals without hair
 Test fails/specimen lost
Attrition
 Related to health status/drug use
 Unrelated to health status/drug use (e.g.,
moved)
Continuing Data Collection
for “Drop-Outs”

Distinction between
Premature end of treatment
AND
End of study

Does collecting data after premature end
of treatment make sense?
Rationale



Preserves intention-to-treat approach
Many CTN trials are pragmatic trials
 NOT “Does treatment work if perfectly
delivered”?
but RATHER
 “Is this a good treatment strategy or
policy”?
OR
 “What happens once treatment starts or is
recommended?”
Rationale

Delivery of medicine deals with people in the
real world
A
100% efficacious cure for stimulant use is
useless for public health if nobody can stand it.

Strive to collect complete data for primary
outcome on ALL participants, even in those
who do not complete intervention
 Too
much missing data - > no way result will be
believable no matter how sophisticated the
statistical method
Why Do We Like It?
Weight loss diet



People on the effective arm lose weight
and stay in the study
Some on the ineffective arm get
discouraged and quit
If we analyzed only the people who stayed
in the trial, the ineffective arm would look
too good
Approaches to Missing Data

Design and conduct of clinical trial that
minimizes missing data
 May
require trade-offs with
generalizability

Apply analysis methods that use
information in observed data to help
analyze primary outcome data in the
presence of missing data
An ounce of
prevention is
worth a pound
of cure
B. Franklin
Minimize Missing Data in…..
Trial Design

Flexible dose

Target population

Allow rescue therapy for poor responders


Define primary outcomes that are highly
ascertainable
Minimize participant burden/reduce follow-up
 Number
of visits/assessments
Minimize Missing Data in…...
Trial Conduct



Explain importance of trial participation
during consent process
Emphasize to staff importance of maintaining
follow-up even when treatment is refused
Incentives
 For
participants, need to ensure level is not
viewed as coercive
Minimize Missing Data in…...
Trial Conduct

Expression of thanks
 Written/verbal

Assistance with travel

Reminders before visits

Welcoming staff/friendly environment

Keep locator information current

Monitor and report to investigators extent
of missing data
Availability of Primary Outcome:
Percent of Measures with Values
(N=29 trials)
100
90
80
Percent
70
60
50
40
30
20
10
0
CTN Study
What’s the big deal?
We need N = 400 (based on power analysis)
But we expect 20% missing
So we set the initial N = 500
So that the final (analyzed) N = 400
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Technical terms that we can’t escape…
Missing at random (MAR)
Missing completely at random (MCAR)
Missing not at random (MNAR)
Ignorable
Non-ignorable
… but what do they mean?
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Missing Completely at Random (MCAR)
(Non-technical) Definition:
The fact that Y is missing has nothing to do with the
unobserved value of Y, or with other variables
Therefore:
The set of participants with complete data can be
regarded as a simple random (or representative)
sample of all participants
What to do?
Ignore the missing data and analyze the available data
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Missing at Random (MAR)
(Non-technical) Definition:
The fact that Y is missing can be explained by other
observed values of Y, or by other measured variables
Therefore:
The observed data can be used to account for the
missing data
What to do?
Use Maximum Likelihood or Multiple Imputation
approach, and include in the model the other
measured variables that explain missingness
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Missing Not at Random (MNAR)
(Non-technical) Definition:
The fact that Y is missing cannot be explained by
other observed values of Y, or by other measured
variables
Therefore:
The observed data cannot be used to account for
the missing data; and outside information is needed
In simple English:
We have a problem
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
In Summary…
Missingness
(i.e. whether the data are missing or not)
is related to
is not related to
observed or
unobserved data
MCAR
MAR
observed data
MNAR
unobserved data
unobserved data
Based on Graham 2009
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Bottom Line
MCAR: No big deal
MAR:
Use available collected data to
“explain” missing mechanism, and
use existing statistical methods
MNAR: Need outside information to
“explain” missing mechanism
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Ignorable & Non-Ignorable
(roughly speaking)
Ignorable (available data are sufficient):
• Missing Completely At Random (MCAR)
• Missing At Random (MAR)
Non-Ignorable (need outside information):
• Missing Not At Random (MNAR)
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Missing Data Analysis
Methods
Complete Case and Pairwise Deletion
CC
Y1
X
X
X
X
PD
Y2
X
X
X
X
Y3
X
X
-
Y1
X
X
X
X
Y2
X
X
X
X
(Correlation Illustration)
Simple, Default in Statistical Software
 Potential loss of info and precision
 Biased when observation is not MCAR

Y3
X
X
-
Single Imputation
Impute a single value, i.e. mean, BOCF, LOCF,
imputing missing as positive…
Simple, artificially increases sample size
 Underestimate SE and incorrect p-values
 Most SI methods require MCAR assumptions to
hold, while some, such as LOCF, even require very
strong and often unrealistic assumptions

Multiple Imputation (MI)
Observed Data
Imputations
1 2…m
…
…
…
…
A simulation based approach to missing data
The General Idea
IMPUTATION
(1)
Incomplete Data
ANALYSIS
(2)
Imputed Data
Analysis Results
POOLING
(3)
Final Results
(1) IMPUTATION Models

The imputation model should include primary
predictive variables and other variables
associated with missingness

Multiple Imputation method is robust even with
approximate imputation models
(2) ANALYSIS Models

Regression Model

General Linear Model

Generalized Linear Model (Logistic Regression,
Poisson Regression)
(3) Rules for POOLING
Estimate 1
Variance 1
Estimate 2
Variance 2
Estimate …
3
Variance 3
Estimate ‘m’
Variance ‘m’
Mean of Estimate
Within Variance + Between Variance
=
Total Variance

Confidence Interval for Parameter of Interest is given by

Mean of Estimate + tdf √(Total Variance)
Desirable Features



MI gives approximately unbiased estimates of all
parameters
MI provides good estimates of the standard
errors
MI can be used with many kinds of data and
analyses without specialized software
Requires MAR assumption
Maximum likelihood

Basic idea
Given some data,
 Try to guess the parameter(s) of the probability
distribution that generated the data
 MLE of a parameter is the value that maximizes the
probability of the data you already have

Example:


Flip a coin, get 45 heads, 36 tails
We don’t know p, but whatever it is:


Pr(45 H in 81 tosses) = K p45(1-p)36
How to guess p?
Pick the value of p that maximizes the probability of
what already happened
 Pick p to maximize L = p45(1-p)36
 Best guess turns out to be 45/81

Maximum likelihood estimates
have nice properties


Consistent
Asymptotically
Normal
 Unbiased
 minimum variance


etc.
New problem




H = 45
T = 36
? = 19
Now how to guess p?
If we knew how many missing were H and how
many T, we would know what to do.
 But we don’t.
 What to do?

A solution
 If
data are MAR,
 you can get MLE’s by
 maximizing
the (conditional) likelihood
for the nonmissing data
 ignoring the missing data mechanism.
Important Application

Longitudinal analysis
Participant 1, visit 1, 2, 3, …
 Participant 2, visit 1, 2, 3, …



For each visit, y = a + b1 x1 + b2 x2 + …
First approach:
Treat all visits as independent
 Do the regression on all visits together
 Wrong, because visits from a single participant are
related, not independent

Important Application (cont’d)

Second approach





The visits from a single participant have covariance
Use a mixed model
It used to be that you had to have all visits nonmissing
for this analysis
But modern software (SAS MIXED, GLIMMIX)
ignores the missing-data mechanism and gets MLE’s
from only the nonmissing data, even if some visits are
missing.
If data are MAR, this is fine!
Modern longitudinal ML software
uses more data
Visit
Participant
1
1
2
3
4
5
2
3
4
Neither old nor
new method can
use this visit
Older CC analysis
would use only
these cases
Complete visit
Incomplete visit
Another application

Survival analysis
Example: time to relapse
 For some people, you have the time
 For others, you don’t because

Study ended
 People died
 People dropped out
 etc.
 People without relapse times are said to be

CENSORED
Another application (cont’d)






For censored people, you don’t know the relapse time, but you
know it is after the censor time
Survival analysis handles censored data, but
You have to make the assumption that censoring is
noninformative.
If people drop out because they know they are going to relapse
the next day, the censoring is informative.
Informative censoring gives biased survival time estimates
The “noninformative censoring” assumption is basically an MAR
assumption.
What if data are not MAR?


When the missing data are nonignorable (i.e.,
MNAR), standard statistical models can yield
badly biased results
Cannot test MAR versus MNAR
Sensitivity Analysis
The missing data mechanism is not
identifiable from observed data
 We don’t know what we don’t know
 One or more analyses can be performed
using different assumptions

 Example:

Worst Case Analysis
(won’t work with a lot of missing data)
Goals of Sensitivity Analysis


Consider a range of potential associations
between missingness and response
Assess the degree to which conclusion can be
influenced by the missingness mechanism
If the conclusion is largely unchanged the result may
be considered robust
 Otherwise, the conclusion should be interpreted
cautiously and may be misleading

MNAR models



Use of non-ignorable models can be helpful in
conducting a sensitivity analysis
Not necessarily a good idea to rely on a single
MNAR model, because the assumptions about
the missing data are impossible to assess with
the observed data
One should use MNAR models sensibly,
possibly examining several types of such models
for a given dataset
Two general classes of MNAR
models


Selection Models – use model for the full data
response and a selection mechanism
Pattern Mixture Models – use mixture of
missing data pattern information in the model
Case Study:
CTN0010 - BUP for Adolescents
Two groups:
Bup/Nal detoxification over 2 weeks
vs.
Bup/Nal maintenance over 12 weeks
N (analyzed) = 152
at 6 community treatment programs
Main outcome measure: Opioid-positive urine
test result at weeks 4, 8 & 12
Evaluation: weekly for 12 weeks,
comprehensive at 4, 8, 12, 24, 36 & 52 weeks
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Woody, JAMA 2008
Missingness in CTN0010
(from Paul Allison’s analysis)
20 participants had missing outcome for all 12 weeks
(effective sample size = N – 20)
Available Data (after removing the 20 cases)
Week
1
2
3
4
5
6
7
8
9
10 11 12
%
90 74 60 78 48 45 44 69 40 37 37 67
present
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Paul Allison’s Analysis
• Included in the model each of Weeks 1 to 12
• Used Maximum Likelihood Estimation (MLE)
and Multiple Imputation (MI) approaches
(MLE is preferred over MI)
• Used random effects (mixed) logit model
with SAS PROC GLIMMIX
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Take-Home Messages
1) Model all the available outcome data at all time
points, including outcome at baseline (t=0), and then
test the time points (contrasts) of interest
2) There are good data analytic methods for dealing with
missing data in repeated-measures designs (under
MAR assumption): use random effects (mixed) models
estimated by maximum likelihood
3) Allow for a linear and quadratic time trend (saves
degrees of freedom), or spline model (broken line)
4) If no time-related pattern, use time as a class variable,
i.e. each time point is a category (not continuous)
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Take-Home Messages (cont’d)
5) Imputing missing outcomes as positive is a
crude approach – one can often do better
6) Incorporation of covariates and auxiliary
variables
7) Sensitivity analysis is absolutely vital
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
References
Allison, Missing Data, Sage University Papers Series on Quantitative
Applications in the Social Sciences, 07-136, Thousand Oaks, CA: Sage, 2001.
Fitzmaurice, Laird & Ware, Applied Longitudinal Analysis, Wiley, 2004.
Graham, Missing Data Analysis: Making It Work in the Real World, Annual
Review of Psychology, 2009, 60: 549-576.
Liang & Zeger, Longitudinal Data Analysis of Continuous and Discrete
Responses for Pre-Post Designs, Sankhya, 2000, 62(B): 134-148.
Weiss, An Introduction to Modeling Longitudinal Data, presentation at
UCLA CALDAR Summer Institute on Longitudinal Research, August 2010.
Woody et al., Extended vs Short-term Buprenorphine-Naloxone for
Treatment of Opioid-Addicted Youth: A Randomized Trial, JAMA, 2008,
300(17): 2003-2011.
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Contact Information
Neal Oden: [email protected]
Gaurav Sharma: [email protected]
Paul Van Veldhuisen: [email protected]
Paul Wakim: [email protected]
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services
Questions & Comments
National Institute on Drug Abuse ─ National Institutes of Health ─ U.S. Department of Health and Human Services