adaptive bayesian designs for dose

Download Report

Transcript adaptive bayesian designs for dose

BAYESIAN
ADAPTIVE DESIGN
& INTERIM ANALYSIS
Donald A. Berry
[email protected]
Some references
Berry DA (2003). Statistical
Innovations in Cancer Research. In
Cancer Medicine e6. Ch 33. BC
Decker. (Ed: Holland J, Frei T et al.)
 Berry DA (2004). Bayesian statistics
and the efficiency and ethics of
clinical trials. Statistical Science.

2
Benefits
 Adapting;
examples
 Stop
early (or late!)
 Change doses
 Add arms
 Drop arms
 Final
analysis
 Greater
precision (even full follow-up)
 Earlier conclusions
3
Goals
 Learn
faster: More efficient
trials
 More efficient drug/device
development
 Better treatment of patients
in clinical trials
4
OUTLINE: EXAMPLES
 Extraim
analysis
 Modeling
early endpoints
 Seamless
Phase II/III trial
 Adaptive
randomization
 Phase
II trial in AML
 Phase II drug screening process
 Phase III trial
5
EXTRAIM ANALYSES*
 Endpoint:
CR (detect 0.42 vs 0.32)
 80% power: N = 800
 Two extraim analyses, one at 800
 Another after up to 300 added pts
 Maximum n = 1400 (only rarely)
 Accrual: 70/month
 Delay in assessing response
*Modeling due to Scott Berry
<[email protected]>
6
 After
800 pts accrued, have
response info on 450 pts
 Find pred prob of stat sig when
full info on 800 pts available
 Also when full info on 1400
 Continue if . . .
 Stop if . . .
 If continue, n via pred prob
 Repeat at 2nd extraim analysis
7
Tabl e 1: p0=0.42
p1
P(succ)
0.37 0.0001
0.42 0.0243
0.47 0.4467
0.52 0.9389
0.57 0.9989
meanSS
844.6
1011.2
1188.5
1049.9
874.2
sdSS
122.0
247.6
254.5
248.7
149.1
P(800)
0.8707
0.5324
0.2568
0.4435
0.7849
P(1400)
0.0194
0.2360
0.5484
0.2693
0.0268
P(succ1)
0.0001
0.0084
0.1052
0.4217
0.7841
P(succ2)
0.0001
0.0059
0.0914
0.2590
0.1729
Tabl e 2: p0=0.32
p1
P(succ)
0.27 0.0001
0.32 0.0284
0.37 0.4757
0.42 0.9545
0.47 0.9989
meanSS
836.5
1013.1
1186.6
1045.5
922.7
sdSS
111.1
246.3
252.0
245.9
181.0
P(800)
0.8937
0.5238
0.2513
0.4485
0.6632
P(1400)
0.0152
0.2338
0.5339
0.2449
0.0258
P(succ1)
0.0005
0.0094
0.1083
0.4316
0.6632
P(succ2)
0.0000
0.0083
0.1044
0.2505
0.2111
sdSS
95.3
246.6
246.3
234.8
205.4
P(800)
0.9163
0.5242
0.2313
0.3702
0.4121
P(1400)
0.0086
0.2340
0.5392
0.2030
0.0508
P(succ1)
0.0000
0.0090
0.1089
0.3577
0.3977
P(succ2)
0.0000
0.0062
0.1063
0.2065
0.1685
Tabl e 3: p0=0.22
p1
P(succ)
0.17 0.0000
0.22 0.0288
0.27 0.5484
0.32 0.9749
0.37 0.9995
vs 0.80
meanSS
827.7
1013.3
1199.0
1074.4
1024.7
MODELING EARLY ENDPOINTS:
LONGITUDINAL MARKERS
 Example
CA125 in ovarian cancer
 Use available data from trial (&
outside of trial) to model
relationship over time with
survival, depending on Rx
 Predictive distributions
 Use covariates
 Seamless phases II & III
9
CA125 data & predictive
distributions of survival for
two of many patients* ——>
*Modeling due to Scott Berry
<[email protected]>
10
Patient #1
Treatment
Days
Patient #1
Patient #2
Days
Patient #2
Methods
 Analytical
 Multiple
imputation
15
SEAMLESS PHASES II/III*
 Early
endpoint (tumor response,
biomarker) may predict survival?
 May depend on treatment
 Should model the possibilities
 Primary endpoint: survival
 But observe relationships
*Inoue, et al (2002 Biometrics)
16
Conventional drug development
Good
resp
No resp Stop
No survival
advantage
Not
Phase 3
Phase 2
6 mos
Survival
advantage Market
9-12 mos
> 2 yrs
Seamless phase 2/3
< 2 yrs (usually)
17
Seamless phases
 Phase
2: 1 or 2 centers; 10 pts/mo,
randomize E vs C
 If pred probs “look good,” expand to
Phase 3: Many centers; 50 pts/mo
(Initial centers continue accrual)
 Max n = 900
[Single trial: survival data combined
in final analysis]
18
Early stopping
 Use
pred probs of stat sig
 Frequent analyses (total of 18)
using pred probs to:
 Switch
to Phase 3
 Stop accrual for


Futility
Efficacy
 Submit
NDA
19
Comparisons
Conventional Phase 3 designs:
Conv4 & Conv18, max N = 900
(same power as adaptive design)
20
Expected N under H0
1000
855
884
Conv4
Conv18
800
600
431
400
200
0
Bayes
21
Expected N under H1
1000
887
888
Conv4
Conv18
800
649
600
400
200
0
Bayes
22
Benefits
 Duration
of drug development is
greatly shortened under adaptive
design:
 Fewer
patients in trial
 No hiatus for setting up phase 3
 All patients used for


Phase 3 endpoint
Relation between response & survival
23
Possibility of large N
N
seldom near 900
 When
it is, it’s necessary!
 This
possibility gives Bayesian
design its edge
[Other reason for edge is
modeling response/survival]
24
ADAPTIVE RANDOMIZATION
Giles, et al JCO (2003)
 Troxacitabine
(T) in acute myeloid
leukemia (AML) combined with
cytarabine (A) or idarubicin (I)
 Adaptive randomization to:
IA vs TA vs TI
 Max n = 75
 End point: Time to CR (< 50 days)
25
Adaptive Randomization
 Assign
1/3 to IA (standard)
throughout (until only 2 arms)
 Adaptive
to TA and TI based on
current results
 Results

26
Patient
Prob IA
Prob TA
Prob TI
Arm
CR<50
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.34
0.35
0.37
0.38
0.39
0.39
0.44
0.47
0.43
0.50
0.50
0.47
0.57
0.57
0.56
0.56
0.33
0.32
0.32
0.30
0.28
0.28
0.27
0.23
0.20
0.24
0.17
0.17
0.20
0.10
0.10
0.11
0.11
TI
IA
TI
IA
IA
IA
IA
TI
TI
TA
TA
TA
TA
TI
TA
IA
TA
not
CR
not
not
not
CR
not
not
not
CR
not
not
not
not
CR
not
CR
27
Patient
18
19
20
21
22
Drop 23
24
TI 25
26
27
28
29
30
31
32
33
34
Prob IA
Prob TA
Prob TI
Arm
CR<50
0.33
0.33
0.33
0.33
0.33
0.33
0.33
0.87
0.87
0.96
0.96
0.96
0.96
0.96
0.96
0.96
0.96
0.55
0.54
0.53
0.49
0.46
0.58
0.59
0.13
0.13
0.04
0.04
0.04
0.04
0.04
0.04
0.04
0.04
0.11
0.13
0.14
0.18
0.21
0.09
0.07
0
0
0
0
0
0
0
0
0
0
TA
TA
IA
IA
IA
IA
IA
IA
TA
TA
IA
IA
IA
IA
TA
IA
IA
not
not
CR
CR
CR
CR
CR
not
not
not
CR
not
CR
not
not
not
CR
Compare n = 75
28
Summary of results
CR < 50 days:
 IA: 10/18 = 56%
 TA: 3/11 = 27%
 TI:
0/5 = 0%
Criticisms . . .
29
SCREENING PHASE II DRUGS
 Many
drugs
 Tumor response
 Goals:


Treat effectively
Learn quickly
30
Standard designs
 One
drug (or dose) at a time;
no drug/dose comparisons
 Typical comparison by null
hypothesis: RR = 20%
 Progress hopelessly slow!
31
Standard 2-stage design
First stage 20 patients:
 Stop if ≤ 4 or ≥ 9 responses
 Else second set of 20
32
An adaptive allocation
 When
assigning next patient, find
r = P(rate ≥ 20%|data) for each drug
 Assign drugs in proportion to r
 Add drugs as become available
 Drop drugs that have small r
 Drugs with large r  phase 3
33
Suppose 10 drugs, 200 patients
Identify nugget …
With probability: In average n:
110
50
Adaptive
Standard
<70%
Adaptive
>99%
Standard
9 drugs
have mix
of RRs
20% & 40%,
1 has 60%
(“nugget”)
Adaptive also better at finding “40%”, & sooner
34
Suppose 100 drugs, 2000 patients
Identify nugget …
With probability: In average n:
1100
500
Adaptive
Standard
<70%
Adaptive
>99%
Standard
99 drugs
have mix
of RRs
20% & 40%,
1 has 60%
(“nugget”)
Adaptive also better at finding “40%”, & sooner
35
Consequences
 Treat
pts in trial effectively
 Learn quickly
 Attractive to patients, in and
out of the trial
 Better drugs identified
sooner; move through faster
36
PHASE III TRIAL
 Dichotomous
Q
endpoint
= P(pE > pS|data)
 Min
n = 150; Max n = 600
 After
n = 50, assign to arm E
with probability Q
 Except
 (Not
that 0.2 ≤ P(assign E) ≤ 0.8
“optimal,” but …)
37
Recommendation to DSMB to
 Stop
for superiority if Q ≥ 0.99
 Stop
accrual for futility if
P(pE – pS < 0.10|data) > PF
 PF
depends on current n . . .
38
Futility stopping boundary
1.0
0.95
0.8
0.75
0.6
PF
0.4
0.2
0.0
0
100
200
300
400
500
600
n
39
Common prior
density for pE & pS
 Independent
 Reasonably
 Mean
 SD
non-informative
= 0.30
= 0.20
40
Beta(1.275, 2.975)
density
0
.1
.2
.3
.4
.5
p
.6
.7
.8
.9
1
41
Updating
After 20 patients on each arm

8/20 responses on arm 1
 12/20
responses on arm 2
42
Beta(9.275,
14.975)
0
.1
.2
Beta(13.275,
10.975)
.3
.4
.5
p
.6
.7
.8
.9
1
43
Assumptions
 Accrual:
 50-day
10/month
delay to assess response
44
Need to stratify. But how?
Suppose probability assign to
experimental arm is 30%, with
these data . . .
45
Proportions of Patients on
Experimental Arm by Strata
Stratum 1
Stratum 2
Small
Big
Small
6/20 (30%)
10/20 (50%)
Big
6/10 (60%)
2/10 (20%)
Probability of Being Assigned to
Experimental Arm for Above Example
Stratum 1
Stratum 2
Small
Big
Small
37%
24%
Big
19%
44%
46
One simulation; pS = 0.30, pE = 0.45
1.0
0.9
0.8
0.7
0.6
Superiority boundary
Probability
Exp is better
178/243
= 73%
Proportion Exp
0.5
0.4
0.3
0.2
0.1
0.0
0
6
Std
Exp
12
12/38
38/83
18
24 Months
19/60
82/167
Final
20/65
47
87/178
One simulation; pE = pS = 0.30
1.0
Probability futility
0.9
Futility boundary
0.8
0.7
87/155
= 56%
0.6
0.5
0.4
Proportion Exp
Probability
Exp is better
0.3
0.2
0.1
0.0
0
6
Std
Exp
12
9 mos.
8/39
11/42
18
End
15/57
32/81
24 Months
Final
18/68
22/87
48
Operating characteristics
Prob
True ORR selec t
Std
Exp
exp
0.3
0.2 <0.001
0.3
0.3
0.05
0.3
0.4
0.59
0.3
0.45
0.88
0.3
0.5
0.98
0.3
0.6
1.0
Mean # of patients (%)
Std
Exp
Total
51 (34.9 ) 95 (65.1 ) 146
87 (43.1 ) 115 (56.9 ) 202
87 (30.4 ) 199 (69.6 ) 286
79 (30.7 ) 178 (69.3 ) 257
59 (29.5 ) 141 (70.5 ) 200
47 (30.1 ) 109 (69.9 ) 156
Mean
length Prob
(mos) max n
15
<0.001
20
0.003
29
0.05
26
0.02
20
0.003
16
<0.001
49
FDA: Why do this?
What’s the advantage?
 Enthusiasm
of PIs
 Comparison
with
standard design . . .
50
Adaptive vs tailored balanced design
w/same false-positive rate & power
(Mean number patients by arm)
ORR pS = 0.20 pS = 0.30 pS = 0.40
pE = 0.35 pE = 0.45 pE = 0.55
Arm Std Exp Std Exp Std Exp
Adaptive 68 168 79 178 74 180
Balanced 171 171 203 203 216 216
Savings 103 3 124 25 142 36
51
Consequences of Bayesian
Adaptive Approach
 Fundamental
change in way
we do medical research
 More rapid progress
 We’ll get the dose right!
 Better treatment of patients
 . . . at less cost
52
OUTLINE: EXAMPLES
 Extraim
analysis
 Modeling
early endpoints
 Seamless
Phase II/III trial
 Adaptive
randomization
 Phase
II trial in AML
 Phase II drug screening process
 Phase III trial
53