Measurement of Physical Function

Download Report

Transcript Measurement of Physical Function

Group Comparisons
Part 2
Robert Boudreau, PhD
Co-Director of Methodology Core
PITT-Multidisciplinary Clinical Research Center
for Rheumatic and Musculoskeletal Diseases
Core Director for Biostatistics
Center for Aging and Population Health
Dept. of Epidemiology, GSPH
Flow chart for group
comparisons
Measurements to be compared
continuous
discrete
( binary, nominal, ordinal with few values)
Distribution approx normal
or N ≥ 20?
No
Yes
Non-parametrics
T-tests
Chi-square
Fisher’s Exact
Covered in Part 1
Continuous Distributions
 Normal distribution
Mean
 Standard deviation
(computation, interpretation )



Empirical Rule: 68%, 95%, 99%
Confidence Intervals, t-distribution
(computation, interpretation )
Empirical rule for interpreting SD
in normal distributions
2 SD
= 95.4%
1.96 SD
= 95%
(exact)
Confidence Interval
for a Population Mean
Mean:
Standard error of the mean*:
* Standard error is general term for standard deviation of some estimator
Degrees of Freedom (DF) = n-1
Example:
n=19, df=18
t “critical value” is wider than normal z (1.96)
because standard deviation (s) is an estimate (data)
/2= 0.025 for 95%CI
 estimation “noise” is accounted for
1.96  Normal dist (limit)
Aflatoxin levels of raw peanut kernels
n= 15
df=14 (=n-1)
16, 22, 23 26, 26, 27, 28, 30, 31, 35, 36, 37, 48, 50, 52
t 0.025,14= 2.145
95% C.I: 32.47 ± 2.145*(10.63/√15)
= 32.47 ± 2.145*2.744
= 32.47 ± 5.89
= 10.63/√15 = 2.744
95% C.I: (26.58, 38.36)  
Outline For Today
Continuous Distributions
 Comparing 2-groups
Two-sample t-test
 pooled and unequal variance versions
 Wilcoxon Rank-Sum (non-parametric)

Next Didactic
 Comparing > 2 groups
ANOVA, Kruskal-Wallis
 Adjusting for Multiple Comparisons


Categorical data

Binomial distr, Chi-squares, Fisher’s Exact
2-sample independent t-test
for comparing means of two groups
General Formula: stdev = √ variance
If comparing two estimates from independent samples
 Variance(of difference) = sum of variances

e.g. compare mean TNFα cytokine levels at first visit
in RA (n=20) vs JRA (n=17) patients
Note also: Variance(of sum) = sum of variances (if independent)
2-sample t-test
to compare two groups
Case 1: Equal variances
(i.e. SDs similar, not too
different between groups)
“pooled” variance estimate
df = n1 + n2 – 2
Note: df= (n1-1) + (n2-1)
2-sample t-test
to compare two groups
Case 2: Unequal variances (i.e. SDs very different
between the groups)
denom = Stderr of numerator
D.F = Welch-Satterthwaite equation (best approx df)
Does Cell Phone Use While Driving
Impair Reaction Times?
Sample of 64 students from Univ of Utah
 Randomly assigned: cell phone group or control
=> 32 in each group
 On machine that simulated driving situations:
=> at irregular periods a target flashed red or green
 Participants instructed to hit “brake button” as soon as
possible when they detected red light
 Control group listened to radio or to books-on-tape
 Cell phone group carried on conversation about a
political issue with someone in another room
Outlier
(Milliseconds)
Does Cell Phone Use While Driving Impair
Reaction Times ? (2-sided t-test)
Cell Phone
Control
-------Difference
N
32
32
Mean
585.2
533.7
51.5
SD
89.6
65.3
}
SDs quite
different
(~ 0.05 sec)
= sqrt(89.62/32+65.32/32)=19.6
= 56.685
t = 51.5/19.6 = 2.63, p=0.011
Does Cell Phone Use While Driving Impair
Reaction Times ? (2-sided t-test)
Cell Phone
Control
-------Difference
N
32
32
Mean
585.2
533.7
51.5
SD
89.6
65.3
}
SDs quite
different
(~ 0.05 sec)
t = 51.5/19.6 = 2.63, p=0.011
area=0.011/2=0.0055
Half of area
on each side
area=0.011/2=0.0055
-2.63
2.63
Half of area
on each side
Removing the one high outlier
from cell phone group
Cell Phone
Control
-------Difference
N
31
32
Mean
573.1
533.7
SD
58.9 (“equal
65.3 variances”)
39.4
= 62.69
(pooled var)
df= n1+n2-2 = 61
t = 39.4/(62.69*√(1/31+1/32)) = 2.52 (p=0.015)
Three-dimensional and thermal surface imaging
produces reliable measures of joint shape and
temperature: a potential tool for quantifying arthritis
Steven J Spalding, C Kent Kwoh, Robert Boudreau,
Joseph Enama, Julie Lunich, Daniel Huber, Louis Denes
and Raphael Hirsch
Arthritis Research & Therapy 2008
3-D (Laser) & Thermal Imaging
JIA patient






9-year-old female with anti-nuclear antigen
(ANA)-negative and rheumatoid factor (RF)negative polyarticular JIA who presented with
left wrist pain, warmth, and swelling.
Proceeded with intra-articular steroid injection
Imaged before procedure, then 5 days after
Reduction in volume of 2 ml (10% decrease)
No significant change in SDI
HDI values changed from 1.9°C prior to the
injection to 1.1°C after
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
T-test (2-sample independent)
or Wilcoxon Rank-Sum (aka Mann-Whitney)
…………...
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
T-test (2-sample independent)
or Wilcoxon Rank-Sum (aka Mann-Whitney)
Control
(n=10)
Arthritis
(n=9)
1.2
1.4
1.1
2.4
1.0
2.3
1.2
2.1
0.6
3.0
0.5
1.1
1.0
1.4
1.0
1.3
1.3
1.1
1.2
Mean
1.01
1.79
SD
0.26
0.70
Median
1.05
1.40
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
T-test (2-sample independent)
“pooled” df = 10+9-2=17
T-Tests
Variable
Method
Variances
HDI
HDI
Pooled
Satterthwaite
Equal
Unequal
DF
t Value
Pr > |t|
17
10.2
3.36
3.23
0.0037
0.0089
Test for Equality of Variances
Variable
Method
HDI
Folded F
Num DF
Den DF
F Value
Pr > F
8
9
6.60
0.0105
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
Wilcoxon Rank-Sum (aka Mann-Whitney)
The idea/motivation:
 Method should work for any distribution
 non-parametric
 Base statistical test on ranks
 rank = order when all data is sorted from
lowest to highest
 each group then gets a “rank sum”
 Won’t be affected by outliers
 Like all statistical tests, p-value is based on
distribution (of difference in rank-sums here)
assuming there is no difference between the groups
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
Wilcoxon Rank-Sum (aka Mann-Whitney)
Base statistical test on ranks
 each group gets a “rank sum”
 Like all statistical tests, p-value is based on distribution
(of difference in rank-sums here)
assuming there is no difference between the groups
 just like shuffling cards
(with only two colors on cards; even if different n’s)
 the critical values are the “extreme” differences in
rank-sums between the two groups
(α = 0.05 => the most extreme 5% of differences )
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
Wilcoxon Rank-Sum (aka Mann-Whitney)
Wilcoxon Scores (Rank Sums) for Variable HDI
Classified by Variable Group
Group
Control
Arthritis
N
10
9
Sum of
Scores
64.50
125.50
Expected
Under H0
100.0
90.0
Std Dev
Under H0
12.172013
12.172013
Average scores were used for ties.
Mean
Score
6.45000
13.94444
HDI (Heat Distribution Index) of MCPs
10 adults controls vs 9 adults with active RA
Wilcoxon Rank-Sum (aka Mann-Whitney)
Wilcoxon Two-Sample Test
Statistic (S)
125.5000
Normal Approximation
Z
One-Sided Pr > Z
Two-Sided Pr > |Z|
2.8754
0.0020
0.0040
t Approximation
One-Sided Pr > Z
Two-Sided Pr > |Z|
0.0050
0.0101
Exact Test
One-Sided Pr >= S
Two-Sided Pr >= |S - Mean|
0.0012
0.0023
Z includes a continuity correction of 0.5.