Lect 1 Medical Statistics as a science
Download
Report
Transcript Lect 1 Medical Statistics as a science
Medical
Statistics
as a science
Why Do Statistics?
Extrapolate from data collected to make general
conclusions about larger population from which
data sample was derived
Allows general conclusions to be made from limited
amounts of data
To do this we must assume that all data is randomly
sampled from an infinitely large population, then
analyse this sample and use results to make
inferences about the population
Statistical Analysis
in a Simple Experiment
Define population of interest
Randomly select sample of subjects to study
(clinical trials do not enrol a randomly selected sample of patients
due to inclusion/exclusion criteria but define a precise patient
population)
Half the subjects receive one treatment and the other half another
treatment (usually placebo)
Measure baseline variables in each group
(e.g. age, Apache II to ensure randomisation successful)
Measure trial outcome variables in each group (e.g. mortality)
Use statistical techniques to make inferences about the distribution
of the variables in the general population and about the effect of the
treatment
Data
Categorical data: values belong to categories
Nominal data: there is no natural order to the categories
e.g. blood groups
Ordinal data: there is natural order e.g. Adverse Events
(Mild/Moderate/Severe/Life Threatening)
Binary data: there are only two possible categories
e.g. alive/dead
Numerical data: the value is a number
(either measured or counted)
Continuous data: measurement is on a continuum
e.g. height, age, haemoglobin
Discrete data: a “count” of events e.g. number of pregnancies
Descriptive Statistics:
concerned with summarising or describing a
sample eg. mean, median
Inferential Statistics:
concerned with generalising from a sample,
to make estimates and inferences about a
wider population eg. T-Test, Chi Square test
Statistical Terms
Mean: the average of the data
sensitive to outlying data
Median: the middle of the data
not sensitive to outlying data
Mode: most commonly occurring value
Range: the spread of the data
IQ range: the spread of the data
commonly used for skewed data
Standard deviation: a single number which measures
how much
the observations vary
around the mean
Symmetrical data: data that follows normal distribution
(mean=median=mode)
report mean & standard deviation &
n
Skewed data: not normally distributed
(meanmedian mode)
report median & IQ Range
Standard Normal Distribution
Standard Normal Distribution
Mean +/- 1 SD encompasses 68% of observations
Mean +/- 2 SD encompasses 95% of observations
Mean +/- 3SD encompasses 99.7% of observations
Steps in Statistical Testing
Null hypothesis
Ho: there is no difference between the groups
Alternative hypothesis
H1: there is a difference between the groups
Collect data
Perform test statistic eg T test, Chi square
Interpret P value and confidence intervals
P value 0.05 Reject Ho
P value > 0.05 Accept Ho
Draw conclusions
Meaning of P
P Value: the probability of observing a result
as extreme or more extreme than the one
actually observed from chance alone
Lets us decide whether to reject or accept the
null hypothesis
P > 0.05
P = 0.01 to 0.05
P = 0.001 to 0.01
P < 0.001
Not significant
Significant
Very significant
Extremely significant
T Test
T test checks whether two samples are likely to have come from the same
or different populations
Used on continuous variables
Example: Age of patients in the APC study (APC/placebo)
PLACEBO:
mean age 60.6 years
mean age 60.5 years
SD +/- 17.2
n= 850
95% CI 59.3-61.7
What is the P value?
SD+/- 16.5
n= 840
95% CI 59.5-61.7
APC:
0.01
0.05
0.10
0.90
0.99
P = 0.903 not significant patients from the same population
(groups designed to be matched by randomisation so no surprise!!)
T Test: SAFE “Serum Albumin”
n
mean
SD
95% CI
PLACEBO
ALBUMIN
3500
28
10
27.7-28.3
3500
30
10
29.7-30.3
Q: Are these albumin levels different?
Ho = Levels are the same (any difference is there by chance)
H1 =Levels are too different to have occurred purely by
chance
Statistical test: T test P < 0.0001 (extremely significant)
Reject null hypothesis (Ho) and accept alternate hypothesis
(H1)
ie. 1 in 10 000 chance that these samples are both from the
same overall group therefore we can say they are very likely to
be different
Effect of Sample Size Reduction
PLACEBO
ALBUMIN
n
350
350
mean
28
30
SD
10
10
95% CI
27.0-29.0
29.0-31.0
smaller sample size (one tenth smaller)
causes wider CI (less confident where mean is)
P = 0.008 (i.e. approx 0.01 P is significant but
less so)
This sample size influence on ability to find any
particular difference as statistically significant is a
major consideration in study design
Reducing Sample Size (again)
n
PLACEBO
35
ALBUMIN
35
mean
28
30
SD
10
10
95% CI
24.6-31.4
26.6-33.4
using even smaller sample size (now 1/100)
much wider confidence intervals
p=0.41 (not significant anymore)
SMALLER STUDY has LOWER POWER to find any
particular difference to be statistically significant (mean and
SD unchanged)
POWER: the ability of a study to detect an actual effect or
difference
Chi Square Test
Proportions or frequencies
Binary data e.g. alive/dead
PROWESS Study: Primary endpoint: 28 day all cause
mortality
ALIVE
DEAD
TOTAL
% DEAD
PLACEBO
581 (69.2%)
259 (30.8%)
840 (100%)
30.8
DEAD
640 (75.3%)
210 (24.7%)
850 (100%)
24.7
TOTAL
1221 (72.2%)
469 (27.8%)
1690 (100%)
Reduction in death rate = 30.8%-24.7%= 6.1%
ie 6.1% less likely to die in APC group
Perform Chi Square test P = 0.006 (very significant)
6 in 1000 times this result could happen by chance
994 in 1000 times this difference was not by chance
variation
Reducing Sample Size
Same results but using much smaller sample size (one tenth)
ALIVE
DEAD
TOTAL
% DEAD
PLACEBO
58 (69.2%)
26 (30.8%)
84 (100%)
30.8
DEAD
64 (75.3%)
21 (24.7%)
85 (100%)
24.7
TOTAL
122 (72.2%)
47 (27.8%)
169 (100%)
Reduction in death rate = 6.1% (still the same)
Perform Chi Square test P = 0.39
39 in 100 times this difference in mortality could have
happened by chance therefore results not significant
Again, power of a study to find a difference depends a lot
on sample size for binary data as well as continuous data
Summary
Size matters=BIGGER IS BETTER
Spread matters=SMALLER IS BETTER
Bigger difference=EASIER TO FIND
Smaller difference=MORE DIFFICULT TO
FIND
To find a small difference you need a big study