Transcript p value

P Values - part 3
The P value as a
‘statistic’
Robin Beaumont
1/03/2012
With much help from
Professor Geoff Cumming
P values - Putting it all together
Summary
Reviewso far
• A P value is a conditional probability which
considers a range of outcomes – shown as a
‘area’ in a graph.
• The SEM formula allows us to: predict the
accuracy of your estimate ( i.e. the mean
value of our sample) across a infinite
number of samples!
Summary
far
What is a so
statistic?
• A statistic is just a summary measure,
technically we have reduced a set of data to
one or two values:
• Range (smallest – largest)
• Mean, median etc.
• Inter-quartile range, SD Variance
• Z score, T value, chi square value, F value etc
• P value
T value
• T statistic – different types, simplest 1 sample:
Tstatistic  observed
difference in estimated mean and population value
sampling variability in means
Tstatistic  observed
difference in estimated mean and population value
SEM

observed difference in estimated mean and population value
expected variability in means due to random samping
Signal

Noise
So when t = 0 means 0/anything = estimated and
hypothesised population mean are equal
So when t = 1 observed different same as SEM
So when t = 10 observed different much greater than
SEM
T statistic example
Serum amylase values from a random sample of 15 apparently healthy
subjects. The mean = 96 SD= 35 units/100 ml.
How likely would such a ‘unusual’ sample be obtained from a
population of serum amylase determinations with a mean of 120.
(taken from Daniel 1991 p.202 adapted)
The population value = the null
hypothesis
Tstatistic 
96  120
24


35
9.037
15
2.656
This looks like a rare occurrence?
t density:
s x = 9.037 n =15
120
96
Given that the sample was obtained
from a population with a mean of 120
a sample with a T(n=15) statistic of 2.656 or 2.656 or one more extreme
will occur 1.8% of the time = just
under two samples per hundred on
Shaded
area
average.
=0.0188
....
What does the shaded
Given that the sample was obtained
area mean!
from a population with a mean of 120
0
a sample of 15 producing a mean of 96
2.656
0
-2.656
t
(120-x where x=24) or 144 (120+x
Serum amylase values from a random sample of 15 apparently
healthy subjects. mean =96 SD= 35 units/100 ml.
where x=24) or one more extreme will
How likely would such a unusual sample be obtained from a
population of serum amylase determinations with a mean of 120.
occur 1.8% of the time, that is just
(taken from Daniel 1991 p.202 adapted)
under two samples per hundred on
=P value
average.
P value = 2 · P(t < t| H is true) = 2 · [area to the left of t under a t distribution with df = n − 1]
Original units:
(n−1)
o
P value and probability for the one sample t statistic
p value
= 2 x P(t(n-1) values more extreme than obtained t(n-1) | Ho is true)
= 2 X [area to the left of t under a t distribution with n − 1 shape]
Statistic -> sampling distribution -> PDF -> p value
No sampling distribution! Create a virtual one
P Value Variability
Taking another random sample the P value be different
How different? – Does not follow a normal distribution
Depends upon the probability of the null hypothesis
being true! Remember we have assumed so far that the
null hypothesis is true.
Dance of the p values – Geoff Cummings
Simplified dance of the p values when the null hypothesis is true
Example from Geoff
Cummings dance of the p
values
The take home message is
that we can obtain very small
p values even when the null
hypothesis is true.
Why no CI for the P Value if it varies across trials
• P value -> statistic but
• Not all statistics represent values that are reflected in a
population value
• Other ways of getting an idea of variability across trials:
• Reproducibility Probability Value (RP)
Goodman 1992 and also 2001 journal articles
Hung, O’Neill, Bauer & Kohne 1997 Biometrics journal
Shao & Chow 2002 – Statistics in Medicine journal
Boos & Stefanki 2011 – Journal of the American statistical
association
Cummings 2008 + and book
Cumming’s Reproducibility (replication) Probability Value
Given Pobtained = 0.05
What is the interval in which
we are likely to see 80% of
subsequent P values?
Answer:
We have 80% of seeing
subsequent p values fall within
the zero to 0.22 boundary
0, 0.22 [One sided]
This means that we have a
20% of them being
subsequently > 0.22
What about when the null hypothesis is not true?