Getting to the essential

Download Report

Transcript Getting to the essential

Measuring chance
Probabilities
FETP India
Competency to be gained
from this lecture
Apply probabilities to field epidemiology
Key issues
•
•
•
•
Probabilities
Rule of addition
Rule of multiplication
Non-independent events
Question 1
• Suppose:
 The success rate of a programme to stop smoking
is 75% compared to the expected 70%
• Can we really be certain that the programme
is successful?
Probabilities
Question 2
• Suppose:
 The mean height of 200 adults in a suburban
area of a city is 165 cm compared to the city’s
mean height of 170 cm
• Can we really be certain that people in the
suburb have a shorter height?
Probabilities
Question 3
• Suppose:
 In a trial involving 100 patients, treatment A is
better than treatment B
• Can we really be certain that treatment A is
better than treatment B?
Probabilities
Probabilities and statistical inference
• In statistics, we infer from a sample to a
population
• In that process, there is a element of chance
• We study probabilities (science) to measure
this element of chance (intuitive notion)
Probabilities
Tossing a coin
• Two possible events (Outcomes):
 Head or tail
• Probability of getting a head in a toss:
 1/2 = 50%
• Probability of getting a tail in a toss:
 1/2 = 50%
Probabilities
Throwing a dice
• Six possible events (Outcomes):
 1,2,3,4,5 or 6
• Probability of getting a score of 1:
 1/6
• Probability of getting a score of 4:
 1/6
• Probability of getting a score of 6:
 1/6
Probabilities
Drawing a card from a pack
• There are 52 cards in a pack of playing cards
which includes:
 4 aces, 2 red and 2 black
• A card is randomly picked from the pack:
 Probability of getting an ace: 4/52
 Probability of getting a black ace: 2/52
 Probability of getting a red ace: 2/52
Probabilities
Using experience as a relative frequency
• Suppose a coin is tossed 10,000 times and
head (H) has occurred 4,980 times
• The relative frequency of head is:
 H = 4,980  10,000 = 0.498  0.5
Probabilities
Theoretical approach
• Assuming that the coin is fair, both head (H)
and tail (T) have equal chance of occurring
• Probability of the event “head”:
Number of outcomes of interest (say, Head)
Number of possible outcomes
i.e., P(H) = 1/2
and P(T) = 1/2
Probabilities
Generic concept of probabilities
• Numerator
 Event of interest
• Denominator
 All the possible events that may occur
• Probabilities are proportions:
 They range between 0 and 1
 The numerator is part of the denominator
Probabilities
Definition of probabilities
• Probability is defined as a proportionate
frequency
• If a variable can take any of N values and n
of these constitute the event of interest to
us, the probability of the event is given by
n/N
Number of outcomes of interest
Total number of outcomes
Probabilities
Rule of addition
• Mutually exclusive events
 P(A) = Probability of event A occurring
 P(B) = Probability of event B occurring
• The two events A and B are said to be mutually
exclusive if they cannot occur together
• In this case, the probability that one OR the other
occurs is the sum of the two individual probabilities
 P(A or B) = P(A) + P(B)
Rule of addition
“OR”: Additive probabilities
• What is the probability of getting a 3 (1/6) OR a 5
from a dice (1/6)
 Probability of getting a score of 3 = 1/6
 Probability of getting a score of 5 = 1/6
• 3 and 5 cannot occur at the same throw
• The total number of possible events remains 6
• “1 OR 6” constitute 2 of the 6 possible events
 Probability: 2 / 6
• The probability of getting an event or the other is
the sum of the individual probabilities
 1/6 + 1/6 = 2/6
Rule of addition
Additive probabilities
• When the events are mutually exclusive and
collectively exhaustive, the probability of
each event add up to 1
• Probability of getting a head in a coin toss:
 0.5
• Probability of getting a tail in a coin toss:
 0.5
• Probability of getting a head OR a tail
 0.5 + 0.5 = 1
Rule of addition
Rule of multiplication
• P(A) = Probability of an event A occurring
• P(B) = Probability of an event B occurring
• The two events A and B are said to be
independent if the occurrence of one has no
implications on the other
• In this case, the probability of both A and B
occurring at the same time is the product of
the two individual probabilities
 P (AB) = P (A) x P (B)
Rule of multiplication
“AND”: Multiplicative probabilities
• A coin is tossed and a dice is thrown simultaneously
• The outcome of the toss of the coin has no
implication on the result of the throw of the dice
• What is the probability of getting a head from a coin
(1/2) AND a 6 from a dice (1/6)
• The total number of possible events is a
multiplication of the possible events
 2 x 6 = 12
• “Head AND 6” is only one of the 12 possible events
 Probability: 1 / 12
• The probability of getting a combination of events is
a multiplication of the individual probabilities
 1/6 x 1/2 = 1/12
Properties of the events considered
so far
• Mutually exclusive
 If the tossed coin shows head, it does not shows
tail
• Independent
 The outcome of the coin tossing does not
influence the dice throwing
Rule of multiplication
Considering non-independent events
• Village survey
• Event A:
 Being female
 P (A): Probability of being female
• Event B:
 Being under 5
 P (B): Probability of being under 5
• Event A and event B are not independent
Non independent events
Change in the additive rule in the case of
non-mutually exclusive events
• If the events are not mutually exclusive, the total
probabilities exceed one
• Probability of being female, P(A) also includes
female under 5
• Probability of being under 5, P(B) also includes
female under 5
• Female under 5 are counted twice
• Subtract the probability of the combined events
 P(A OR B) = P(A) + P (B) - P (A AND B)
Non independent events
In
Example from a clinical trial
• Proportion of male patients = 0.60
• Proportion of young patients = 0.80
• We wish to determine the probability of patients
who were either male or young or both
Young Old
• 0.6 + 0.8 = 1.4, absurd result
Male
0.48
0.12
 (Male and Young are counted twice)
Female 0.32
Total
0.80
• Sex and age are independent
• Probability of being male and young
Total
0.60
0.08 0.40
0.20 1.00
 0.6 x 0.8 = 0.48
• Proportion who are either male or young ( or both)
 0.6 + 0.8 - 0.48 = 0.92
Non independent events
Change in the multiplicative rule in the
case of non-independent events:
Conditional probabilities
• The probability of being a female under 5 is
not equal to P (A) x P (B)
• P (A AND B) = P (A) x P (B, given A)
= P (B) x P (A, given B)
• P (B, given A) is the probability of getting
the event “under 5” (B) GIVEN that the
event “female” occurred (A)
Non independent events
Selection of a subject in a survey
• Survey in a small community of 800 subjects
 128 are aged under 5 years of age
 192 are 5–15 years of age
 480 are aged above 15 years
• A subject is selected at random
• Probability of selecting a child under 5 years of age
 128 / 800 = 0.16
• Probability of selecting a child 5 to 15 years of age
 192 / 800 = 0.24
• Probability of selecting a child older than 15 years
 128+192 / 800 = 320 / 800 = 0.40
Non independent events
Illustration of conditional probabilities
• Consider a group of 5 persons
 3 males (M1, M2, M3) and 2 females (F1, F2)
• One person is selected at random and then a second
is selected again at random from the remaining 4
• What is the probability of selecting a male twice?
• First round:
 Probability of selecting a male = 3 / 5 (M2)
• Second round:
 There are 4 persons left (M1, M3, F1, F2)
 Probability of selecting a male = 2 / 4
• Probability of selecting a male on both occasions
• 3 / 5 x 2 / 4 = 6 / 20
Non independent events
Checking from first principles
• The first person can be selected in 5 ways
 M1 or M2 or M3 or F1 or F2
• With each of these the second person can be
selected in 4 ways
 (e.g., M1 or M3 or F1 or F2 following M2)
• Total number of ways to select 2 persons
 5 x 4 = 20
• Selection of a male
 First round: 3 ways
 Second round: 2 ways
• Total number of ways to select a male twice
 3 x 2 = 6, required Probability = 6 / 20
Non independent events
Laboratory example
• Probability of ‘0’
contaminations
 0.728
• Probability of getting at
least 2 contaminations
 Probability of getting 2
contaminated cultures +
 Probability of getting 3
contaminated cultures
Number of
contaminated
cultures
0
1
2
3
Total
Number Proportionate
of
frequency
patients
364
122
13
1
500
0.728
0.244
0.026
0.002
1.000
• 0.026 + 0.002 = 0.028
Non independent events
Random variables and probability
distributions
• Statistical experiment is any process by which an
observation (or measurement) is obtained
 Counting the number of sick patients
 Measuring the birth weight of infants
• Variable is called random variable
 Discrete random variable
• The observation can take only a finite number of values
 Continuous random variable
• The observation can take infinite number of values
• The probability distribution is simply an assignment
of probabilities:
 To the specific values of the random variable
 To a range of values of the random variable Non independent events
Key messages
• Probabilities quantify chance
• The probability of occurrence of one or
another mutually exclusive events are added
• The probability of occurrence of one and
another independent event are multiplied
• Non-independent events are addressed
through conditional probabilities