Transcript Week6
Continuous Distributions
Week 6
Objectives
On completion of this module you should be able
to:
calculate areas under the standard normal
curve,
solve and interpret problems involving the
normal distribution,
check assumptions of normality,
calculate probabilities using the uniform
distribution,
solve and interpret problems involving the
uniform distribution,
2
Objectives
On completion of this module you should be able
to:
calculate probabilities using the exponential
distribution and
solve and interpret problems involving the
exponential distribution.
3
Three continuous distributions
Normal Distribution
x
x
Uniform Distribution
x
Exponential Distribution 4
The Normal distribution
Characteristics of the normal distribution:
it is bell-shaped (symmetrical) and unimodal
(one mode),
the mean, median and mode are identical,
most of the data falls within ±1.33 standard
deviations of the mean,
the variable (X) has an infinite domain,
as X±, f(X)0 and
the total area under curve is 1.
Normal Distribution
5
x
Normal probability density function
2
1
1 2 X
f X
e
2
where
e = 2.71828…
= 3.14159…
= population mean
= population standard deviation
Normal Distribution
6
x
Mean and standard deviation
• The mean determines location…
• The standard deviation determines the spread…
7
Standardised Normal distribution
1
1 2 Z 2
f Z
e
2
where Z
X
The standard normal distribution always has a
mean of 0 (is centred on zero) and a standard
deviation of 1.
8
Example 6-1
A final exam for a particular accountancy course
is known by students to be a difficult one.
In the past, the mean mark was 62% and the
standard deviation was 11%.
What proportion of students have received a
mark of:
(a) At least 65%
9
Solution 6-1
We are told that μ = 62 and = 11.
= 62
= 11
Normal
distribution
65
10
Solution 6-1
Using the transformation formula:
Z
X
65 62
0.27 (to 2 dec. pl.)
11
= 0
= 1
Standard
Normal
distribution
0.27
11
Solution 6-1
This means that P X 65 P Z 0.27
We can now look the standardised value up in
table (Table E.2).
Tables gives probabilities of less than specified
Z values.
We know the probability under the curve is 1, so
we can subtract the tabulated values from 1 to
get our desired probability.
12
Using Table E.2 from the text
Table E.2 gives P(Z < 0.27):
We want
1 – P(Z < 0.27) = P(Z > 0.27):
13
Solution 6-1
P X 65 P Z 0.27 1 P Z 0.27
1 0.6064 0.3936
So 39.36% of students will receive more than
65% on the exam.
Always sketch the required area when solving
normal distribution problems – this helps you
find the correct probability area!
14
Solution 6-1
What proportion of students have received a mark of:
(b) at least 50%
X 50 62
Z
1.09
11
P X 50 P Z 1.09
1 P Z 1.09
Z = – 1.09
(50%)
1 0.1379
0.8621
So 86.21% of students can be expected to receive
more than 50% on the exam.
15
Solution 6-1
What proportion of students have received a
mark of:
(c) less than 40%
Z
X
40 62
2
11
P X 40 P Z 2
0.0228
Z = – 2
(40%)
So 2.28% of students can be expected to receive
less than 40% on the exam.
16
(70%)
Z
(100%)
= 3.45
0.73
Solution 6-1
What proportion of students have received a mark of:
(d) between 70% and 100%
X 70 62
Z1
0.73
11
X 100 62
Z2
3.45
Z = 0.73
Z = 3.45
11
(70%)
(100%)
P 70 X 100 P 0.73 Z 3.45
0.99972 0.7673
0.23242
So 23.24% of students will receive between 70% and
100% on the exam.
17
Solution 6-1
Note we could logically have assumed that no
student could get more than 100% on the
exam.
This would mean that
P X 100 1
and so
P 70 X 100 P X 100 P X 70
1 P Z 0.73
1 0.7673
0.2327
18
Solution 6-1
(d) Between what two marks symmetrically
distributed around the mean will 95% of the
students’ marks fall?
P Z1 Z Z 2 0.95
95%
0.025
0.475 0.475
0.025
19
Solution 6-1
We look for 0.025 in Table E.2.
It corresponds to a Z value of -1.96.
Therefore Z1 = –1.96 and Z2 = 1.96 (since the
distribution is symmetrical).
0.025
0.025
0.475 0.475
– 1.96
+ 1.96
20
Solution 6-1
Using Z
X
we discover that
X 62 1.96 11 X 62
1.96
11
X 62 1.96 11 40.44
X 62
and 1.96
1.96 11 X 62
11
X 62 1.96 11 83.56
So 95% of students can be expected to receive
between 40.44% and 83.56%.
21
Evaluating the normality assumption
Recall that the normal distribution is:
bell-shaped
has IQR equal to 1.33 standard deviations
is continuous with an infinite range
We can begin checking for normality using:
a box-and-whisker plot
a stem-and-leaf display (for small data sets)
or a histogram (for larger data sets)
22
Evaluating the normality assumption
Examining summary statistics and checking that:
mean, median and mode are all similar
IQR is approximately equal to 1.33 standard
deviations
range is approximated by 6 standard
deviations
2/3 of observations lie within ±1 standard
deviation of the mean, 4/5 within ±1.28
standard deviations, 19 out of 20 observations
within ±2 standard deviations.
23
Example 6-2
Last term, a group of 21 students enrolled in an
accounting course on a particular campus.
Their scores on the final exam are recorded
below.
Determine whether or not these marks are
normally distributed by evaluating the actual
versus theoretical properties and by constructing
a normal probability plot.
59
64
48
49
75
76
51
74
51
53
48
58
67
71
43
44
43
72
63
62
64
24
Solution 6-2
We begin with a five number summary:
Five-number Summary
Minimum
First Quartile
43
48.5
Median
59
Third Quartile
69
Maximum
76
and the mean and standard deviation:
Mean
58.80952
Std. Deviation
11.10234
25
Solution 6-2
The mean (58.80952) is very similar to the median
(59).
The mode is not very helpful with such a small
data set.
IQR = 69 – 48.5 = 20.5
1.33 standard deviations: 1.33 11.10234 = 14.77
(2 dec. pl.)
Range = 76 – 43 = 33
6 std. dev. = 6 11.10234 = 66.61 (2 dec. pl.)
The stem-and-leaf diagram given by PHStat2 is
not particularly useful (try this yourself to see why).
26
Box plot of exam m arks
M arks
40
45
50
55
60
65
70
75
80
27
28
Solution 6-2
The data appears to be roughly symmetrical.
The upper and lower quartiles may be a little
too large and too small respectively to fit the
normal distribution.
The data on the normal probability plot
(produced using PHStat2) approximately
follows a straight line.
This is a small data set so accurate
conclusions are difficult to draw, but the data is
probably approximately normally distributed.
29
Uniform distribution
1
f X
if a X b
ba
where
a = minimum value of X
b = maximum value of X
ab
2
b a
2
12
x
Uniform Distribution
30
Example 6-3
A surfer knows that the time between wipe-outs
(falling off his surfboard) is uniformly distributed
between two minutes and nine minutes in
particularly large surf.
What is the probability that the time between
wipe-outs is:
(a) less than five minutes?
31
Example 6-3
0.2
0.1
1
2
3
4
5
6
7
8
9
10
The total area under the rectangle is 1, so
since the length is b – a = 9 – 2 = 7, the height
1
1
1
must be
0.1428...
ba 92 7
y
0.2
0.1
1 2 3 4 5 6 7 8 9 10 x
32
Solution 6-3
y
P X 5 P 2 X 5
0.2
0.1
1 2 3 4 5 6 7 8 9 10 x
1
5 2
9
2
3
7
Base Height
33
Solution 6-3
What is the probability that the time between
wipe-outs is:
(b) between three and four minutes?
P 3 X 4
y
0.2
0.1
1 2 3 4 5 6 7 8 9 10 x
1
4 3
92
1
7
34
Solution 6-3
What is the probability that the time between
wipe-outs is:
(c) more than six minutes?
P X 6
y
0.2
0.1
1 2 3 4 5 6 7 8 9 10 x
1
9 6
92
3
7
35
Solution 6-3
(d) What is the expected value of the time
between wipe-outs?
ab 29
5.5
2
2
(e) What is the standard deviation of the time
between wipe-outs?
b a
2
9 2
2
12
12
2.0207 (to 4 dec. pl.)
36
Exponential distribution
f arrival time X 1 e X
where
x
Exponential Distribution
e = 2.71828…
= the population mean number of arrivals
per unit
X = any value of the continuous variable
where 0 X
37
Example 6-4
People are known to arrive at a particular
vending machine at a mean rate of 27 per hour.
Assuming that these arrival times follow an
exponential distribution, find the probability that
the next person will arrive:
(a) within one minute?
We are given:
27
P arrival time X 1 e
X
1 e27 X
38
Solution 6-4
(a) P arrival time 1 minute 1 e
1
27
60
0.3624 (to 4 dec. pl.)
Note that we converted the units from minutes
to portions of hours since the variable is
expressed in hours.
(b) within five minutes?
P arrival time 5 minutes 1 e
5
27
60
0.8946
39
Solution 6-4
(c) in more than five minutes?
P arrival time>5 minutes
1 P arrival time 5 minutes
1 0.8946 0.1054
40
After the lecture each week…
Review the lecture material
Complete all readings
Complete all of recommended problems (listed
in SG) from the textbook
Complete at least some of additional problems
Consider (briefly) the discussion points prior to
tutorials
41