Chapter 6: Modeling Random Events... Normal

Download Report

Transcript Chapter 6: Modeling Random Events... Normal

+
Chapter 6: Modeling
Random Events... Normal
& Binomial Models
+
Models...

Help us predict what is likely to happen

Remember LSRLs (a model for linear, bi-variate data); not
exact (some points above line, some below); but best model
we have

Two more models discussed in this chapter are for Normal
(or ≈ Normal) and for Binomial distributions

Both of these describe/model numeric, uni-variate data

Again, models are not perfect... But in many cases, they are
good, effective, or “good enough”
+
Two types of numeric data...

Discrete random variables/data (we will use binomial
distribution with discrete data)

Continuous random variables/data (we will often use Normal
distribution with continuous data)

Let’s discuss discrete random variables/data first
+ Discrete Random Variables

Discrete random variables have a “countable” number of possible
positive outcomes and must satisfy two requirements. Note:
‘countable’ is not the same as finite.

(1) Every probability is a # between 0 and 1;

(2) The sum of the probabilities is 1
+ World-Wide 2015 High School AP Statistics
Score Distribution

Is this a discrete random variable probability distribution?
Why or why not?
1
2
3
4
5
.238
.189
.252
.189
.132
+
Discrete Random Variables...
Discuss
other examples of discrete
random variables. 1 minute.
+
+ Other Examples of Discrete Random
Variables...
•
Number of times people have seen Shawn Mendes in concert
•
Number of gifts we get on our birthday
•
Number of burgers sold at In-N-Out Burger per day
•
Number of stars in the sky
All are whole, countable numbers; all vary; usually represented by a
table or probability histogram
+
Non-examples of Discrete Random
Variables...

Your height

Weight of a candy bar

Time it takes to run a mile
+
Continuous Random Variables...
are
usually measurements
heights, weights, time
amount
of sugar in a granny smith
apple, time to finish the New York
marathon, height of Mt. Whitney
+
How can we distinguish between
continuous and discrete?
 Discuss
in your groups for a few minutes.
+
How can we distinguish between
continuous and discrete?
 Discuss
in your groups for a few minutes.
 Ask
yourself ‘How many? How much? Are
you sure?’
 For
example, # of children, pounds of
Captain Crunch produced each year, # of
skittles, ounces in a bag of skittles
+ Continuous Random Variables ...
•
take on all values in an interval of numbers
•
probability distribution is described by a density curve
•
probability of event is area under the density curve and
above the values of X that make up the event
•
total area under (density) curve = 1
+ Continuous Random Variables...

Probability distribution is area under the density curve, within an
interval, above x-axis
+
Continuous Random Variables ...


+ Random Variables...
Consider a six-sided die... What is the probability...
P ( roll less than or equal to a 2) is
P ( roll less than a 2) is
Different probabilities; discrete random variable
Note: possible outcomes are 1, 2, 3, 4, 5, 6; but
probabilities of those outcomes are (often)
fractions/decimals
+ Continuous random variables ...

All continuous random variables assign probabilities to intervals

All continuous random
variables assign a probability of zero to every
individual outcome. Why?
+
Continuous Random Variables...
 There
is no area under a vertical line (sketch)
 Consider...
0.7900 to 0.8100
P = 0.02
0.7990 to 0.8010
P = 0.002
0.7999 to 0.8001
P = 0.0002
P (an exact value –vs. an interval--) = 0
+
Density Curves & Continuous
RV’s..

Can use ANY density curve to assign probabilities/model
continuous distributions/RV’s; many models

Most familiar density curves are the Normal (bell) density
curves

Based on Empirical Rule, 68-95-99.7, symmetric, uni-modal,
chapter 3)

Many distributions/events are considered Normal & can be
modeled by Normal density curves, such as ... cholesterol
levels in young boys, heights of 3-year-old females, Tiger
Woods’ distance golf ball travels on driving range, basic
skills vocabulary test scores for 7th graders, etc.
+
Mean & Standard Deviation of Normal
Distributions...

μx for continuous random variables lies at the center of a
symmetrical (or fairly symmetrical) density curve (Normal or
approximately Normal)

N (μ, σ)

Remember... μand σ are population parameters;
sample statistics.


Calculating σ and/or σ2 for continuous random
variables….
beyond the scope of this course… will be given this
information if needed
xand
s are
+
Normal distributions/density
curves...
+
Four possibilities that we need to
know how to calculate...

When calculating probabilities for Normal distributions,
there are four possibilities that we might be asked to
calculate.

Let’s draw some pictures and match up some questions to
each drawing

Remember for continuous random variables (like Normal
distributions) there is no difference between ‘less than’ and
‘less than or equal to;’ likewise no difference between
‘greater than’ or ‘greater than or equal to.’
+
Female heights... N(64.5, 2.5)
+
Female heights... N(64.5, 2.5)

Calculate the probability that a randomly chosen female is:

shorter than 63 inches

no more than 61 inches

taller than 66 inches

68 inches or taller

between 63.5 inches and 64 inches

between 5 inches and 72 inches
+
Female heights... N(64.5, 2.5)

Calculate the probability that a randomly chosen female is:


shorter than 63 inches or taller than 70 inches
as short or shorter than 65 inches or as tall or taller than 67 inches
+
Female heights... N(64.5, 2.5)

Sometimes we want to turn this around... Sometimes we are
given a probability & we need to find the value that
corresponds to that probability.

What is the height of a randomly chosen woman if she is in
the 20th percentile? Let’s draw a picture...

What is the height of a randomly chosen woman if she is in
the 85th percentile?
+
Normal model...

Very helpful, but one size does not fit all

Good first choice if data is continuous, uni-modal, symmetric,
68-95-99.7
+
Another important, “special” type of
distribution...
 If
certain criteria is met, easier to calculate
probabilities in specific situations
 Next
types of distributions we will examine are
situations where there are only two outcomes
 Win
or lose; make a basket or not; boy or girl ...
+
Discuss situations where there are only
two outcomes...
 Yes
or no
 Open
or closed
 Patient
has a disease or doesn’t
 Something
 Person
A
is alive or dead
has a job or doesn’t
part is defective or not
+
that is what this section is all about...
 ... a
class of distributions that are concerned about
events that can only have 2 outcomes
 The
Binomial Distribution
+
Binary; Independent; fixed Number;
probability of Successes
The Binomial setting is:
1.
Each observation is either a success or a failure (i.e., it’s
binary)
2.
All n observations are independent
3.
Fixed # (n) of observations
4.
Probability of success, p, is the same for each observation
“BINS”
+
Binomial Distribution: practice…
I roll a die 3 times and observe each roll to see if it is
even or odd. Is:
1.
each observation is either a success or a failure?
2.
all n observations are independent?
3.
fixed # (n) of observations?
4.
probability of success, p, is the same for each
observation ?
BINS
+
Binomial Distribution
 If
BINS is satisfied, then the distribution can be
described as B (n, p)
B
binomial
n
the fixed number of observations
p
probability of success
 Note:
This is a discrete probability distribution.
 Remember
N (μ, σ)… is that discrete??
+
Binomial Distribution
 Most
important: being able to recognize situations
and then use appropriate tools for that situation
 Let’s
practice...
+
Are these binomial distributions? Why
or why not?
 Toss
a coin 20 times to see how many tails occur.
 Asking
200 people if they watch ABC news
 Rolling
a die until a 6 appears
+
Are these binomial distributions or
not? Why or why not?
 Asking
20 people how old they are
 Drawing
 Rolling
5 cards from a deck for a poker hand
a die until a 5 appears
+
How could we change the situations
to make these/force these to be
binomial distributions?
 Asking
20 people how old they are
 Drawing
 Rolling
5 cards from a deck for a poker hand
a die until a 5 appears
+ Side note: Binomial Distribution... Is
the situation ‘independent enough’?
An engineer chooses a SRS of 20 switches from a shipment of 10,000
switches. Suppose (unknown to the engineer) 12% of switches in
the shipment are bad.
Not quite a binomial setting. Why?
For practical purposes, this behaves like a binomial setting; ‘close
enough’ to independence; as long as sample size is small
compared to population. Rule of thumb: sample ≤ 10% of
population size
+
Binomial probabilities...in Minitab
use Probability Density Function

Two different ‘flavors’ of binomial calculations... First one is
use for finding the probability of an exact value, a particular
point (i.e., probability that a family has exactly 2 boys)

Remember, these are discrete random variables

Can calculate probabilities of exact values (unlike
continuous random variables)

Minitab: go to Probability, Probability Density Function, x =,
binomial, trials, event probability
+
Binomial probabilities: in Minitab
use Cumulative Distribution
Function...

Use for cumulative values (i.e., probability that a family has at
most 2 boys; at least 3 girls; no more than 4 boys, etc.)

Cumulative to left; at that value or less

If want area to right, need to do “1 minus ....”

Pay attention to < vs. ≤; and > vs. ≥

Minitab go to Probability, Cumulative Distribution Function,
value (what x = ), binomial, trials, event probability
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that exactly 2 of them have type
O blood?
Binomial setting? Check for BINS.
p = 0.25
n=5
X=2
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that exactly 2 of them have type
O blood?
Binomial setting? Check for BINS.
p = 0.25
n=5
X=2
= 0.2636; context, always!
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that exactly 4 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=4
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type
O. If these parents have 5 children, what is
the probability that exactly 4 of them have
type O blood? Binomial setting? Check
for BINS.
p = 0.25
n=5
X=4
= 0.0146; context, always
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that exactly 1 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=1
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that exactly 1 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
= 0.3955; context, always
X=1
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type
O. If these parents have 5 children, what is
the probability that at most 2 of them have
type O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=2
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type
O. If these parents have 5 children, what is
the probability that at most 2 of them have
type O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=2
= 0.8965; context, always
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type
O. If these parents have 5 children, what is
the probability that at most 4 of them have
type O blood? Binomial setting? Check
for BINS.
p = 0.25
n=5
X=4
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that at most 4 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=4
= 0.9990, context, always
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that at most 1 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=1
+ Practice...
Each child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
probability that at most 1 of them have type
O blood? Binomial setting? Check for
BINS.
p = 0.25
n=5
X=1
= 0.6328, context, always
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that at least 2 (meaning 2, 3, 4,
or 5) of them have type O blood?
p = 0.25
n=5
X = 2, 3, 4, or 5
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that at least 2 (meaning 2, 3, 4,
or 5) of them have type O blood?
p = 0.25
n=5
X = 2, 3, 4, or 5
1 - 0.6328 = 0.3672; context, always
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that at least 3 (meaning 3, 4,
or 5) of them have type O blood?
p = 0.25
n=5
X = 3, 4, or 5
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that at least 3 (meaning 3, 4, or
5) of them have type O blood?
p = 0.25
n=5
X = 3, 4, or 5
1 - 0.8965 = 0.1035; context, always
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that more than 3 (meaning 4
or or 5) of them have type O blood?
p = 0.25
n=5
X = 4 or 5
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that more than 3 (meaning 4
or or 5) of them have type O blood?
p = 0.25
n=5
X = 4 or 5
1 - 0.9844 = 0.0156; context, always
+ Practice...
... probability 0.25 of having blood type O. If
these parents have 5 children, what is the
probability that … of them have type O
blood?
p = 0.25
a) …at most 3 of them…
b) …at least 4 of them…
c) …more than 1 of them…
d) …exceeds 3 of them…
e) … below 2 of them…
f) … 0 of them…
n=5
+
Binomial Distribution: Mean and
Standard Deviation
If
a basketball player makes 75% (“p”
of her free throws, what do you think
the mean number of baskets made
will be in 12 tries?
x

+
Binomial Distribution: Mean and
Standard Deviation

If a basketball player makes 75% (“p” of her free throws,
what do you think the mean number of baskets made will be
in 12 tries?

(0.75) (12) = 9; we expect she should make 9 baskets in 12
tries.

Her mean number of baskets made should be 9.

We expect her

x
= 9 baskets; E(x) = 9
+
Binomial Mean & Standard Deviation
 If
a count X is a binomial distribution with number
of observations n and probability of success p,
then
 μ=
np
σ=
and
 Only
np(1  p)
for use with binomial distributions;
remember criteria... BINS

+
Practice...
 If
a basketball player makes 75% (“p”) of her free
throws, we expect her to make 9 baskets in 12
tries.
 What
is the SD of this distribution?

+
Practice...
 If
a basketball player makes 75% (“p”) of her free
throws, we expect her to make 9 baskets in 12
tries.
 What
 SD
=
is the SD of this distribution?
np(1  p)
=
(12)(0.75)(1 0.75)
= 1.5

+ Practice...
 Each
child born to a particular set of parents
has probability 0.25 of having blood type O.
If these parents have 5 children, what is the
mean and standard deviation for this
distribution?
+ Practice...
 Each
child born to a particular set of parents has
probability 0.25 of having blood type O. If these
parents have 5 children, what is the mean and
standard deviation for this distribution?
 mean
= np = (5)(.25) = 1.25; the family should
expect to have 1.25 children with type O blood
SD
= np(1  p)
= 0.9682
=
(5)(0.25)(1 0.25)
+ Remember & Caution...
 Binomial
distribution is a special case of a
probability distribution for a discrete random
variable
 All
binomials distributions are discrete random
variable distributions BUT not all discrete random
variable distributions are binomial distributions
 Don’t
assume; don’t apply binomial tools to all
discrete RV distributions; must meet binomial
distribution criteria (BINS)
+
Next test... Exam #2 ...

Chapters 4, 5, & 6