STA 291-021 Summer 2007
Download
Report
Transcript STA 291-021 Summer 2007
Lecture 6
Dustin Lueker
(mu)
(sigma)
population mean
population standard deviation
2
(sigma-squared)
population variance
x or xi (x-i) observation
x (x-bar)
s
2
s
sample mean
sample standard deviation
sample variance
summation symbol
STA 291 Summer 2008 Lecture 6
2
Sample
◦ Variance
s2
2
(
x
i
x
)
n 1
◦ Standard Deviation
Population
◦ Variance
2
s
2
(
x
i
x
)
n 1
2
(
x
i
)
◦ Standard Deviation
N
2
(
x
i
)
N
STA 291 Summer 2008 Lecture 6
3
1.
2.
3.
4.
5.
Calculate the mean
For each observation, calculate the deviation
For each observation, calculate the squared
deviation
Add up all the squared deviations
Divide the result by (n-1)
Or N if you are finding the population variance
(To get the standard deviation, take the square root of the result)
STA 291 Summer 2008 Lecture 6
4
If the data is approximately symmetric and
bell-shaped then
◦ About 68% of the observations are within one
standard deviation from the mean
◦ About 95% of the observations are within two
standard deviations from the mean
◦ About 99.7% of the observations are within three
standard deviations from the mean
STA 291 Summer 2008 Lecture 6
5
Standardized measure of variation
◦ Idea
A standard deviation of 10 may indicate great
variability or small variability, depending on the
magnitude of the observations in the data set
CV = Ratio of standard deviation divided by
mean
◦ Population and sample version
STA 291 Summer 2008 Lecture 6
6
Which sample has higher relative variability?
(a higher coefficient of variation)
◦ Sample A
mean = 62
standard deviation = 12
CV =
◦ Sample B
mean = 31
standard deviation = 7
CV =
STA 291 Summer 2008 Lecture 6
7
Experiment
Random (or Chance) Experiment
Outcome
Sample Space
Event
Simple Event
◦ Any activity from which an outcome, measurement, or other
such result is obtained
◦ An experiment with the property that the outcome cannot
be predicted with certainty
◦ Any possible result of an experiment
◦ Collection of all possible outcomes of an experiment
◦ A specific collection of outcomes
◦ An event consisting of exactly one outcome
STA 291 Summer 2008 Lecture 6
8
Let A denote an even
Complement of an event A
◦ Denoted by AC, all the outcomes in the sample
space S that do not belong to the even A
◦ P(AC)=1-P(A)
Example
◦ If someone completes 64% of his passes, then what
percentage is incomplete?
STA 291 Summer 2008 Lecture 6
9
Let A and B denote two events
Union of A and B
◦ A∪B
◦ All the outcomes in S that belong to at least one of
A or B
Intersection of A and B
◦ A∩B
◦ All the outcomes in S that belong to both A and B
STA 291 Summer 2008 Lecture 6
10
Let A and B be two events in a sample space S
◦ P(A∪B)=P(A)+P(B)-P(A∩B)
At State U, all first-year students must take chemistry
and math. Suppose 15% fail chemistry, 12% fail math,
and 5% fail both. Suppose a first-year student is
selected at random, what is the probability that the
student failed at least one course?
STA 291 Summer 2008 Lecture 6
11
Let A and B denote two events
A and B are Disjoint (mutually exclusive)
events if there are no outcomes common to
both A and B
◦ A∩B=Ø
Ø = empty set or null set
Let A and B be two disjoint (mutually
exclusive) events in a sample space S
◦ P(A∪B)=P(A)+P(B)
STA 291 Summer 2008 Lecture 6
12
The probability of an event occurring is
nothing more than a value between 0 and 1
◦ 0 implies the event will never occur
◦ 1 implies the event will always occur
How do we go about figuring out
probabilities?
STA 291 Summer 2008 Lecture 6
13
Can be difficult
Different approaches to assigning probabilities to
events
◦ Subjective
◦ Objective
Equally likely outcomes (classical approach)
Relative frequency
STA 291 Summer 2008 Lecture 6
14
Relies on a person to make a judgment on
how likely an event is to occur
◦ Events of interest are usually events that cannot be
replicated easily or cannot be modeled with the
equally likely outcomes approach
As such, these values will most likely vary from person
to person
The only rule for a subjective probability is
that the probability of the event must be a
value in the interval [0,1]
STA 291 Summer 2008 Lecture 6
15
The equally likely approach usually relies on
symmetry to assign probabilities to events
◦ As such, previous research or experiments are not
needed to determine the probabilities
Suppose that an experiment has only n outcomes
The equally likely approach to probability assigns a
probability of 1/n to each of the outcomes
Further, if an event A is made up of m outcomes then
P(A) = m/n
STA 291 Summer 2008 Lecture 6
16
Selecting a simple random sample of 2
individuals
◦ Each pair has an equal probability of being selected
Rolling a fair die
◦ Probability of rolling a “4” is 1/6
This does not mean that whenever you roll the die 6
times, you always get exactly one “4”
◦ Probability of rolling an even number
2,4, & 6 are all even so we have 3 possibly outcomes in
the event we want to examine
Thus the probability of rolling an even number is
3/6 = 1/2
STA 291 Summer 2008 Lecture 6
17
Borrows from calculus’ concept of the limit
a
P( A) lim
n n
◦ We cannot repeat an experiment infinitely many
times so instead we use a ‘large’ n
Process
Repeat an experiment n times
Record the number of times an event A occurs, denote this
value by a
Calculate the value of a/n
a
P( A)
n
STA 291 Summer 2008 Lecture 6
18
“large” n?
◦ Law of Large Numbers
As the number of repetitions of a random experiment
increases, the chance that the relative frequency of
occurrence for an event will differ from the true
probability of the even by more than any small number
approaches 0
Doing a large number of repetitions allows us to
accurately approximate the true probabilities using the
results of our repetitions
STA 291 Summer 2008 Lecture 6
19