Basic Combinatorics - Mr. Liu`s CCHS Classes
Download
Report
Transcript Basic Combinatorics - Mr. Liu`s CCHS Classes
Give the number of objects described.
1. The number of cards in a standard deck.
2. The number of face cards in a standard deck.
3. The number of vertices of a octogon.
4. The number of faces on a cubical die.
5. The number of possible totals when two dice are rolled.
Slid
e 93
Give the number of objects described.
1. The number of cards in a standard deck. 52
2. The number of face cards in a standard deck. 12
3. The number of vertices of a octogon. 8
4. The number of faces on a cubical die. 6
5. The number of possible totals when two dice are rolled. 11
Slide 9- 4
! Means factorial
0!=1
1!=1
n!=(n)(n-1)(n-2)(n-3)….
6!=6*5*4*3*2*1=720
𝑛
Combination: 𝐶𝑛,𝑟 𝑜𝑟 𝐶
𝑜𝑟 𝑛𝐶𝑟 =
𝑟!
𝑟
𝑛
Permutation: 𝑃𝑛,𝑟 𝑜𝑟 𝑃
𝑜𝑟 𝑛𝑃𝑟 =
𝑟
𝑛!
𝑛−𝑟 !
𝑛!
𝑛−𝑟 !
Order does not matter! It is not important
› If you have 3,1,2, it is the same as 1,3,2
because they all have 1,2,3
Different Arrangement of things
Combination is choosing
How many 10 person committees can be formed
from a group of 20 people?
Slid
e 99
How many 10 person committees can be formed
from a group of 20 people?
Notice that order is not important. Using combinations,
20!
C
184, 756.
10! 20 10 !
20
10
There are 184,756 possible committees.
Slid
e 910
If you have three spots for ASB officers. If
there are 10 people apply to be an
officer, how many different combination
of people can you have?
10C3
Order does matter! It is important
Note: Permutation is position
112 people ran the marathon and only
top 3 runner gets the prize. How many
different ways winners can be
organized?
112P3
16 Actors answer a casting call to try out
for roles as dwarfs in a productions of
Snow White and the Seven Dwarfs. In
how many different ways can the
director cast the seven roles?
16P7
Both of them break down into two different
category:
Combination with repetition (ex: ice cream
scoops)
Combination without repetition (ex: lottery)
Permutation with repetition (ex: lock in
locker room)
Permutation without repetition (ex:
marathon race)
You have 8 people, what are the
number of possible ordered seating
arrangement for 5 chairs
Permutation, 𝑃8,5 =
8!
8−5 !
= 6720
Gamestop has 25 new games this month
and you decided to buy 5 of them. How
many different arrangement of game you
can have?
Combination=𝐶25,5 =
25!
5!20!
= 53130
What is the chance of winning the
jackpot for MEGA Million?
You have 5 slots + 1 slot for MEGA
number
The first 5 slots are numbers between 175, mega number is number between 115
75!
∗
70!5!
15 = 258890850
.00000000386
Or .000000386%
1 in 258890850
What is the chance of winning the
Jackpot for Powerball?
You have 5 slots+1 slot for Power number
The first 5 slots are numbers between 156, powerball slot is number between 135
1 in 133,693,560
Even though it’s harder to win the
Jackpot, for overall winning chance, you
have more chance for MEGA million
than Powerball
If a procedure P has a sequence of stages S , S ,..., S and if
1
2
n
S can occur in r ways,
1
1
S can occur in r ways
2
2
S can occur in r ways,
n
n
then the number of ways that the procedure P can occur is the
product rr ...r .
1
2
n
Slid
e 928
If a license plate has four letters followed by three
numerical digits. Find the number of different license
plates that could be formed if there is no restriction on
the letters or digits that can be used.
Slid
e 929
If a license plate has four letters followed by three
numerical digits. Find the number of different license
plates that could be formed if there is no restriction on
the letters or digits that can be used.
You can fill in the first blank 26 ways, the second blank 26 ways, the third blank 26
ways, the fourth blank 26 ways, the fifth blank 10 ways, the sixth blank 10 ways,
and the seventh blank 10 ways. By the Multiplication Principle, there are
26×26×26×26×10×10×10 = 456,976,000 possible license plates.
Slid
e 930
California License Plate has 3 letters and
4 numbers. Find the number of different
license plates that could be formed.
10*26*26*26*10*10*10=175,760,000 ways
You have 7 people with 6 seats. How
many different ways can you make?
(Hint: does order matter?)
There are n! permutations of an n-set.
Slid
e 934
Count the number of different 8-letter “words” that can be
formed using the letters in the word COMPUTER.
Slid
e 935
Count the number of different 8-letter “words” that can be
formed using the letters in the word COMPUTER.
Each permutation of the 8 letters forms a different word. There are 8! =
40,320 such permutations.
Slid
e 936
There are n ! distinguishable permutations of an n-set containing n
distinguishable objects.
If an n-set contains n objects of a first kind, n objects of a second
1
2
kind, and so on, with n n ... n n, then the number of
1
2
k
distinguishable permutations of the n-set is
n!
.
n !n !n ! n !
1
2
3
k
Slid
e 937
n
There are 2 subsets of a set with n objects (including the
empty set and the entire set).
Slid
e 938
http://www.mathsisfun.com/combinatori
cs/combinations-permutations.html
Read the different examples
P708 #5-42 eoe
Use the distributive property to expand the binomial.
1. x y
2
2. (a 2b)
3. (2c 3d )
4. (2 x y )
2
2
2
5. x y
3
Slid
e 942
Use the distributive property to expand the binomial.
1. x y
x 2 xy y
2
2
2
2. (a 2b) a 4ab 4b
3. (2c 3d ) 4c 12cd 9d
4. (2 x y ) 4 x 4 xy y
2
2
2
2
2
2
5. x y
3
2
2
2
x 3x y 3 xy y
3
2
2
3
Slide 9- 43
What is Pascal’s Triangle?
It is the coefficient for the expansion of
the binomials 𝑎 + 𝑏 𝑛
The binomial coefficients that appear in the expansion of (a b)
n
are the values of C for r 0,1, 2,3,..., n.
n
r
A classical notation for C , especially in the context of binomial
n
r
n
coefficients, is . Both notations are read "n choose r."
r
Slid
e 945
n n 1 n 1
r r 1 r or, equivalently, C C C
n
r
n 1
r 1
n 1
Slid
e 946
r
For any positive integer n,
n
n
n
n
a b a a b ... a b ... b ,
0
1
r
n
n
n!
where C
.
r !(n r )!
r
n
n
n
n 1
nr
r
n
r
Slid
e 947
Expand a b , using a calculator to compute the binomial coefficients.
4
Slid
e 948
Expand a b , using a calculator to compute the binomial coefficients.
4
0,1, 2,3, 4 into the calculator to find the binomial
coefficients for n 4. The calculator returns the list 1,4,6,4,1 .
Enter 4 C
n
r
Using these coefficients, construct the expansion:
a b
4
a 4a b 6a b 4ab b .
4
3
2
2
3
4
Slid
e 949
𝑥+4
5
Expand:
4𝑥 + 𝑦 3
Expand:
5𝑥 + 3𝑎
8
Coefficient of 𝑥 10 in the expansion of
𝑥 + 2 15
15𝐶10𝑥 10 25
Find the coefficient of 3𝑥
expansion of 3𝑥 + 4𝑦 7
4
in the
Find the coefficient of 𝑎5 in the
expansion of 𝑎 + 2𝑏 10
For any integer n 1, n ! n n 1!
For any integer n 0, n 1! n 1 n !
Slid
e 957
𝑛
𝑛+1
−
=n
2
2
𝑛
Remember 𝐶( )
𝑟
𝑛
𝑛+1
−
=
2!
2
2
𝑛+1 !
𝑛!
−
𝑛+1−2 !
2! 𝑛−2 !
𝑛+1 𝑛 𝑛−1 !
𝑛 𝑛−1 𝑛−2 !
−
2 𝑛−1 !
2 𝑛−2 !
=
𝑛2 +𝑛
2
2𝑛
2
=
=
=n
−
𝑛2 −𝑛
2
P715 #1-34 eoe
Probability is a numerical measure
between 0 and 1 that describes the
likelihood that an event will occur.
Probabilities closer to 1 indicate that the
event is more likely to occur.
Probabilities closer to 0 indicated that
the event is less likely to occur.
P(A) = probability of event A; you read it as
“P of A”.
P(A)=1, the event A is certain to occur
P(A)=0, the event A is certain to not occur
› Binary number works like this…1 means it’s true, 0
means false.
“There are only 10 types of people in the
world: those who understand binary, and
those who don't”
1) A probability assignment based on intuition
incorporates past experience, judgment, or
opinion to estimate the likelihood of an event
2) A probability assignment based on relative
frequency uses the formula
› Probability of event=relative frequency=
𝑓
𝑛
› Where f is the frequency of the event occurrence in
a sample of n observations
3) A probability assignment based on equally
likely outcomes uses the formula
› Probability of
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠 𝑓𝑎𝑣𝑜𝑟𝑎𝑏𝑙𝑒 𝑡𝑜 𝑒𝑣𝑒𝑛𝑡
event=
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
Intuition – NBA announcer claims that Kobe
makes 84% of his free throws. Based on this, he
will have a high chance of making his next free
throw.
Relative frequency – Auto Fix claims that the
probability of Toyota breaking down is .10
based on a sample of 500 Toyota of which 50
broke down.
Equally likely outcome - You figure that if you
guess on a SAT test, the probability of getting it
right is .20
Create a situation for each of the
probability assignments. (intuition,
relative frequency, equally likely
outcome)
Show me
In the long run, as the sample size
increases, the relative frequencies of
outcomes get closer to the theoretical
(or actual) probability value
Example: The more numbers you ask, the
more likelihood that P(getting a girl’s real
number)=1
The more numbers you ask, the more
likelihood that P(getting a (hot) girl’s real
number)=1
Then after collecting all the numbers, the
more girls you ask out on a date, the
more likelihood that P(getting a date)=1
Casino (the more you play, the more you
lose)
Insurance (the more people you insure,
the less the likelihood the company have
to pay for the insurance benefits)
Statistical experiment or statistical
observation can be thought of as any
random activity that results in a definite
outcome
An event is a collection of one or more
outcomes of a statistical experiment or
observation
Simple event is one particular outcome of a
statistical experiment
The set of all simple events constitutes the
sample space of an experiment
Brown eyes’ genotype is Bb or BB
Blue eyes’ genotype is bb
If your Dad has Brown eyes (and his dad
has blue eyes) and your Mom has blue
eyes, what’s the probability that you
have blue eyes?
Dad
Mom
B
b
b
Bb
bb
b
Bb
bb
P(blue eyes)=
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑓𝑎𝑣𝑜𝑟𝑎𝑏𝑙𝑒 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑢𝑡𝑐𝑜𝑚𝑒𝑠
2
1
=4=2
You are running out of time in a true/false quiz. You
only have 4 questions left! How should you guess?
P(all false)=
P(3 false)=
P(all true)=
P(2 false)=
P(1 true)=
P(1 false)=
P(2 true)=
P(3 true)=
Your sample space should have 16 different combinations
TTTT
FTTT
TFTT
TTFT
TTTF
FFTT
FTFT
FTTF
TFFT
TFTF
TTFF
FFFT
TFFF
FTFF
FFTF
FFFF
P(all false)= 1/16
P(3 false)= 4/16
P(all true)= 1/16
P(2 false)= 6/16
P(1 true)=
P(1 false)= 4/16
P(2 true)= 6/16
P(3 true)= 4/16
You will probably choose 2 true and 2 false
4/16
The sum of the probabilities of all simple
events in a sample space must equal 1
The complement of event A is the event
that A does not occur. 𝐴𝑐 designates the
complement of event A. Furthermore,
1) P(A)+P(𝐴𝑐 )= 1
P(event A does not occur)=P 𝐴𝑐 = 1 − 𝑃(𝐴)
P(getting A in Mr. Liu’s class)+P(not
getting A in Mr. Liu’s class) =1
P(getting A in Mr. Liu’s class)=.15
What’s the P(not getting A in Mr. Liu’s
class)?
P(not getting A in Mr. Liu’s class)= .85
P(having a date on a Friday)=1/7
What’s the P(not having a date on a
Friday)?
6/7
How many outcomes are possible for the following experiments.
1. Two coins are tossed.
2. Two different 6-sided dice are rolled.
3. Two chips are drawn simultaneously without replacement from
a jar with 8 chips.
4. Two different cards are drawn from a standard deck of 52.
C
5. Evaluate without using a calculator.
C
4
2
8
2
Slid
e 981
How many outcomes are possible for the following experiments.
1. Two coins are tossed. 4
2. Two different 6-sided dice are rolled. 36
3. Two chips are drawn simultaneously without replacement from
a jar with 8 chips. 28
4. Two different cards are drawn from a standard deck of 52. 1326
C
5. Evaluate without using a calculator.
3/14
C
4
2
8
2
Slide 9- 82
If E is an event in a finite, nonempty sample space S of equally likely
outcomes, then the probability of the event E is
the number of outcomes in E
P( E )
.
the number of outcomes in S
Slid
e 983
P(5 on 1st die and 5 on 2nd die)
P(ace on 1st card and ace on 2nd card)
What is the difference between these
two situation?
In the first situation, the first result does
not effect the outcome of the 2nd result.
In the second situation, the first result
does effect the outcome of the 2nd
result.
Two events are independent if the
occurrence or nonoccurrence of one
does not change the probability that the
other will occur
What does it mean if two events are
dependent?
P(A and B)=𝑃 𝐴 ∗ 𝑃(𝐵)
This means event A AND event B both
have to happen!!! You multiply the
events. You find the probability of two
events happening together.
This is the formula if event A and event B
are independent.
Then we must take into account the
changes in the probability of one event
caused by the occurrence of the other
event.
Sample
Space
A and B
A
B
P(A and B)=𝑃 𝐴 ∗ 𝑃 𝐵 𝐴
Or
P(A and B)=𝑃 𝐵 ∗ 𝑃 𝐴 𝐵
It is known as conditional probability
𝑃 𝐴 𝑎𝑛𝑑 𝐵
𝑃 𝐵
𝑃 𝐴𝐵 =
given event B”
𝑃
𝐵𝐴
= “Probability of event A
𝑃(𝐴)
𝑃 𝐴𝐵 =
Quick group work:
𝑃(𝐵)
› What is P 𝐵 𝐴 ?
Your friend has 2 children. You learned
that she has a boy named Rick. What is
the probability that Rick’s sibling is a
boy?
Take a guess
If you guessed ½ or 50%, that is incorrect.
First: Think about all the possible outcomes
› S {BB, BG, GB, GG}
What is P(boy and boy)?
What is P(boy)?
You want to find 𝑃 𝐵 𝐵
𝑃 𝑏𝑜𝑦 𝑎𝑛𝑑 𝑏𝑜𝑦
=
𝑃 𝑏𝑜𝑦
=
1
4
3
4
=
1
3
A machine produce parts that’s either
good (90%), slightly defective (2%) or
obliviously broken (8%). The parts gets
through an automatic inspection
machine that is able to find the oblivious
broken parts and throw them away.
What is the probability of the quality part
that make it through and get shipped?
P(Good given not broken)=
𝑃 𝐺𝑜𝑜𝑑 𝑎𝑛𝑑 𝑛𝑜𝑡 𝑏𝑟𝑜𝑘𝑒𝑛
.90
=
= .978 = 97.8%
𝑃(𝑛𝑜𝑡 𝑏𝑟𝑜𝑘𝑒𝑛)
.92
Conditional Probability can be very
intriguing and complicated. We won’t
go into any more in depth…..or
maybe….
Very important to understand about
probability is that are the events
dependent or independent.
Suppose you are going to throw 2 fair
dice. What is the probability of getting a
3 on each die?
A) Is this situation independent or
dependent?
B) Create all the sample space (all the
potential outcomes)
C) What is the probability?
A) Independent because one event
does not affect the second event
B) You should have 36 total outcomes
C) 1/36
I took a die away. Now you only have
ONE die! Again you toss the die twice.
What is the probability of getting a 1 on
the first and 4 on the second try?
It is still an independent event!
1/36
The last two examples are considered
multiplication rule, independent events.
Mr. Liu has a 85% probability of teaching
statistics next year. Mr. Riley has a 15%
probability of teaching statistics next
year. What is the probability that both
Mr. Liu and Mr. Riley teach statistics next
year?
.85*.15=.1275 or 12.75% probability
Suppose you have 100 Iphones. The
defective rate of iphone is 10%. What is
the probability that you choose two
iphones and both are defective?
P(1st defective iphone)=10/100
P(2nd defective iphone)=9/99
P(1st defective iphone and 2nd defective
1
1
1
iphone)= ∗ =
= .9%
10
11
110
What is the probability of getting tail and
getting a 3 on a die and getting an ace
in a deck of cards?
P(tail)=1/2
P(3)=1/6
P(ace)=4/52=1/13
1
2
1
6
∗ ∗
1
13
=
1
156
You use addition when you want to
consider the possibility of one event OR
another occurring
1) Satisfying the humanities requirement
by taking a course in the history of Japan
or by taking a course in classical
literature
2) Buying new tires and aligning the tires
3) Getting an A in math but also in
biology
4) Having at least one of these pets: cat,
dog, bird, rabbit
1) or
2) and
3) and
4) or
Two events are mutually exclusive or
disjoint if they cannot occur together. In
particular, events A and B are mutually
exclusive if P(A and B)=0
P(A or B)=P(A)+P(B)
General Rule for any events A and B
P(A or B)=P(A)+P(B)-P(A and B)
Remember in mutually exclusive events
P(A and B)=0
Employee
type
Democrat
(D)
Republican
(R)
Independent
(I)
Row Total
Executive (E)
5
34
9
48
Production
Worker (PW)
63
21
8
92
Column Total
68
55
17
140 grand
total
a)
b)
c)
d)
e)
Compute P(D) and P(E)
Compute 𝑃 𝐷 𝐸
Are events D and E independent?
Compute P(D and E)
Compute P(D or E)
A) P(D)=68/140
B) 𝑃 𝐷 𝐸 =
P(E)=48/140
5
48
C) Determine if P(D)=𝑃 𝐷 𝐸 , they are not
the same! So they are not independent
D) P(D and E)=5/140
E) P(D or
68
48
5
E)= +
−
140
140
140
=
111
140
A tree diagram shows all the possible
outcomes of an event.
All possible outcomes of an event are
shown by a tree diagram.
If a coin and a dice are tossed
simultaneously, what is the probability of
getting tail and even number?
1/4
You are on a sports team. What is the
probability that out of three games, you
win two of them?
You have 7 balls, 4 are blue and 3 are
green. What is the probability that when
you pick the balls, you get green on 1st
and blue one 2nd?
You make free throws 85% of the time.
What is the probability of making at least
one out of the three?
P(make 1 out of 3)=99.66% of the time
Outcome
2
3
4
5
6
7
8
9
10
11
12
Probability
1/36
2/36
3/36
4/36
5/36
6/36
5/36
4/36
3/36
2/36
1/36
Slide
9- 125
Find the probability of rolling a sum divisible by 4 on a
single
roll of two fair dice.
Slide
9- 126
Find the probability of rolling a sum divisible by 4 on a
single
roll of two fair dice.
The event E consists of the outcomes 4,8,12 . To get the probability
of E we add up the probabilities of the outcomes in E:
3
5
1
9 1
P( E )
.
36 36 36 36 4
Slide
9- 127
Dylan opens a box of a dozen chocolate cremes and offers three of them
to
Russell. Russell likes vanilla cremes the best, but all the chocolates look alike
on the outside. If five of the twelve cremes are vanilla, what is the
probability
that all of Russell’s picks are vanilla?
Slide
9- 128
Dylan opens a box of a dozen chocolate cremes and offers three of them
to
Russell. Russell likes vanilla cremes the best, but all the chocolates look alike
on the outside. If five of the twelve cremes are vanilla, what is the
probability
that all of Russell’s picks are vanilla?
The experiment in question is the selection of three chocolates,
without regard to order, from a box of 12. There are C 220
12
3
outcomes of this experiment. The event E consists of all possible
combinations of 3 that can be chosen, without regard to order, from
the 5 vanilla cremes available. There are C 10 ways.
5
3
Therefore, P( E ) 10 / 220 1/ 22.
Slide
9- 129
Suppose an event A has probability p1
and an event B has probability p2 under
the assumption that A occurs. Then the
probability that both A and B occur is
p1p2.
Slide
9- 130
Dylan opens a box of a dozen chocolate cremes and offers three of them
to
Russell. Russell likes vanilla cremes the best, but all the chocolates look alike
on the outside. If five of the twelve cremes are vanilla, what is the
probability
that all of Russell’s picks are vanilla?
Slide
9- 131
Dylan opens a box of a dozen chocolate cremes and offers three of them
to
Russell. Russell likes vanilla cremes the best, but all the chocolates look alike
on the outside. If five of the twelve cremes are vanilla, what is the
probability
that all of Russell’s picks are vanilla?
The probability of picking a vanilla creme on the first draw is 5/12.
Under the assumption that a vanilla creme was selected in the first
draw, the probability of picking a vanilla creme on the second draw is
4/11. Under the assumption that a vanilla creme was selected in the first
and second draw, the probability of picking a vanilla creme on the third
draw is 3/10. By the Multiplication Principle, the probability of picking
5 4 3
60
1
a vanilla creme on all three picks is
.
12 11 10 1320 22
Slide
9- 132
If the event B depends on the event A, then P( B | A)
P( A and B)
.
P( A)
Slide
9- 133
Suppose an experiment consists of n-independent repetitions of an
experiment with two outcomes, called "success" and "failure." Let
P(success) p and P(failure) q. (Note that q 1 p.)
Then the terms in the binomial expansion of ( p q) give the respective
probabilities of exactly n, n 1,..., 2, 1, 0 successes.
Number of successes out of
Probability
n independent repetitions
n
p
n
n
n 1
1
0
n
n 1 p q
n 1
n
r pq
q
n 1
n
Slide
9- 134
Suppose Tommy makes 92% of his free throws. If he shoots 15 free
throws, and if his chance of making each one is independent of
the other shots, what is the probability that he makes all 15?
Slide
9- 135
Suppose Tommy makes 92% of his free throws. If he shoots 15 free
throws, and if his chance of making each one is independent of
the other shots, what is the probability that he makes all 15?
P(15 successes) 0.92 0.286
15
Slid
e 9136
Suppose Tommy makes 92% of his free throws. If he shoots 15 free
throws, and if his chance of making each one is independent of
the other shots, what is the probability that he makes exactly 10?
Slid
e 9137
Suppose Tommy makes 92% of his free throws. If he shoots 15 free
throws, and if his chance of making each one is independent of
the other shots, what is the probability that he makes exactly 10?
15
P(10 successes)= 0.92 0.08 0.00427
10
10
5
Slid
e 9138
Pg 728 #1-40 eoe
Evaluate each expression when a 3, r 2, n 4 and d 2.
1. a (n 1)d
2. a r
Find a .
n 1
10
k 1
k
4. a 2 3
3. a
k
k 1
k
5. a a 3 and a 10
k
k 1
9
Slide 9- 141
Evaluate each expression when a 3, r 2, n 4 and d 2.
1. a (n 1)d 9
2. a r
24
Find a .
n 1
10
k 1
k
4. a 2 3
3. a
k
k 1
k
11
10
39,366
5. a a 3 and a 10
k
k 1
9
13
Slide 9- 142
Let a be a sequence of real numbers, and consider lim a .
n
n
n
If the limit is a finite number L, the sequence converges and L
is the limit of the sequence. If the limit is infinite or nonexistent,
the sequence diverges.
Slid
e 9143
Determine whether the sequence converges or diverges. If it converges,
give the limit.
2 1 2
2
2,1, , , ,..., ,...
3 2 5
n
Slid
e 9144
Determine whether the sequence converges or diverges. If it converges,
give the limit.
2 1 2
2
2,1, , , ,..., ,...
3 2 5
n
lim
n
2
0, so the sequence converges to a limit of 0.
n
Slid
e 9145
Converges or diverges
3𝑛
𝑛+1
5𝑛2
3
𝑛 +1
𝑛3 +2
2
𝑛 +𝑛
Converges to 3
Converges to 0
Diverges
Converges or diverges
1 1 1 1
1
, , , ,…
1 2 3 4
𝑛
2 3 4 5
, , , ,…
1 2 3 4
2,4,6,8,10, …
-1, 1, -1, 1, -1, 1…
Converges to 0
Converges to 1
Diverges
Diverges
A sequence a is an arithmetic sequence if it can be written in the
n
form a, a d , a 2d ,..., a ( n 1) d ,... for some constant d .
The number d is called the common difference.
Each term in an arithmetic sequence can be obtained recursively from
its preceding term by adding d : a a d (for all n 2).
n
n 1
Slid
e 9150
Recursive formula:
𝑎𝑛 = 𝑎𝑛−1 + 𝑑
Explicit formula:
𝑎𝑛 = 𝑎1 + 𝑛 − 1 𝑑
Find (a) the common difference, (b) the tenth term, (c) a
recursive rule for the
nth term, and (d) an explicit rule for the nth term.
-2, 1, 4, 7, …
Slid
e 9152
Find (a) the common difference, (b) the tenth term, (c) a
recursive rule for the
nth term, and (d) an explicit rule for the nth term.
-2, 1, 4, 7, …
(a) The common difference is 3.
(b) a 2 (10 1)3 25
10
(c) a 2 a a 3 for all n 2
1
n
n 1
(d) a 2 3(n 1) 3n 5
n
Slid
e 9153
6, 10, 14, 18, …
A) Find the common difference
B) the tenth term
C) a recursive rule for the nth term
D) Explicit rule for the nth term
23, 27, 31, …
A) Find the common difference
B) the tenth term
C) a recursive rule for the nth term
D) Explicit rule for the nth term
A sequence a is a geometric sequence if it can be written in the
n
form a, a r , a r ,..., a r ,... for some nonzero constant r.
2
n 1
The number r is called the common ratio.
Each term in a geometric sequence can be obtained recursively from
its preceding term by multiplying by r : a a r (for all n 2).
n
n 1
Slide 9- 156
Recursive:
𝑎𝑛 = 𝑎𝑛−1 ∗ 𝑟
Explicit:
𝑎𝑛 = 𝑎1 ∗ 𝑟 𝑛−1
Find (a) the common ratio, (b) the tenth term, (c) a recursive rule
for the
nth term, and (d) an explicit rule for the nth term.
2, 6, 18,…
Slid
e 9158
Find (a) the common ratio, (b) the tenth term, (c) a recursive rule
for the
nth term, and (d) an explicit rule for the nth term.
2, 6, 18,…
(a) The ratio is 3.
(b) a 2 3 39,366
10 1
10
(c) a 2 and a 3a
1
n
n 1
for n 2.
(d) a 2 3 .
n 1
n
Slid
e 9159
3, 6, 12, 24, 48…
A) Find the common ratio
B) the tenth term
C) a recursive rule for the nth term
D) an Explicit rule for the nth term
1, 3, 9, 27…
A) Find the common ratio
B) the tenth term
C) a recursive rule for the nth term
D) an Explicit rule for the nth term
The second and fifth terms of a
sequence are 3 and 24.
Find the explicit and recursive formulas
for the sequence if
A) arithmetic
B) geometric
The fourth and seventh term of an
arithmetic sequence are -8 and 4.
Find the explicit rule and recursive rule
and the first term.
2nd and 8th term of a geometric
sequence are 3 and 192
Find the explicit rule, common ratio, first
term
The Fibonacci sequences can be defined recursively by
a 1
1
a 1
2
a a a
n
n2
n 1
for all positive integers n 3.
Slid
e 9165
Pg 739 #1-32 eoo
a is an arithmetic sequence. Use the given information to find a
n
10
.
1. a 5; d 4
1
2. a 5; d 2
3
a is a geometric sequence. Use the given information to find a
n
10
.
3. a 5; r 4
1
4. a 5; r 4
3
5. Find the sum of the first 3 terms of the sequence n .
2
Slid
e 9168
a is an arithmetic sequence. Use the given information to find a
n
10
1. a 5; d 4
41
2. a 5; d 2
19
1
3
a is a geometric sequence. Use the given information to find a
n
10
.
.
3. a 5; r 4 1,310,720
1
4. a 5; r 4
3
81,920
5. Find the sum of the first 3 terms of the sequence n . 14
2
Slide 9- 169
In summation notation, the sum of the terms of the sequence a , a ,..., a
1
2
n
is denoted a which is read "the sum of a from k 1 to n."
n
k 1
k
k
The variable k is called the index of summation.
Slid
e 9170
Let a , a ,..., a be a finite arithmetic sequence with common difference d .
1
2
n
Then the sum of the terms of the sequence is
a a a ... a
n
k 1
k
1
2
n
a a
n
2
n
2a (n 1)d
2
1
n
1
Slid
e 9171
A corner section of a stadium has 6 seats along the front row. Each
successive row has 3 more seats than the row preceding it. If the top
row has 24 seats, how many seats are in the entire section?
Slid
e 9172
A corner section of a stadium has 6 seats along the front row. Each
successive row has 3 more seats than the row preceding it. If the top
row has 24 seats, how many seats are in the entire section?
The number of seats in the rows form an arithmetic sequence with
a 6, a 24, and d 3. Solving
1
n
a a (n 1)d
n
1
24 6 3(n 1)
n7
Apply the Sum of a Finite Sequence Theorem:
6 24
Sum of chairs 7
105. There are 105 seats in the section.
2
Slid
e 9173
Write each sum using summation
notation and find the sum
-7-1+5+11+……….53
Write each sum using summation
notation and find the sum
111, 108, 105, … 27
Write each sum using summation
notation and find the sum
5, 13, 21, …..45
Let a , a ,..., a be a finite geometric sequence with common ratio r.
1
2
n
Then the sum of the terms of the sequence is
a a a ... a
n
k 1
k
1
2
a 1 r
1
n
n
1 r
Slide 9- 177
4, -4/3, 4/9, -4/27, …4
1 10
−
3
1 11
4(1− −
)
3
1
1−(− )
3
≈ 3.000016935
Find the sum of the geometric
sequences.
3, 6, 12, ….12,288
Find the sum of the geometric
sequences.
5, 15, 45, …98,415
An infinite series is an expression of the form a a a ... a ...
k 1
k
1
2
n
Slid
e 9182
The geometric series a r
k 1
k 1
converges if and only if | r | 1.
a
If it does converge, the sum is
.
1 r
Slid
e 9183
Determine whether the series converges. If it converges, give the sum.
2 0.25
k 1
k 1
Slid
e 9184
Determine whether the series converges. If it converges, give the sum.
2 0.25
k 1
k 1
Since |r | 0.25 1, the series converges.
a
2
8
The sum is
.
1 r 1 0.25 3
Slid
e 9185
Find the sum
5, 1, 1/5, ….
Find the sum
7, 7/5, 7/25, …
Find the sum of the first n terms of the
sequence. The sequence is either
arithmetic or geometric
6, -3, 3/2, -3/4, ….n=11
Find the sum of the first n terms of the
sequence. The sequence is either
arithmetic or geometric
2, 5, 8, …n=10
Find the sum of the first n terms of the
sequence. The sequence is either
arithmetic or geometric
-1, 11, -121, … n=9
Pg 749 #1-40 eoe
1. Expand the product k ( k 2)( k 4).
Factor the polynomial.
2. n 7 n 10
3. n 3n 3n 1
2
3
2
x
.
x 1
5. Find f (t 1) given f ( x) x 1.
4. Find f (t ) given f ( x)
2
Slid
e 9193
1. Expand the product k (k 2)(k 4). k 6k 8k
3
2
Factor the polynomial.
n 2 n 5
3n 1 n 1
2. n 7 n 10
2
3. n 3n
3
2
3
x
t
4. Find f (t ) given f ( x)
.
x 1 t 1
5. Find f (t 1) given f ( x) x 1. t 2t 2
2
2
Slid
e 9194
The Tower of Hanoi Problem
Principle of Mathematical Induction
Induction and Deduction
… and why
The principle of mathematical induction is a
valuable technique for proving combinatorial
formulas.
Slid
e 9195
The minimum number of moves required
to move a stack of n washers in a Tower
of Hanoi game is 2n – 1.
Slid
e 9196
Let Pn be a statement about the integer n.
Then
Pn is true for all positive integers n provided
the
following conditions are satisfied:
1. (the anchor) P1 is true;
2. (inductive step) if Pk is true, then Pk+1 is
true.
Slid
e 9197
Bars can be vertical or horizontal.
Bars are of uniform width and uniformly
spaced.
The lengths of the bars represent values of
the variable being displayed, the frequency
of occurrence, or the percentage of
occurrence. The same measurement scale
is used for the length of each bar.
The graph is well annotated with title, lables
of each bar, and vertical scale or actual
value for the length of each bar.
Age
Life Expectancy
82
80
78
76
74
72
70
68
66
64
Men
Women
1980
1990
2000
Year of birth
2010
Look at the number where y-axis started.
You might see the graph with squiggle
on the changed axis. Sometimes, if a
single bar is unusually long, the bar
length is compressed with a squiggle in
the bar itself. (look at pg 51 example 211b with the graph)
Life Expectancy
Year of birth
2010
2000
Women
1990
Men
1980
60
65
70
75
Age
80
85
# of families
Survey of the type of car families own
9
8
7
6
5
4
3
2
1
0
Amount
Toyota
Nissan
Haundai
Type of car
Ford
Use the info below to create a bar graph.
Average annual income (in thousands) of a
household headed by a person with the
stated education level is as follows: 16.1 for
highschool, 34.1 for highschool graduates,
48.6 for associated degrees, 62.1 for
bachelor’s degrees, 71.0 for master’s
degrees and 84.1 for doctoral degrees
What can you conclude?
Pareto chart is a bar graph in which the
bar height represents frequency of an
event. In addition, the bars are
arranged from left to right according to
decreasing height.
Consider this situation:
Causes for lack of sleep(two month study
61 days)
Cause
Frequency
Playing x-box or ps3
14
Texting
9
Watching movie/TV
5
Talking on the phone
10
Doing homework/project
20
Other
3
Frequency
Frequency
20
14
10
9
5
3
Use the info below to create a pareto
chart.
Here are a list of the most common
stolen items per 100000 cases: 10.1
electronics; 15.6 jewelries; 7.3 cars; 20.4
cash; 26.7 identity
What can you conclude?
Circle graph or pie chart, wedges of a
circle visually display proportional parts
of the total population that share a
common characteristic.
Consider the situation:
Monthly Financial Budget (based on $4000
monthly)
Categories
Amount spent Fraction
Percentag Degree of the pie
e
Food
800
800/4000
0.2
.2*360°=72°
Investment
500
500/4000
0.125
.125*360°=45°
Bills/debt
1750
1750/4000
.4375
.4375*360°=157.5°
Rent
950
950/4000
.2375
.2375*360°=85.5°
Monthly Financial Budget ($4000)
Rent
24%
Food
20%
Investment
12%
Bills/debts
44%
Is the chart consistent with our data?
Create a circle graph with the following
info:
Gamestop took a survey on the first 500
customers to see what genre of games
they bought. 70 Fighting, 123 shooter,
150 action-adventure,53 role-playing, 12
strategy, 92 others.
What can you conclude?
Cause of Death
Heart Disease 700,142
Cancer 553,768
Stroke 163,538
Other 1,018,977
Create a circle graph, and bar graph
Stem-and-leaf is a method of exploratory
data analysis that is used to rank-order
and arrange data into groups.
Similarity: Both display frequency
distributions
Difference: In histogram, we lose most of
the specific data values (because of
intervals). Stem-and-leaf display is a
device that organizes and groups data
but allow us to recover the original data
if desired.
Write out all the numbers
Put this chart into a stem-and-leaf
display
30
27
12
42
35
47
38
36
27
35
22
17
29
3
21
0
38
32
41
33
26
45
18
43
18
32
31
32
19
21
33
31
28
29
51
12
32
18
21
26
A frequency table partitions data into
intervals and shows how many data
values are in each interval. The intervals
are constructed so that each data value
falls into exactly one interval.
Note: intervals are known as classes. The
book uses the word “classes”, but I use
“intervals” because it makes more sense.
Consider this situation: You are collecting
how many minutes each student study
for a particular class. You interviewed 50
students and here is the chart.
15
47
10
5
30
20
18
40
3
1
12
45
1
8
4
15
10
19
25
13
34
16
17
7
16
17
13
10
17
35
7
6
25
27
18
31
4
12
48
14
6
2
14
13
7
15
46
12
9
18
1) Determine how many intervals you want.
› between 5-15 is usually preferred
› Anything less than 5, you risk losing information
› Anything more than 15, data might not be
sufficiently analyzed
Let’s use 6 intervals for this case. (remember
you can any number between 5 and 15)
With this, you can find the width of each
interval.
𝐿𝑎𝑟𝑔𝑒𝑠𝑡 𝑣𝑎𝑙𝑢𝑒−𝑠𝑚𝑎𝑙𝑙𝑒𝑠𝑡 𝑣𝑎𝑙𝑢𝑒
Interval width=
𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙𝑠
48−1
47
So in our case:
= ≈ 7.8 = 8
6
6
› Note: You always round to the next whole
number, even if the number is 2.3. 2.3 would
become 3
So in each interval, it will include 8
numbers and this tells you the limit of
each interval
The lower interval limit is the lowest data
value that can fit in an interval.
The upper interval limit is the highest data
value that can fit in an interval.
The interval width is the difference between
the lower class limit of one interval and the
lower class limit of the next interval.
› In our case, our lowest number is 1, so 1+8=9,
therefore, 9 would be the start of the next
interval (remember we will have 6 intervals total)
Find the starting number of each interval
Start of 1st interval=1
Start of 2nd interval=9
Start of 3rd interval=17
Start of 4th interval=25
Start of 5th interval=33
Start of 6th interval=41
Start of 7th interval=49
Lower interval
limit
Upper interval
limit
1
8
9
16
17
24
25
32
33
40
41
48
Tally is the mark that is used to count the
amount of numbers that lies in each
interval.
Frequency (represented by 𝑓) is the
number of tally marks corresponding to
that interval
Now tally up all the numbers that fall in
each interval. Find the frequency also
Lower interval
limit
Upper interval
limit
Tally
Frequency 𝒇
1
8
13
9
16
17
17
24
8
25
32
5
33
40
3
41
48
4
Midpoint=
𝐿𝑜𝑤𝑒𝑟 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑙𝑖𝑚𝑖𝑡 + 𝑢𝑝𝑝𝑒𝑟 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑙𝑖𝑚𝑖𝑡
2
Find the midpoint of each interval
Lower interval limit
Upper interval limit
Interval midpoint
1
8
4.5
9
16
12.5
17
24
20.5
25
32
28.5
33
40
36.5
41
48
44.5
Upper interval boundaries, add 0.5 to the
upper interval limit.
Lower interval boundaries, subtract 0.5
from the lower interval limits.
Find the interval boundaries for all
interval.
Lower interval
limit
Upper interval
limit
Lower interval
boundary
Upper interval
boundary
1
8
0.5
8.5
9
16
8.5
16.5
17
24
16.5
24.5
25
32
24.5
32.5
33
40
32.5
40.5
41
48
40.5
48.5
Relative Frequency shows the probability
of data values that falls in each interval
Relative
𝑓
frequency=
𝑛
=
𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦
𝑇𝑜𝑡𝑎𝑙
Find the relative frequency of each
interval
Frequency 𝒇
13
Relative frequency
𝒇
𝒏
13/50=0.26
18
18/50=0.36
7
7/50=0.14
5
5/50=0.1
3
3/50=0.06
4
4/50=0.08
This is a data represent glucose blood level
after 12 hour fast for a random sample of
70 women. Use 6 intervals (classes)
45
66
83
71
76
64
59
59
76
82
80
81
85
77
82
90
87
72
79
69
83
71
87
69
81
76
96
83
67
94
101
94
89
94
73
99
93
85
83
80
78
80
85
83
84
74
81
70
65
89
70
80
84
77
65
46
80
70
75
45
101
71
109
73
73
80
72
81
63
74
Lower
interval
limit
Upper
interval
limit
Lower
interval
boundar
y
Upper
Tally
interval
boundar
y
Frequenc
y𝒇
45
55
44.5
55.5
3
Interval Relative
midpoi frequenc
nt
y
𝒇
𝒏
50
0.04
56
66
55.5
66.5
7
61
0.1
67
77
66.5
77.5
22
72
.31
78
88
77.5
88.5
26
83
.37
89
99
88.5
99.5
9
94
.13
100
110
99.5
110.5
3
105
.04
You use the frequency table to graph a
histogram (use the example we did
together in class about study minutes
with 50 students)
You use lower/upper interval boundaries
for the x axis because you don’t want
any gaps.
Let’s graph both frequency histogram
and relative-frequency histogram
Frequency
Frequency f
20
15
10
5
0
Minutes
Time-series graph, data are plotted in
order of occurrence at regular intervals
over a period of time
Consider this situation:
Points Scored in a game (49er 2012)
Wee
k
1
2
3
4
5
6
7
8
9
Point
s
30
27
13
34
45
3
13
24
0
Wee
k
10
11
12
13
14
15
16
17
Point
s
24
32
31
13
27
41
13
27
Points for
Points Scored
50
40
30
20
Points for
10
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
Weeks
Is there a pattern? Is there anything you
can conclude?
Create a time-series graph from the
following data
What can you conclude?
Week
1
2
3
4
5
6
7
8
9
Distanc
e
1.5
1.4
1.7
1.6
1.9
2.0
1.8
2.0
1.9
Week
10
11
12
13
14
15
16
17
18
Distanc
e
2.0
2.1
2.1
2.3
2.3
2.2
2.4
2.5
2.6
Bar graphs are useful for quantitative or
qualitative data.
Pareto Charts identify the frequency of
events or categories in decreasing order of
frequency of occurrence.
Circle graph display how a total is dispersed
into several categories.
Time-series graph display how data change
over time.
Note: Make sure you provide title, label axes
and identify units of measure in all type of
graphs!!
Think back to middle school and high
school (before taking Mr. Liu’s awesome
statistics class). What do you think of
when you heard of statistics?
The 3 M’s. Mean, median and mode
Mode is the value that occurs most
frequently
25, 37, 12, 27, 69, 64, 44, 30, 50, 52, 65, 25,
10, 12, 60, 27, 12, 65, 25
12 and 25
What is the mode out of this data set?
537244248343486245345987
1
4
Median can be represented as 𝑥 “x
tilde”. Median is the central value of an
ordered distribution (the exact middle)
1) Order the data from smallest to largest
2) For an odd number of values in the
distribution, Median= Middle data value
3) For an even number of data values in
the distribution, 𝑀𝑒𝑑𝑖𝑎𝑛 =
𝑆𝑢𝑚 𝑜𝑓 𝑚𝑖𝑑𝑑𝑙𝑒 𝑡𝑤𝑜 𝑣𝑎𝑙𝑢𝑒𝑠
2
Find the Median
19 20 25 36 48 35 21 18
18 19 20 21 25 35 36 48
Because it is even, I add (21+25)/2 = 23
So median = 23
Find the median
36, 4, 69, 47, 81, 84, 6, 38, 17, 93, 46
Median uses the position rather than the
specific value of each data entry. If the
extreme values of a data set change,
the median usually does not change.
It is often used as the average for house
prices.
For an ordered data set of size n,
𝑛+1
2
𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑚𝑖𝑑𝑑𝑙𝑒 𝑣𝑎𝑙𝑢𝑒 =
If n = 77, then the middle value is
(77+1)/2= 39th number.
If n=90, then the middle value is
(90+1)/2=45.5th value. Which is in
between 45th and 46th number.
𝑠𝑢𝑚 𝑜𝑓 𝑎𝑙𝑙 𝑒𝑛𝑡𝑟𝑖𝑒𝑠
𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑒𝑛𝑡𝑟𝑖𝑒𝑠
𝑀𝑒𝑎𝑛 =
Mean means average.
It is aka arithmetic mean
Find the mean: 58, 92, 50, 68, 80, 64
Mr. Liu is a Paladin Tank that’s low in HP.
He only has 2 hits left before the boss
unleash his ultra super attack and pwns
Mr. Liu. The boss has a total of 750 HP
and Mr. Liu has damaged the boss with
the following damage points: 98, 50, 65,
71, 120, 84, 93, 27. What is the mean
damage of the next 2 hits so that Mr. Liu
survives?
Many ways to solve this… depend on
how you think about it.
The mean is 71 damage points for the
next 2 hits.
(𝑠𝑖𝑔𝑚𝑎) 𝑚𝑒𝑎𝑛𝑠 𝑠𝑢𝑚
𝑥 𝑚𝑒𝑎𝑛𝑠 𝑠𝑢𝑚 𝑜𝑓 𝑎𝑙𝑙 𝑥, 𝑜𝑟 𝑎𝑙𝑙 𝑑𝑎𝑡𝑎 𝑣𝑎𝑙𝑢𝑒𝑠
Sample statisic 𝒙 (x bar)
𝑥
𝑥=
𝑛
n=number of data values in
the sample
𝑥 = 𝑠𝑎𝑚𝑝𝑙𝑒 𝑚𝑒𝑎𝑛
Population parameter 𝝁 (Mu)
𝑥
𝜇=
𝑁
N=number of data values in
the population
𝜇 = 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
Resistant measure is one that is not
influenced by extremely high or low data
values.
Which of the following data has more of
a resistant measure, mean or median?
Median, although the disadvantage of
the median is that it is not sensitive to the
specific size of a data value.
Mean is too sensitive to the data value.
It is called 10% trimmed mean
Procedures:
› 1) Order the data from smallest to largest
› 2) Delete the bottom 5% or the data and the top
5% of the data. Note: If the calculation of 5% of
the number of data values does not produce a
whole number, round down to the nearest
integer
Ex: if you get .75, you round to 0
Ex: if you get 1.95, you round to 1
› 3) Compute the mean of the remaining 90% of
the data
Here are some sample class sizes for introductory
lecture courses
14, 30, 45, 11, 20, 80, 65, 34, 18, 60, 100, 42, 40, 40, 78,
31, 52, 35, 120, 19
A) compute the mean
B) compute the 10% trimmed mean
C) Find the median
D) Find the median of the 10% trimmed data set.
Does the median change when you trim the data?
E) Is the trimmed mean or the original mean closer to
the median?
A) 46.7
B) 44.61
C) 40
D)40
E) Trimmed mean is closer to the median
How old are basketball players? Here are some of
the ages: 40, 38, 21, 18, 24, 23, 28, 25, 21, 27, 26, 25,
31, 32, 19, 29, 33, 24, 23, 35, 24, 23, 25, 32, 39, 26, 31,
28, 30, 31, 22, 23, 24, 25, 26, 27, 38, 27, 26, 22
A) compute the mean
B) compute the 10% trimmed mean
C) Find the median
D) Find the median of the 10% trimmed data set.
Does the median change when you trim the data?
E) Is the trimmed mean or the original mean closer to
the median?
A) 27.275
B) 27.083
C) 26
D) 26
E) Closer
This deal with my class especially!!!
𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑 𝑎𝑣𝑒𝑟𝑎𝑔𝑒 =
w= weight
𝑥𝑤
𝑤
If your midterm is worth 40% of your total
grade and your final is worth 60% of your
total grade, you got 70 on your midterm
and 90 on your final. What is your final
grade?
𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑 𝑎𝑣𝑒𝑟𝑎𝑔𝑒 =
82
you would have a B
.40 70 +.60(90)
.40+.60
=
28+54
1
=
Test/Quizzes
Participation/bell work
Notebook
Projects
Homework
Finals
30 %
10 %
15 %
20 %
15 %
10 %
You scored 85% on test/quizzes
You scored 60% on Homeworks
You scored 100% on participation/bell work
You scored 80% on projects
You scored 98% on notebook
You scored 50% on finals
What is your grade for the class?
.85(.30)+.60(.15)+1(.10)+.80(.20)+.98(.15)+
.50(.10)=.255+.09+.10+.16+.147+.05=.802
or 80.2% which is a B-
These are a measure of the distribution or spread of data
around an expected mean (𝑒𝑖𝑡ℎ𝑒𝑟 𝜇 𝑜𝑟 𝑥)
Variance is a measure of how far a set of numbers is
spread out.
A variance of zero indicates that all the values are
identical.
A non-zero variance is always positive:
› a small variance indicates that the data points tend to be
very close to the mean (expected value) and hence to
each other.
› a high variance indicates that the data points are very
spread out from the mean and from each other.
Standard Deviation shows how much variation
or dispersion from the average (mean, also called
Are you prepared?
Note: I will be covering both sample
statistics and population parameters
Sample mean=𝑥 =
𝑥
𝑛
Sample variance=𝑠 2 =
𝑥−𝑥 2
𝑛−1
𝑥−𝑥 2
𝑛−1
Sample standard deviation=𝑠 =
Where x is a member of the data set, 𝑥 is
the mean, and n is the number of data
Random Sample
5 7 8 12 15 15
Find the variance and standard
deviation
𝑥
𝑥−𝒙
𝒙−𝒙
𝟐
4
4-10=-6
−6
7
7-10=-3
−3
2
=9
7
7-10=-3
−3
2
=9
11
11-10=1
1
15
15-10=5
5
2
= 25
16
16-10=6
6
2
= 36
𝑥 = 60
𝑥 𝟔𝟎
𝒙=
=
= 𝟏𝟎
𝐧
𝟔
2
2
=1
𝑥−𝑥
𝑠2
=
𝑥−𝑥 2 𝟏𝟏𝟔
= 𝟓
𝑛−1
𝑠=
= 36
2
= 116
= 𝟐𝟑. 𝟐
𝑥−𝑥
𝑛−1
2
= 23.2 ≈ 4.82
Random Sample
5 5 5 6 7 8
Find the variance and standard
deviation
𝑠 2 = 1.6
𝑠 ≈ 1.26
Population mean=𝜇 =
𝑥
𝑁
𝑥−𝜇 2
𝑁
Population Variance=𝜎 2 =
Population standard deviation=𝜎 =
𝑥−𝜇 2
𝑁
Where N is the number of data values in
Samples are usually in “english
alphabets”
Populations are usually in “greek
alphabets”
Samples are in lower case
Population are in upper case
Have you guys taken your SAT yet?
Remember how you look at the score
and it says your “percentile”?
If you are in the 70th percentile, it means
that you scored the same or better than
70% of the people who took the SAT.
30% Scored the same or above you.
For whole numbers P (where 1 ≤ 𝑃 ≤
99), the Pth percentile of a distribution is
a value such that P% of the data fall at
or below it and (100-P)% of the data fall
at or above it.
It gives indication of “relative” position.
Basically, it tells you how you scored
against others that took the same test.
A) If your score is at the 27th percentile,
what percentage of scores are at or
below yours?
B) If the score ranged from 1 to 100 and
your score is 90%, does that necessarily
mean that your score is at the 90th
percentile?
A) 27%
B) No because percentile gives an
indication of relative position of the
scores.
Median is an average computed by
using relative position of the data.
Median is an example of a percentile; it
is the 50th percentile.
What is a Quartiles? (Look at the word
itself, what does it sound like?)
Quartile sounds like “quarters” so it has 4
intervals
1) Order the data from smallest to largest
2) Find the median (Q2). This is the second
quartile (Q2)
3) The first quartile Q1 is then the median of
the lower half of the data; that is, it is the
median of the data falling below the Q2
position (and not including Q2)
4) The third quartile Q3 is the median of the
upper half of the data; that is, it is the
median of the data falling above the Q2
position (and not including Q2)
16
17
18
20
25
33
33
35
37
38
40
47
50
50
50
63
84
97
97
99
100
103
107
107
107
108
123
1) Where is the median (Q2)? Since there are 27
data, It is at the14th number, which is 50
2) Where is Q1? Since Q2 is at 14th spot, there are 13
numbers below. The median between Q1 and Q2 is
at the 7th spot, which is 33
3) Where is Q3? Since Q2 is at the 14th spot, there are
13 numbers above it. The median between Q2 and
Q3 is at the 21th spot, which is 100
4) Interquartile is Q3-Q1, which is 100-33=67
7
15
23
6
12
35
30
14
6
21
25
39
5
16
9
25
21
7
12
10
31
Q1=8
Q2=15
Q3=25
IQR=17
540
570
520
550
490
430
490
490
570
380
510
640
310
400
380
390
410
340
390
350
330
350
Q1=380
Q2=420
Q3=520
IQR=140
Quartiles together with the low and high
data values give us a very useful fivenumber (no…not $5 foot long) summary
of the data and their spread.
Max
Q3
Q2
Q1
Min
The longer the whisker is, the more
skewed it is toward that value.
If the median is closer to the lower part
of the box, it means that data are more
concentrated toward the lower end.
If the median is closer to the upper part
of the box, it means that data are more
concentrated toward the upper end.
1) Draw a vertical/horizontal scale to
include the lowest and highest data values.
2) To the right/top of the scale, draw a box
from Q1 to Q3
3) Include a solid line through the box at
the median level
4) Draw vertical/horizontal lines, called
whiskers, from Q1 to the lowest value and
from Q3 to the highest value
(Look at previous slide for examples)
Lower outlier = 𝑄1 − 1.5 𝑥 𝐼𝑄𝑅
Upper outlier = 𝑄3 + 1.5 𝑥 (𝐼𝑄𝑅)
SAT example:
Q1=380
Q2=420
Q3=520
IQR=140
Lower Outlier: 380 − 1.5 𝑥 140 = 170
Upper Outlier: 520 + 1.5 𝑥 140 = 730
Points Mr. Liu scored in basketball games:
Q1=8
Q2=15
Q3=25
IQR=17
Q1=8
Q2=15
Q3=25
IQR=17
Lower Outlier: 8 − 1.5 17 = −17.5
Upper Outlier: 25 + 1.5 17 = 50.5
Mr. Liu’s points never goes below -17.5 or
65 72 68 4 64 60 55 73 71 52 63 61
75 62 89 64
Pg 769 #1-24 eoe
Pg 782 #1-39 eoo