ch 16--statistics File - Solanco School District Moodle
Download
Report
Transcript ch 16--statistics File - Solanco School District Moodle
Unit Activation
M & M distribution project
How are probability and statistics related?
Statistics
HOW CAN WE REPRESENT A SET OF DATA SO THAT IT HAS
MEANING TO THOSE WHO NEED TO VIEW IT?
Statistics Review
16-1 & 2
HOW DO STATISTICS HELP ME RELATE TO THE WORLD
AROUND ME?
Greed
A STATISTICS REVIEW GAME
Set up your sticky note like this
with the sticky part on top
Instructions
1 ) E V E RY O N E B E G I N S T H E GA M E B Y S TA N D I N G
2) TWO SIX-SIDED DIE ARE ROLLED FOR THE INITIAL
SCORE
3 ) T H O S E T H AT A R E H A P P Y W I T H T H E I R S C O R E M AY
SIT DOWN AND RECORD THEIR SCORE ON THEIR
S T I C K Y N OT E
4) THE ROUND ENDS WHEN A ONE IS ROLLED OR
E V E RY O N E S I T S D O W N . I F A O N E I S R O L L E D
E V E RY O N E S TA N D I N G G E T S A Z E R O F O R T H E
ROUND
5 ) A GA M E C O N S I S T S O F 5 R O U N D S
6 ) AT T H E E N D O F F I V E R O U N D S, S T U D E N T S A D D
THEIR SCORES TOGETHER
7 ) T H E DATA I S T H E N U S E D T O R E V I E W S TAT I S T I C S
CONCEPTS
Roll two six-sided dice for the initial
score
Click to
Roll
Roll one six sided die
Click to
roll
Calculate your score
Place the papers on the board in numeric order
FIND THE:
Mean
Median
Mode
Create a Box and Whisker Plot
Discuss outliers (not just extremes)
Create a dot plot
Create a stem plot
State the similarities and differences between dot and
stem plots
Play round two
Set up your sticky note like this
with the sticky part on top
Roll two six-sided dice for the initial
score
Click to
Roll
Roll one six sided die
Click to
roll
Calculate your score
Place the papers on the board in numeric order
Create a back to back stem plot
Compare and Contrast the two stem plots (Center,
Shape and Spread—range)
Homework
NONE.
Chapter 16-2a
HOW DO I INTERPRET GRAPHS AND CHARTS?
Statistics in the World
How many sectors are in this circle
graph?
5
What percentage of people in Shrub
Oak preferred chocolate ice cream?
35
What percentage of people in Shrub
Oak preferred butter pecan ice
cream?
13
If a total of 50 people were surveyed,
then how many people preferred
vanilla ice cream?
13
Not given
0 to 100
6
Visit with friends
School clubs
53
44
Watch TV and earn money
Visit w friends, chat online, talk on phone, earn money, watch tv, school clubs
Choice 3
Discuss this as a class
What is wrong with the graph?
a) the labels are missing
b) scale is incorrect
c) too much data is presented for this type of graph
d) none of the above
How can we represent this data?
what type of graphs do you think is best for this data and why?
One option is to the right,
Is this the only option?
No you could use a bar graph too
Homework
WORKSHEET Graphs
Chapter 16-3
HOW TO WE FIND THE MEASURES OF VARIATION AND WHAT
DO THEY MEAN?
definitions
Mean deviation—the average distance
each piece of
n
data is from the mean
meandev
| x x |
i 1
i
n
Variance—the measure of the amount of variation in
a set of data
n
s2
( x x)
i 1
2
i
n 1
Standard deviation—the square root of the variance
or the typical deviation from the mean in normal
data all data should fall within three SD of the mean
Example
20, 15, 12, 18, 17, 15, 17, 16, 18, 25
x
Reorder
12, 15, 15, 16, 17, 17, 18, 18 20, 25
Q1= 15
Median= 17
Q3= 18
range = 25-12=13
iqr = 18-15=3
173
10
=17.3
Just Watch!!!!!!!!!
Find the standard deviation by hand
i
xi
Xi -
xx
(xii-
x )22
1
12
-5.3
28.09
2
15
-2.3
5.29
3
15
-2.3
5.29
4
16
-1.3
1.69
5
17
-.3
.09
6
17
-.3
.09
7
18
.7
.49
8
18
.7
.49
9
20
2.7
7.29
10
25
7.7
59.29
totals
173
108.1
173
x
17.3
10
n
s2
s2
( x x)
i 1
2
i
n 1
108.1
12.01
9
s s 2 3.46
By Calculator
Press Stat
Press enter/edit
Enter the data in L1
Press STAT
Choose calc
Choose 1 var stat
xi
12
15
15
16
17
17
18
18
20
25
x average
x total of x' s
x total of x' s squared
2
Sx st. dev of the SAMPLE
σ x st. dev of the Population
n number of items
min x min value
Q1 1st quartile
med median
Q3 3rd quartile
max x max value
Finding the Mean Deviation with the Calc
Enter the data in L1
Go to the top of L2 and enter |xi - 𝑥|
Run 1 variable statistics on L2
The 𝑥 in this run is the mean deviation
Homework
Pg. 702
2, 4, 6, 7
The Normal Distribution
16-4
Activation
Roll two die 5 times recording your results. Take turns plotting the results on the
graph below.
2
3
4
5
6
7
8
9
10
11
12
The normal curve
Normally Distributed data—data which is bell shaped and symmetric
about the mean in a way that 68% of the data fall within one st. dev.,
95% fall within two st. dev. And 99.7% falls within 3 standard dev.
- 3
13.5
2.35
%
%
-2
-1
68%
mean
95%
99.7%
13.5
%
1
2.35
%
2
3
Z-scores
Z-score—the number of standard deviations a
piece of data falls from the mean
z
xx
Example: if the mean is 10 and the st dev is 2
What is the z-score of 7
Open the book to pg 850.
7 10 3
z
2
2
What does a z-score mean?
Z-chart—because the data is symmetric we
can find the likelihood that we will have this
piece of data or one less than it.
7 10 3
z
2
2
z .0668
*
6.68% of the data is less than or equal to 7 when the mean is 10 and the st dev is 2
Example
Samples of a certain type of concrete specimen are selected, and the
compressive strength of each one is determined. The mean and st. dev are
3000 and 500 respectively. The sample box-plot appears relatively
normal. (i.e.—we can use z-scores)
Approx. what % of the sample falls below 2500?
Approx what percent falls between 2500 and 4300?
Approx what percent falls above 4500?
Homework
Pg 708
15-18
16-5
SAMPLING
Simple Random Sample—samples chosen so that
Each item has an equal chance of being selected
The choice of one item has no bearing on the choice of the next
Example:
A scientist is testing the impact on weight gain and loss on rats fed a
certain supplement. He reaches into the cage and chooses the five largest
rats. Is this a random sample?
This is not random in fact it is what we call biased.
Bias—when data is overly influenced for some reason.
In the last example the scientist was biased towards the larger rats.
Examples:
The county decides to survey the size of its constituents households by calling
every 10th person on the list?
Is this random?
Yes –not necessarily the best way to achieve randomness but legitimate
Is this biased?
Yes—not all constituents may have a home phone
Closer—Happyville
• Keep the worksheet face down until told differently
• When I tell you turn the worksheet over and estimate the average
household size in Happyville—you have five seconds turn the paper back
• Get everyone’s values and find the average of the estimates
• When I tell you turn the paper over and randomly choose any ten
households by circling them (you have 30 seconds)—turn the paper back
• Average the number of people in the 10 households
• Get everyone’s values and find the average of the averages
• Compare the two values
• Seed the random number generator
• Type in a value from 0 to 1000 press STO math prb rnd
• Then type math prb 5 (1,100) press enter until you have 10 unique
values
• Average the number of people in these 10 households
• Get everyone’s values and find the average of the averages
• Compare the three values
• Which values are closest together?
Homework
Pg 713
evens
Simulations
16-7
Activation
What is the probability of getting 5 heads
when you flip a coin ten times?
Is it sufficient to flip the coin just ten times?
Ten sets of ten?
What if the experiment had been what is the
probability of getting 5 correct answers on a
ten problem true false test?
Simulations
Theoretical probability
The actual probability of an event
Experimental probability
The results that you get when you run an experiment
Simulation
Used to approximate the probability when money or
time is too great a factor in running an actual
experiment
Design
The method used to run the simulation
Trial
One run of the method described
Example one:
A restaurant is giving away 6 actions figures
to the latest movie with the purchase of each
child’s meal. If you are equally like to get
any figure, what is the probability of getting
one of each in the purchase of ten meals?
Click
to roll
Example Two:
A basketball player has a free throw percentage of
80%. What is the probability of making two free
throws out of three baskets?
Example Three:
How does this change our simulation?
A basketball player has a free throw
percentage of 78%. What is the probability
of making two free throws in a out of three?
Homework
Worksheet
Types of sampling
16-6
IS ONE TYPE OF SAMPLING BETTER THAN ANOTHER?
River Project
WHAT ARE THE TYPES OF SAMPLING AND HOW IS EACH
ACCOMPLISHED?
Harvest Time
A farmer has just cleared a new field for corn. It is a unique plot of land
in that a river runs along one side. The corn looks good in some areas
of the field but not others. The farmer is not sure that harvesting the
field is worth the expense. He has decided to harvest 10 plots and use
this information to estimate the total yield. Based on this estimate, he
will decide whether to harvest the remaining plots.
Convenience Sample
The farmer decided to harvest the 10 easiest plots to harvest.
X
X
X
X
X
River
X
X
X
X
X
He then had second thoughts and decided to hire you, a
statistics student, somewhat knowledgeable about these
matters but far cheaper than a statistician. You are still to
choose 10 plots but try different sampling methods to
determine which one will give the best estimate of the actual
yield.
Simple Random Sample
Use a random number table to select 10 plots.
Describe the method used and mark the plots.
River
Stratified Sample (vertical)
Determine a method that will allow you to choose
one box in each column. Describe the method and
select the plots
River
Stratified Sample (horizontal)
Determine a method that will allow you to choose
one box in each row. Describe the method and
select the plots
River
Determining best method
The table below represents the yield per plot. You
will need this to determine which method is best
and why.
0
0
1
2
3
4
5
6
7
8
9
1
2
3
4
5
6
7
8
9
Determining best method
Method
Convenience
sample
Simple Random
sample
Vertical strata
Horizontal strata
Mean yield
per plot
Estimated
Total yield
Observations
1) You looked at four different methods of choosing plots. Is there a
reason, other than convenience, to choose one method over another?
2) How did your estimates vary according to the different sampling
methods used?
Homework
None
Hypothesis testing
HOW MUCH OF A DIFFERENCE IS ENOUGH?
Distracted Driver Project
1. Get a standard deck of 52 cards (i.e. no jokers)
2. Work with your group to determine a method for simulating this experiment
using the cards. Take five minutes to determine your method.
3. We will discuss the methods before proceeding and all use the same
method.
4. Shuffle and deal two piles of 24 cards. The pile on the left will be the cellphone drivers. Record the number who missed. Why don’t we need to
count the cards in the other pile?
5. Make a chart like the one below and repeat the experiment 9 more times
recording the number of cell phone misses.
6. In the original experiment, 7 of 24 drivers using cell phones missed the
freeway exit, compared to only 2 of the 24 drivers talking to a passenger. In how
many of your 10 simulation trials did 7 or more drivers in the cell phone group
miss the exit?
7. Place your results in the class dot plot on the board. In what percent of the
class’s simulation trials did 7 or more people in the cell phone group miss
the freeway? (be careful to keep the rows and columns of dots even)
8. Since 7 of the cell phone talkers missed the exit in the actual experiment,
do you think it’s possible that cell phones and passengers are equally
distracting to drivers based on the simulation results of the class? Why or
why not?
Homework
None