Statistics Chapter 2 Exploring Distributions
Download
Report
Transcript Statistics Chapter 2 Exploring Distributions
Section 2.3
Measures of Center and Spread
Mean (average):
Sum of x’s / total number of values
𝑥
𝑥=
𝑛
It is the balance point of a distribution
Median is the value the divides the data into 2
halves.
The middle number or average of middle 2 numbers.
1,3,6,8,9
Median is 6
1,3,7,9,10,15
Median is (7+9)/2=8
An extreme outlier has a larger effect on the
mean than the median. Why?
1,4,6,8 Find the mean and median
1,4,6,8 and 55: Now find the mean and median
Remember the mean is the balance point of a
distribution and the median is the middle number.
Use the Speed of Mammals to investigate median and the 5-Number Summary:
Speed (mi/h) Predator (1=Y, 0=N
30
1
30
1
70
1
30
0
39
1
40
0
25
0
45
0
42
1
32
0
20
0
48
0
40
0
50
1
11
0
35
0
12
0
40
0
Find the median of the Predators:
Find the lower or first quartile of the Predators: (omit the median)
Find the upper of third quartile of the Predators: (omit the median)
What are the minimum and maximum values for Predators:
The 5-Number Summary and gives a measure of center and spread.
Now do the same for the non-predators:
Use the 5-Number Summary to create a box
and whisker plot.
When comparing two groups use the same
scale and place box and whisker plots on top
of each other.
Use the mammal speed data to compare wild
mammals to non-wild mammals with box and
whisker plots.
Page 63 D16
Page 69 P16, 18, 19, 21
The IQR is a measure of the spread of the
middle 50% of the data.
IQR=Q3-Q1
This can be use to determine if a value is an
outlier.
If the value is more than 1.5 times the IQR from
the nearest quartile.
Outliers in a Boxplot.
Measuring Hand Span Activity
Using a ruler, measure your hand span to the
nearest mm.
Write your result on the board.
Enter all data into calculator.
Find the 5 Number Summary of the data.
▪ (1-var stats)
▪ Create a box plot
Enter L1-mean into L2
Use 1-var stats to find the sum of this:
x
Think about what subtracting the mean (x)
from your data values does.
It tells how far your hand span is from the class
average.
▪ + and your hand span is larger
▪ - and your hand span is smaller
Now think about what summing all of your
differences does. What was the result?
Differences from the mean will always sum to be Zero.
Rapid Fire Example , Rapid Fire 2
A measure of the average difference from the mean gives
you a kind of level of consistency.
Think of a target in an archery contest.
If you wanted to measure someone’s accuracy, how would you
suggest doing it?
The standard deviation of a data set is typically used when
the data is roughly normally distributed.
It measures the “average” distance from the mean.
This is a measure of spread. (how far are the arrows spread apart)
To calculate a SD, we must figure out a way to find the
average difference from the mean, but we know that the
sum of the differences is always = zero.
So we square the difference, then sum that, then divide to find
average sum of squares, then take the square root.
s
(x x)
n 1
2
The SD squared: (s2) is called the variance.
It is not used as often, but is important to know
what it is, in case it is a piece of information given.
You can get the SD from the calculator.
Sx is the SD of a sample of data.
σx is the SD of an entire population (we will
discuss the difference later)
Note the other sums and values you can get.
Σx will give the sum of you list of values
Σx2 will give the sum of squared values
n is your total number of data values in the list
Then is your 5-Number Summary
Frequency Tables show the values and the
frequency that they occur in the data set.
You can easily find the mean of this data:
(x f )
x
n
(x x)
You can find the SD:
It is easy to find with Calculator:
s
2
f
n 1
L1:x values, L2: frequencies, 1-var stat L1,L2
Page 70 P22-P25
Page 71: E29, 31, 33, 35, 37, 38, 43