Statistics Chapter 2 Exploring Distributions

Download Report

Transcript Statistics Chapter 2 Exploring Distributions

Section 2.3
Measures of Center and Spread

Mean (average):
 Sum of x’s / total number of values
𝑥
𝑥=
𝑛
 It is the balance point of a distribution

Median is the value the divides the data into 2
halves.
 The middle number or average of middle 2 numbers.
 1,3,6,8,9
Median is 6
 1,3,7,9,10,15
Median is (7+9)/2=8

An extreme outlier has a larger effect on the
mean than the median. Why?
 1,4,6,8 Find the mean and median
 1,4,6,8 and 55: Now find the mean and median
 Remember the mean is the balance point of a
distribution and the median is the middle number.

Use the Speed of Mammals to investigate median and the 5-Number Summary:

Speed (mi/h) Predator (1=Y, 0=N
30
1
30
1
70
1
30
0
39
1
40
0
25
0
45
0
42
1
32
0
20
0
48
0
40
0
50
1
11
0
35
0
12
0
40
0
Find the median of the Predators:
Find the lower or first quartile of the Predators: (omit the median)
Find the upper of third quartile of the Predators: (omit the median)
What are the minimum and maximum values for Predators:
The 5-Number Summary and gives a measure of center and spread.
Now do the same for the non-predators:

Use the 5-Number Summary to create a box
and whisker plot.

When comparing two groups use the same
scale and place box and whisker plots on top
of each other.

Use the mammal speed data to compare wild
mammals to non-wild mammals with box and
whisker plots.

Page 63 D16

Page 69 P16, 18, 19, 21

The IQR is a measure of the spread of the
middle 50% of the data.

IQR=Q3-Q1

This can be use to determine if a value is an
outlier.
 If the value is more than 1.5 times the IQR from
the nearest quartile.
 Outliers in a Boxplot.

Measuring Hand Span Activity
 Using a ruler, measure your hand span to the
nearest mm.
 Write your result on the board.
 Enter all data into calculator.
 Find the 5 Number Summary of the data.
▪ (1-var stats)
▪ Create a box plot
 Enter L1-mean into L2
 Use 1-var stats to find the sum of this:
x

Think about what subtracting the mean (x)
from your data values does.
 It tells how far your hand span is from the class
average.
▪ + and your hand span is larger
▪ - and your hand span is smaller

Now think about what summing all of your
differences does. What was the result?
 Differences from the mean will always sum to be Zero.


Rapid Fire Example , Rapid Fire 2
A measure of the average difference from the mean gives
you a kind of level of consistency.
 Think of a target in an archery contest.
If you wanted to measure someone’s accuracy, how would you
suggest doing it?

The standard deviation of a data set is typically used when
the data is roughly normally distributed.
 It measures the “average” distance from the mean.
 This is a measure of spread. (how far are the arrows spread apart)

To calculate a SD, we must figure out a way to find the
average difference from the mean, but we know that the
sum of the differences is always = zero.
 So we square the difference, then sum that, then divide to find
average sum of squares, then take the square root.
s
 (x  x)
n 1
2

The SD squared: (s2) is called the variance.
 It is not used as often, but is important to know
what it is, in case it is a piece of information given.

You can get the SD from the calculator.
 Sx is the SD of a sample of data.
 σx is the SD of an entire population (we will
discuss the difference later)

Note the other sums and values you can get.
 Σx will give the sum of you list of values
 Σx2 will give the sum of squared values
 n is your total number of data values in the list
 Then is your 5-Number Summary


Frequency Tables show the values and the
frequency that they occur in the data set.
You can easily find the mean of this data:
(x  f )

x
n
 (x  x)

You can find the SD:

It is easy to find with Calculator:
s
2
f
n 1
 L1:x values, L2: frequencies, 1-var stat L1,L2

Page 70 P22-P25

Page 71: E29, 31, 33, 35, 37, 38, 43