Transcript 01/18/2008
Univariate Statistics
Demonstration Day
Topics
Questions
Calculating measures of central tendency &
dispersion by hand and in Excel
Summation Notation & Rules
Skewness
Questions
Which measure of central tendency is most
appropriate for the following distributions
and why?
–
–
–
–
Bimodal distribution?
Skewed distribution?
Dataset with an outlier (an extreme value)?
Normal distribution (unimodal)?
Symbols Review
n
: the number of observations in a sample
N
: the number of elements in the population
Σ
: this (capital sigma) is the symbol for sum
i
: the starting point of a series of numbers
X
: one element in our dataset, usually has a subscript (e.g., i, min, max)
x
: the sample mean
: the population mean
s2
: the sample variance
σ2
: the population variance
s
: the sample standard deviation
σ
: the population standard deviation
Equations Review
Sample mean
Sample standard deviation
n
x
x
i 1
n
n
i
s
( x x)
i 1
i
n 1
Summation Notation
The order of operations for statistical
equations
Similar to Please Excuse My Dear Aunt
Sally from algebra
Summation Notation: Examples
Example I: All observations are included in the sum:
1, 2, 3, 4, 5, 6, 7, 8, 9, 10
10
x
i 1
i
x1 x2 x3 x4 x5 x6 x7 x8 x9 x10
1 2 3 4 5 6 7 8 9 10
Example II: Only observations 3 through 5 are included in the sum:
5
x
i 3
i
x3 x4 x5 3 4 5 12
Summation Notation: Rules
Rule I: Summing a constant n times yields a result of n*b:
n
b b b b nb
i 1
Here we are simply using the summation notation to carry
out a multiplication, e.g.:
5
4
i 1
4 4 4 4 4 4 5 20
Summation Notation: Rules
Rule II: Constants may be taken outside of the summation
sign
n
ax
i 1
n
ax
i 1
i
i
n
a xi
i 1
ax1 ax2 axn
n
a( x1 x2 xn ) a xi
i 1
• Rule II: Constants may be taken outside of the
summation sign
Example: Now let a = 3, and let the values of a set
(n = 3) of x and y values be:
x1 = 4, x2 = 5, x3 = 6
y1 = 7, y2 = 8, y3 = 9
3
3
ax
i 2
a xi 3(5 6) 33
i
i 2
ax
i
a xi 3(4 5 6) 45
Summation Notation: Rules
Rule III: The order in which addition operations are carried
out is unimportant
n
(x
i 1
i
n
n
i 1
i 1
yi ) xi yi
( x1 x2 x3 xn 1 xn )
+
( y1 y2 y3 yn1 yn )
• Rule III: The order in which addition operations are
carried out is unimportant
Example: Now let a = 3, and let the values of a set
(n = 3) of x and y values be:
x1 = 4, x2 = 5, x3 = 6
y1 = 7, y2 = 8, y3 = 9
2
2
2
(x y ) x y
i 1
i
i
i
i 1
2
i 1
i
(4 5) (7 8) 24
( x y ) (4 7) (5 8) 24
i 1
i
i
Summation Notation: Rules
Rule IV: Exponents are handled differently depending on
whether they are applied to the observation term or the whole
sum
n
k
k
k
k
x
x
x
x
i 1 2
n
i 1
k
k
x
(
x
x
x
)
i
1
2
n
i 1
n
• Rule IV: Exponents are handled differently depending
on whether they are applied to the observation term or
the whole sum
Example: Now let the values of a set (n = 3) of x values be:
x1 = 4, x2 = 5, x3 = 6
2
2
2
2
x
4
5
6
77
i =
( xi ) 2 (4 5 6) 2 225
Summation Notation: Rules
Rule V: Products are handled much like exponents
n
x y
i
i 1
i
( x1 y1 x2 y2 xn yn )
n
n
n
x y x y
i 1
n
n
x y
i 1
i
i 1
i
i
i
i 1
i
i 1
i
( x1 x2 xn ) ( y1 y2 yn )
• Rule V: Products are handled much like exponents
Example: Now let the values of a set (n = 3) of x and y
values be:
x1 = 4, x2 = 5, x3 = 6
y1 = 7, y2 = 8, y3 = 9
x y
i
4 7 5 8 6 9 122
i
x y
i
i
(4 5 6) (7 8 9) 360
Summation Notation: Compound Sums
We frequently use tabular data (or data drawn from matrices), with which we
can construct sums of both the rows and the columns (compound sums), using
subscript i to denote the row index and the subscript j to denote the column
index:
Columns
Rows
2
3
x
i 1 j 1
ij
x11 x12 x13
x21 x22 x23
( x11 x12 x13 x21 x22 x23 )
Pearson’s Skew Equation
3( x median)
s
Excel’s Skew Equation