Lecture 11 - Statistics
Download
Report
Transcript Lecture 11 - Statistics
Statistics 400 - Lecture 11
Today: Finish 8.4; begin Chapter 9
Assignment #4: 8.54, 8.103, 8.104, 9.20, 9.30
Not to be handed in
Next week…Case Studies and Review
No Lab Friday and Next Tuesday
Example
A company wishes to monitor customer service
A senior manager claim that the average waiting time is 3 minutes
The waiting times of 75 randomly selected calls to a customer service hotline were recorded
The calls had a sample mean of 3.4 minutes and sample standard deviation
of 2.4 minutes
Test this claim with a significance level of 0.05
Types of Errors
What is the probability of rejecting H0 when it is indeed true?
If H0 is true, how often do we make the right decision?
Have only considered probability of rejecting when H0 is true
Suppose H0 is not true. Ideally, what happens?
Have 2 types of error:
Type I error: Reject H0 when H0 is true
Type II error: Fail to Reject H0 when H0 is false
The probability of a type I error is:
Probability of type II error is:
Power of a Test is:
Small Sample Inference for Normal Populations
Do not always have large samples
When is it reasonable to use the Z -test statistic with an unknown
standard deviation?
X
S/ n
This statistic has a different sampling distribution called the
Student’s t-distribution
Student’s t-Distribution
Have a random sample of size n from a normal population
The distribution of the sample mean is:
Distribution of Z is:
When population standard deviation is unknown, it is estimated by
the sample standard deviation
Will use
T
X
as our test statistic
S/ n
Student’s t-Distribution
If x1, x2, …, xn is a random sample from a normal population with
mean , and standard deviation , then
t
X
S/ n
has a t-distribution with (n-1) degrees of freedom
t-distributions are:
Bell shaped
Symmetric
They are not exactly like normal distributions. Why?
Probabilities for the t-distribution
How many different t-distributions are there?
Table 4 can be used for computing probabilities for various
t-distributions
The probability being computed is a “greater than” probability
Example
Suppose T has a Student’s t-distribution with 10 degrees of
freedom
P(T>2.228)=
P(T<2.764)=
P(T>0)=
P(T>2.5)=
What is the 90th percentile of this distribution?
What is the 10th percentile of this distribution?
Small Sample Confidence Interval for the
Population Mean
If x1, x2, …, xn is a random sample from a normal population with
mean , and standard deviation , then a 100(1 )% confidence
interval for the population mean is:
S
X t / 2 n
If you have use a
distribution instead!
Example:
Heights of males are believed to be normally distributed
Random sample of 25 adult males is taken and the sample mean &
standard deviation are 69.72 and 4.15 inches respectively
Find a 95% confidence interval for the mean
Small Sample Hypothesis Test for the
Population Mean
Have a random sample of size n ; x1, x2, …, xn
H 0 : 0
Test Statistic:
t
X
S/ n
Small Sample Hypothesis Test for the
Population Mean (cont.)
P-value depends on the alternative hypothesis:
H1 :
0 : p - value P(T t )
H1 :
0 : p - value P(T t )
H1 :
0 : p - value 2P(T | t |)
Where T represents the t-distribution with (n-1 ) degrees of
freedom
Example:
An ice-cream company claims its product contains 500 calories per
pint on average
To test this claim, 24 of the company’s one-pint containers were
randomly selected and the calories per pint measured
The sample mean and standard deviation were found to be 507 and
21 calories
At the 0.01 level of significance, test the company’s claim
What assumptions do we make when using a t-test?
How can we check assumptions?
Can use t procedures even when population distribution is not
normal. Why?
Practical Guidelines for t-Tests
n<15: Use t procedures if the data are normal or close to normal
n<15: If the data are non-normal or outliers are present DO NOT
use t procedures
n>15: t procedures can be used except in the presence of outliers
or strong skewness
t>30: t procedures tend to perform well