Powerpoint slides
Download
Report
Transcript Powerpoint slides
WELCOME TO
MARKETING/BUSINESS
RESEARCH
1
MARKETING RESEARCH
Definition:
Used to implement the
____________
What is that?? (think intro)
2
Research is used to:
Identify problems/opportunities
3
Research is used to:
generate, and refine
marketing actions
4
Research is used to:
Plan and Implement the Marketing Mix
5
Research used to:
Monitor marketing
performance
6
When is Market Research
Warranted:
Time Constraints
Availability of Data
Nature of Decisions
Costs vs. Benefits
7
Sources of Marketing Data
Internal
sales records
customer complaints
inventory ...
External
Syndicated
Standardized
Customized
Advertising Agencies
Field Services
Tabulation Houses
Commercial
Databases
8
Research is NOT a Cure-All!
Classic Blunders
9
Why do I have to be here?
You will use
research for
decisions
Can easily bias
research
Numbers lie
10
RESEARCH ETHICS
11
In The Beginning
1950 Fear and Authority Studies
Animal Protection
Internal Review Boards
http://www.wvu.edu/~rc/irb/irb_guid/e
xempt.rtf
12
Business Ethics
Definition:
13
Teleological Ethics
Definition:Teleological moral
systems are characterized
primarily by a focus on the
consequences which any action
might have (for that reason,
they are often referred to as
consequentalist moral systems,
and both terms are used here).
Thus, in order to make correct
moral choices, we have to have
some understanding of what
will result from our choices.
When we make choices which
result in the correct
consequences, then we are
acting morally; when we make
choices which result in the
incorrect consequences, then
we are acting immorally.
14
Deontological Ethics
Definition:Deontological moral systems are
characterized primarily by a focus upon
adherence to independent moral rules or
duties. Thus, in order to make the correct
moral choices, we simply have to understand
what our moral duties are and what correct
rules exist which regulate those duties. When
we follow our duty, we are behaving morally.
When we fail to follow our duty, we are
behaving immorally.
15
Kohlberg – Value Maturity
Model
Three levels of maturity with six stages
of development
Self-centered level – (1) obedience and
punishment, (2) naively egoistic
orientations
Conformity level – (3) good person, (4)
“doing duty” orientations
Principled level – (5) contractual legalistic,
(6) conscience of principle orientations
16
Which is the “right”
perspective
17
Respondent’s Right to Choose
Can’t force compliance
Captive subject pools
Status of the researcher
Insure that incentives
do not create pressure
18
Respondent’s Right To Safety
Preserve anonymity
Preserve privacy
No mental stress
respect subjects
debrief subjects
Protect when questions are detrimental
to subject
Inform when special equipment used
19
Respondent’s Right to be
Informed
Informed consent/assent
Parental consent
Observation??
consider risks
consider alternative methods
Deception
20
Solutions
Actively think about ethics when
designing the study
Government
Institutional Review Board
Ethics Codes
http://cme.cancer.gov/c01/
AMA
Ethics Checklist
21
THE RESEARCH PROCESS
Stages in the Research Process
22
Define the Problem
(Stage 1)
Research objectives
Research questions
Properly formulate the problem
23
Conduct a Situation Analysis -Part of Problem Definition
General environment
Competitive
products or services
Consumers
Marketing Programs
24
Determine Research Design
(Stage 2)
How much should you spend?
What type of design should you use?
exploratory
descriptive
causal
25
Exploratory Designs
Used when you do
not have a good
understanding of the
problem and need to
gain insight
Used to:
Methods:
26
Descriptive Designs
Used to describe the
characteristics of
consumers,
competitors, etc.…
Methods
27
Causal Designs
Used to determine cause and effect
relationships.
MUST use experiments which include:
28
Preparation of the Design
Determine source of the data
primary
secondary
Determine data collection method
qualitative
quantitative
29
Sampling
(stage 3)
Sampling defined:
Who is to be sampled (the target
population)?
How big should the sample be?
Which sampling technique should be used?
30
Data Gathering
(stage 4)
Method Used
Stages
31
Data Processing and Analysis
(Stage 5)
Editing
Coding
Analysis
32
Conclusion and Report
Preparation
(Stage 6)
Written--the only tangible from the
study
interesting
easy to read
managerial implications
Oral
interesting
convincing
33
Secondary Research
The Place to Begin
34
Secondary Data
35
Secondary Data Advantages
Time
Money
Improve over other studies
Point of comparison for trends
Increase understanding of problem
36
Secondary Data Disadvantages:
Problems of Fit
Measurement units
differ
Class definitions
differ
Out of date
37
Secondary Data Disadvantages:
Problems of Accuracy
Primary vs.
secondary source
Purpose of
publication
General evidence of
quality
38
Internal Secondary Data
Sales invoices
Warranty cards
Departmental
records
Sales records
39
Locating External Secondary
Data
Identify what you need to know
Develop a list of key terms and people
Examine directories and guides
Write letters to key contacts
Talk to reference librarian
Do a computer search
Pull the information together
40
THE LAW
Always conduct secondary data
search before you do primary
data collection.
41
Qualitative Interviewing
Techniques
Focus Groups
Projective Techniques
3.
Depth Interviews
4.
Observation
1.
2.
42
Definitions (Yuck!):
Inquiry -- Person responds to a set of
Questions
Disguised:
Undisguised:
43
Final Definitions
Structured:
(I promise):
Questions:
Answers:
Unstructured:
Questions:
Answers:
44
Qualitative Methods
“Touchy-feely” – no numbers
Examine thoughts, feelings,
motivations…
Can be results be projected to the
population? Yes No
Can spot trends 3 to 4 years before
they show up in surveys
45
Focus Groups
______ homogeneous people carefully
recruited
Lasts _____
Types:
Round-table (Comfortable room with one-way
mirror)
Telephone
Internet
46
Focus Group Moderator
Keeps discussion focused
Truly believe that participants have wisdom
Encourages shy to talk and dominant
participants to be quiet
Should say little, but keep eye contact
Accepts all answers
Must be a quick study
47
Uses of Focus Groups
48
Advantages/Disadvantages of
Focus Groups
Advantages:
1.
2.
3.
4.
5.
6.
7.
Disadvantages:
1.
2.
3.
4.
49
Conducting a Focus Group
Register participants
(demographic information)
Small talk
Introductions
welcome
why they are here
guidelines or ground
rules
opening question
Ask questions
Anticipate flow
Control your
reactions
Probe as needed
Summarize the
discussion
50
Conducting a Focus Group:
Guidelines
Always Include:
taping discussion
do not talk over others
no names attached
sponsor of study
role of moderator to guide only
feel free to talk to each other
done by
first name basis
no wrong answers only differing
opinions
May Include:
don’t need to agree but listen to
their views
no cell phones or pagers
who will listen to tapes
who will see the report
how the report will be used
strictly research and no sales
location of the bathrooms
help yourself to refreshments
51
Developing Questions for
Focus Groups
Where to Begin:
General Rules:
52
Question Categories
Opening questions
Introductory questions
Transition questions
Key questions
Ending Questions
53
Projective Questions:
Used when subjects
cannot or will not
directly communicate
feelings
“A man is least himself
when he talks in his
own person; when he is
given a mask he will tell
the truth.”
E.g., TATs, inkblot
54
Word Association
Examine brand/service image
Measure frequency of responses and no
responses
Response Latency
Example
55
Sentence Completion
Gives more direction
than word
association
Examples:
When visiting the
President be sure
to_____________.
56
Unfinished Story
Finish the story or
tell why the person
acted the way he or
she did.
57
Third Person Role Play
What would the typical person do in
this situation?
We tend to think others are like
ourselves, yet we are more willing to
tell the truth about “others”
Example:
Why would your neighbor buy a Mercedes
58
Cartoon Completion
Subjects fill in the
bubble – suggests a
dialogue between
the characters
59
Draw a Picture
Subject given a topic to draw
Examples:
60
DEPTH INTERVIEWS
One-on-one interviews
Try to uncover underlying motivations,
prejudices and attitudes toward
sensitive information
61
Depth Interviewing Analysis
Laddering
Attributes
Consequences
Values
62
When to use depth interviews:
Sensitive subject matter
Need intensive probing
Respondent interaction unlikely to be
helpful
Have lots of $$$$ and time
Need detailed responses (> 15 minutes)
63
Some Boring Definitions:
Ethnographic/Observational Research
Direct Observation:
Indirect Observation
64
Observation can be disguised
or undisguised
65
Observation of Physical
Objects
Naturalist Inquiry
Physical Trace
evidence
wear on floor tiles
Garbology
Pantry Audit
66
Mechanical Observation
Television/Internet
Scanners
Eye Tracking
Psychogalvanometer
Response Latency
67
Experimental Research
Methods
Looking at Cause and Effect
Relationships
68
Experiment
Definition:
variable
manipulate
independent variable
dependent variable
69
Requirements for an
Experiment
Must have two or more groups of
subjects
experimental group(s)
control group(s)
Must use random assignments to
groups
controls for extraneous factors
70
Research Environments
Laboratory
experiment
Field experiment
71
Can NEVER prove causation ( X
Y)
Can only INFER such a relationship
72
Reasons for Association
between X & Y :
Common causes
Confounded factors
drowning and ice cream consumption
AIDS test of Rivavion
Coincidence
Causation
73
Evidence to Support Causation
Concomitant Variation
Temporal Ordering (time order of occurrence)
Elimination of Other Causes
74
Concomitant Variation
Required for Causation
1. Concomitant
variation
positively
negatively
75
Temporal Ordering Required
for Causation
76
Elimination of Other Possible
Causes Required for Causation
You must think this through, no one will
give you a list to check
Most difficult of the criteria to
determine
77
Internal Validity
Definition:
Threatened by:
history
maturation
instrumentation
selection bias (non-random assignment )
testing
78
External Validity
Definition:
Threats to external validity
reactive/interactive testing effects
surrogate situations
demand artifacts
79
Experimental Designs -Notation:
RR = random assignment of respondents
X = exposure to one of the possibly many
treatments
0 = observation of measurement of the
respondent
T = treatment effects
80
One-Shot (After Only)
X
O
Problems?
81
One-group Pretest-Posttest
O1
X
O2
Problems?
82
Static Group
X
O1
O2
PROBLEMS?
83
Before/After With Control
RR
RR
O1
O3
X
O2
O4
Problems?
84
After Only With Control
RR
RR
X
01
02
Problems?
85
SURVEY INTERVIEWING
TECHNIQUES
Methods that Use Large Sample
Sizes and Create Results that Can
Be Projected to the Populations
86
Mail Surveys/Self-Administered
Questionnaires
Def:
-cold
-panels
-fax
- e-mail
87
Internet/Computer Assisted
Surveys
Allow for lots of
branching/interactive
Allows for
personalization
Great anonymity
Representative Samples
88
Other Survey Methods
Telephone
Personal in-home (Door-to-Door)
Mall intercepts
-can interact with product
replacing ___________
89
Each Method Has Advantages
and Disadvantages
See page 172 for a summary
TREND – USE A COMBINATION OF
METHODS
90
Things to consider when
choosing method
Versatility
- Visual cues
- Degree of structure
- Complexity of questions
91
Consider Quantity of Data
Function of
questionnaire length shortest
________
- moderate length
________
-longest
_________
92
Consider Sample Control
Contact the right people
mailing list quality
interviewer qualifying
phone unlisted
Random Sampling error
93
Consider the Quality of Data
Response bias (see next slide)
Interviewer bias
Interviewer cheating
Poor questionnaire design
Sample bias
Systematic Errors
94
Response Biases
Acquiescence
Extremity
Interviewer
Auspices
Social Desirability
95
Consider Non-Response Error
Problems occur because the people
responding to the questionnaire differ
significantly from those not responding
Possible Self-selection bias
Example
-survey 500 students to see if they need transportation
to and from school
- 50 answer and say yes
-conclude that all 450 that did not answer do not need it
Did you make the correct conclusion?
96
Your Turn
Make up your own example of
nonresponse bias:
97
How to Increase Response
Rate
Prior notification
Motivate with rewards
Good looking
questionnaires
Good cover letter
Follow-up
Make it fun!!
98
Consider Speed
Phone is ____
Computer-assisted
phone/internet is
_____
Mail is ____
99
Consider Cost
Internet: relatively
inexpensive
Mail: depends on
pre-contacts and
follow-ups
Telephone: next
most expensive
Mall/In-home $30
up to $100
100
Specific Uses for Methods
Cold mail
Mail panels
general information, in-home use
Phone
respondents very interested in topic
nationwide samples
Mall intercept
copy tests, product tests, branding/package
testing
101
Measurement
Assigning Numbers To Reflect the
Degree or Amount of a
Characteristic
102
MEASURES OF
CENTRAL TENDENCY
MODE
MEDIAN
MEAN
103
Measurement Scales
Series of items that
are arranged
progressively
according to value
or magnitude
A series into which
an item can be place
according to its
quantification
104
Nominal Scale
Identification only
No order to the
numbers
Examples:
Measure of Central
Tendency:
105
Ordinal Scale
Ranked data
Distance between two
numbers is unknown
and uneven
Examples:
Measure of Central
Tendency:
106
Interval Scale
Rank to the data
Equal distance between
numbers
No “natural zero”
We assume a lot of scales
are interval
Measure of Central
Tendency:
107
Ratio Scales
Rank to the data
Equal distance between
numbers
“natural zero” where
zero means “none”
Measure of Central
Tendency:
108
YOUR TURN --Write a question
for each type of scale
Nominal
Ordinal
Interval
Ratio
109
Criteria For Good
Measurement
Reliability
Validity
Sensitivity
110
1) Reliability of Scales
Coefficient Alpha
Are the results on
questions measuring the
same thing consistent?
Single item scales more
suspect to random error
Test/retest
Are consistent results
found on repeated
measures
111
2) Validity
Are we measuring what we think we are
measuring?
Content validity (Face validity)
Pragmatic validity
112
3) Sensitivity
Refers to an instruments ability to accurately
measure variability in stimuli or responses
Example: I love to eat chocolate
Agree vs Disagree
Strongly
strongly
agree
mildly
neither
agree
agree or
disagree
mildly
disagree disagree
113
Noncomparative Continuous
Graphic Rating Scales
Place a mark on the line indicating how
important it is to have each of the
following at your vacation resort:
Alpine slides _____________________
unimportant
important
5 inch line
127 mm
1/20 inch
114
Graphic Rating Scales
Happy faces
Thermometer
115
Noncomparative Itemized
Rating Scale
Several categories from which the respondent
can choose
Top-box method:
How likely are you to buy a Sony DVD player in the next 3
mos.
definitely will buy
Probably will buy
Might buy
Probably will not buy
Definitely will not buy
116
Examples of Itemized Rating
Scales
Likert
Semantic Differential
Staple
117
Likert-type Scales
Sentences with which the respondent
agrees or disagrees
It would be cool to have a candy-red
1965 convertible Mustang
SD D Neither A SA
118
Likert-type Scales
Code such that higher numbers mean
better things
Can create summated scales to form an
index
Assume __________ scale
119
Semantic Differential
Series of attitude scales where repeated
judgments about a concept are made
Opposite adjective words or phrases
Use several of these and sum them
Fast
Bad
Service
Tasty
Food
__:__:__:__:__:__:__
Slow
__:__:__:__:__:__:__
Good
Service
__:__:__:__:__:__:__
Not Tasty
Food
120
Semantic Differential
Code such that higher numbers mean
better things or more of something
Make an overall score--sum the items
Develop a snake diagram (image
profile) to compare competitors
Assume _____ scaling
121
Staple Scale
Use +5 (describes completely) to -5
(does not describe at all)
Assume _______ scaling
Good for phone
Easy to construct
May look difficult for respondent
-5 -4 -3 -2 -1 FUN +1 +2 +3 +4 +5
122
Questions for Itemized
Response Scales
How many categories?
Balanced or Unbalanced?
Should you have a neutral point?
Forced or unforced?
123
Comparative Scales
Compare one set of objects directly
with another
sensitive
easy
can create artificial differences
124
Paired Comparison
Which do you prefer?
____ Barry Manilow
____ Counting Crows
____ Barry Manilow
____ Rolling Stones
____ Rolling Stones
____ Counting Crows
125
Paired Comparison Table
Manilow Crows
Manilow
Stones
-----
0.90
0.85
Crows
0.10
----
0.60
Stones
0.15
0.40
---126
Calculation of Rank-Order
Values
Manilow Crows
Stones
Manilow
Crows
Stones
127
Rank-order Scales
Respondents are simultaneously
presented with several objects that they
rank order
Please rate the following from 1=most
preferred to 4= least preferred
Pizza Hut
Mario’s
-Domino’s
-Little Ceaser’s
128
Comparative Continuous
Graphic Rating Scale
Similarity ratings used for perceptual
maps
Pitt and WVU
_________________________
Exactly
Completely
the same
different
129
Constant Sum Scales
Assign chips or
points to attributes
Very careful with
instructions
Difficult for the
respondent
130
Developing Questionnaires
The Art and Science of
Questionnaire Design
131
Preliminary Considerations
What information is required?
Who are the target respondents?
What data collection method will be
used?
132
Managerial Orientation
Make sure that all
information in the
questionnaire is
useful to the
manager
(demographics and
first question are
possible exceptions)
133
Make Sure Questions Are
Understandable
Do you need more
than one question?
Do respondents
have the information
needed to answer
the question?
134
Understandable Questions,
cont.
Can respondents remember the
information?
Is it too much work to get the
information?
135
Ways of Dealing with Sensitive or
Embarrassing Questions
State behavior is not unusual.
Early or late in the questionnaire?
early
late
Give categories for responses.
Phrase how others might act.
136
Need Mutually Exclusive and
Exhaustive Responses
Responses should not overlap
Must cover the entire range
Example:
137
Use Natural and Familiar
Language
Simple language
Language that the target market uses
Avoid ambiguous words:
DO NOT USE:
138
Avoid Bias
No loaded questions
Watch for sequence bias
139
No Double-Barreled Questions
A question that calls
for two responses
140
Response Formats
Open ended--respondent answers in his
or her own words
Uses:
Bad points:
141
Itemized Questions (closeended)
Fixed alternatives
Advantages:
MUST PRETEST
142
Types of Close-ended
Questions
Multichotomous (More than 2
responses)
Dichotomous (Only two responses)
143
Questionnaire Flow
Cover letter
First question very important, must be
_____________
_____________
Demographics late in the questionnaire
144
Sequencing
Funnel
Inverted funnel
Keep questions on related topic
together
Be very careful with branching
145
Layout
Booklets for multipage questionnaires
Attractive
Title, date, return
address on first
page
Color code
branching
146
Layout
Number the questions
Put the answers in all UPPER CASE
letters
What is better?
white space
save a page
147
Pretest the Questionnaire
First with a personal
interview
Make corrections
Next using the real
method
If you do not
pretest, you are
being
_________________
_
148
Sampling
The Statistical Adventure Begins
149
Populations
Def:
Census
Sample
Which is better?
census?
sample?
150
Step 1: Define the Target
Population
Must be very specific:
What is a user?
What demographics matter?
Are there geographic boundaries?
What is the relevant time period?
What is an element?
151
Step 2: Specify a Sampling
Frame
Def:
152
Sample Frame Problems
List may not match the target
population
over-registration
under-registration
153
Step 3: Selecting a Sampling
Method
Probability samples
example:
Non-probability samples
example:
154
What’s the Big Deal?
Probability samples let us estimate
_________
We can calculate a confidence interval
So, probability samples are more
representative than non-probability
samples.
true
false
155
Simple Random Sampling
Probability sample
Number each unit in the sampling
frame
Pick ___ units using a random numbers
table
NOT haphazard
156
Take a Simple Random Sample
(SRS) of n=3
Element
Natasha
Scotty
Kalie
Lynn
Gregory
Paul
John
Attitude toward Motel 6
6
7
4
2
8
4
7
157
Stratified Sample
Decide on stratification
variable
homogeneous groups
related to dept. variable
Divide population into a
few mutually exclusive
and exhaustive strata
Take a SRS from each
strata
158
Proportionate Stratified
Sample
Choose sample from strata in same
proportion as they are in the population
Population
Sample
Strata
proportion
proportion
159
Disproportionate Stratified
Sample
Take a larger sample from the strata
with ________ variance
What is variance?
Exercise: Develop two populations with
8 elements each.
Population 1: high variance, low mean
Population 2: low variance, high mean
160
Disproportionate Stratified
Sample
Population
Strata
Variance proportion
proportion
Sample
161
Why use Stratified Samples?
Make sure that you include certain
subgroups
More precise, IF we use the right
stratification variable
margin of error is ___________
sampling distribution is __________
confidence intervals are __________
What is the right variable?
162
Cluster Sampling
Divide population
into lots of
heterogeneous
clusters
Take a SRS of
clusters
Either:
sample all elements
in the selected
clusters
OR take a SRS of 163
Why use Cluster Samples
Cheap
Easy
Likely to be the way
the sampling frame
is set up
Problem
not precise, lacks
statistical efficiency
164
Non-probability Sample:
Cannot estimate margin of
error
Convenience or
accidental sample
If the sample size is
really large, we
know we have a
representative
sample
true
false
165
Judgment or Purposive
Sample
Elements selected because they can
serve the research purpose--they are
believed to be representative
Snowball sample
166
Quota Sample
Attempts to be
representative by
sampling
characteristics in the
same proportion as
the population
Interviewer chooses
sample
Are these
representative?
167
_____
Step 4: Determine the
Sample Size
Must take into consideration:
cost
time
industry standards
statistical precision
Discuss this in detail in the next chapter
168
Step 5: Select Elements
Actually collect the data
Clean-up the data
Put the data into the computer
169
Characteristics of Interest
# of elements
Population
N
Sample
n
Mean
U (mu)
X (x bar)
Variance
o2 (sigma
Sx2
Standard Deviation
O (sigma)
Sx
squared)
170
Step 6: Estimate the
Characteristics of Interest
Sample mean:
sum of the sample elements
X=
number of elements in sample
Sample variance = Sx
2
sum of deviations around the mean squared
sample size minus 1
171
Sample Standard Deviation
The square root of
the sample variance
= sx
Has a specific
meaning
172
Sampling Error
The difference between the :
population parameter
and the sample statistic
We look at confidence intervals to
estimate this but not until the next
chapter
173
Non-sampling Error
(i.e., all other kinds of errors
except for sampling error!)
174
Types of Non-Sampling Error
Sampling frame
Poor questions
Poor branching
Item non-response
175
More Non-Sampling Errors
Non-response
Interviewer bias
Interviewer cheating
Coding and editing problems
176
Which is the Larger Problem?
Sampling error
Non-sampling error
177
Sample Size Determination
Everything You Ever Wanted to
Know About Sampling
Distributions--And More!
178
Sampling Distribution
A frequency distribution of all the
means obtained from all the samples of
a given size
Example: $$ spent on CD’s at Tracks
Daffy
Donald
Sylvester
34.00
72.00
36.00
All samples of n=2
179
Your Turn
Develop a sampling distribution using
n=2
Calculate the population mean
CAR
A B C D E
Expected
Life
3
4
5
0
1
180
Sampling Distributions
The distribution of sample means is
skinnier than the distribution of
elements
Why?
The distribution is normal
The sampling distribution mean equals
the population mean
181
Standard Error
The variability in the sampling
distribution
Tells you how reliable your estimate of
the population mean is
If this is big (good or bad)
If this is small (good or bad)
WHY?
182
Standard Error
Sx
standard deviation
square root of the sample size
As the samples size gets bigger, the
standard error gets __________
183
Confidence Intervals
CI= Xbar +/- z (standard error)
Where:
z= _____ for 68% confidence
z= _____ for 95% confidence
z= _____ for 99.7% confidence
What confidence level should you use?
184
Develop a Confidence Interval
Estimate the
average number of
trips to the beach
taken by WVU
students during their
4-6 year career
xbar = 5
SD = 1.5
95% Confidence
Level
n=100
185
So,
There is a 95% chance that if all WVU
students were sampled regarding the
number of beach trips that the findings
would differ from our results by no
more than ____ in either direction.
186
or, maybe better,
If I were to conduct this study 100
times, then I would get _____ different
confidence intervals. If I have a 95%
confidence interval the ____ of the 100
CI’s will contain the true population
mean (mu) and ____ will not.
I sure hope that the confidence interval
I got is one of the 95 that contains mu!
187
Confidence Interval Issues
Reliability
Precision
how often we are correct
how wide the confidence interval is
The smaller the n, the _____ the CI
Given a particular n, the CI will be
_______ when we increase the
reliability
188
Factors that Influence n
Precision (H)
how skinny must can your CI be in order to
be able to take action on the results?
I will go to a water park.
DW
PW
Maybe PWN DWN
I will pay _____ for a musical card.
I will pay _____ for a motorcycle.
189
More Factors That
Influence n
Confidence level (z)
Population SD
Time, money and
personnel
190
Sample Size for Interval or
Ratio Data
Z2
H2
n=
* s2
Where:
z= 1, 1.96, or 3
H= precision (+/-) H
s2= variance (or standard deviation
squared)
191
Example: Average Number of
Books Bought Per Semester
H=0.25
s=1.5
Confidence = 95%
192
Sample Size for Nominal Data
n=
Z2
H2
*
(P) (Q)
Where:
Z= 1, 1.96, or 3
H= a percentage (e.g., 0.03--NOT 3)
P = initial estimate of the population
proportion
Q= (1-P)
193
n for Proportion of WVU
Students Who Read the DA
Do you read the DA?
1. YES
2. NO
Estimate that 60%
read the DA
Want a 99 % CI
Want a +/- 3%
precision
194
The Final Sample Size
Compute n for all nominal, interval and
ratio questions
most conservative
limited resources
195
Non-statistical Approaches to
n
All you can afford method:
subtract costs from budget
figure out cost per interview
divide leftover budget by cost per interview
Rules of thumb
196
Coding and Editing
Getting the data ready for analysis
197
Coding
Each response must have its own
variable name
Variable names can have up to 8
characters
Assigning numbers to responses to
enter data into computer
198
Creating a Coding Sheet
Must have a filename at top of questionnaire
Name_data.txt
First variable is ALWAYS the ________
Why?
Write the variable names on the
questionnaire next to the matching response
199
Coding
Coding Open-Ended Questions:
Code open-ended nominal __________
EX: What State is your current state of residence?
Code open-ended numerical – enter _______
Ex: How much would you pay for this product?
200
Coding
Coding fixed-alternative responses:
Assigned numbers should be logical
One variable needed for each answer the
respondent will give
rank order
semantic differential
“Check all that apply”
201
Editing
Cleaning up the data
Field edit
check for legibility
check for completeness
202
Office Editing
Outliers
Missing data
Blunders
Inconsistencies
203
Hypothesis Testing
Using the SAS System to Analyze
Questionnaires
204
Statistically Significant
Are these results for real, or did they
just occur by chance?
Remember, in sampling, all numbers
have ranges
205
Alpha and p-values
Alpha value:
the error rate you
are willing to accept
P-value
the error associated
with rejecting the
null hypothesis
206
Chi-square & T tables
t-distribution
chi-square distribution
For BOTH distributions
Area under the
curve =
Alpha & p-value are
areas under the
curve
critical value-associated with an
alpha level
calculated value-- 207
Chi-Square Goodness of Fit
When to use:
number of variables ________
scaling of variable _________
Basic idea:
could the numbers you get (the observed
value) come from a population which has
the pattern I expect? (the expected value)
208
Chi-square Goodness of Fit
Ho: This sample could have come from
a population which has this pattern:
________________________________
__
________________________________
__
Ha: There is a different pattern in the
population than I expect (or hope). 209
Chi-Square Goodness of Fit
Chi-square calculated=
sum of (Observedi -Expected i )
Expected i
2
degrees of freedom = number of cells 1
Alpha Value
Table Value
210
Now Graph
chi-square
calculated
chi-square table
value
211
Chi-Square Goodness of Fit
What type of dairy
treat do you like
best?
1. hard scoop ice
cream
2. soft serve ice cream
3. chocolate covered
ice cream bars
Ho:
Ha:
Chi-square Calculated:
Degrees of Freedom:
Chi-square Table:
Graph
212
Chi-square Goodness of Fit
Rules
If the chi-square
calculated is in the tail,
then _______ Ho;
conclude the pattern in the
data. is NOT what you
expected or wanted.
If chi-square calculated is
in the hump, then
_______ Ho; conclude, the
pattern IS what you
213
expected or wanted.
Do It Yourself Using SAS --
What is the pattern for the favorite brand of
soda?
Ho:
Ha:
Chi-square Calculated:
214
Do It Yourself Using SAS -(cont.)
Degrees of Freedom:
Chi-square Calculated:
Chi-square Table:
Graph
Conclusion:
215
Chi-square for Two Variables
When to use:
number of variables ________
scaling of variables ________
Basic Idea:
Compare the values you actually get from
you study to the values you would expect if
there was ____________between the
two variables
216
Chi-square for Two Variables
Ho: There is no relationship between
____ and _____
Ha: There is a relationship between
____ and _____; SPECIFICALLY
_________________
NO CALCULATIONS!! SAS DOES THIS
ONE
217
Chi-Square for Two Variables
Alpha level
Probability level
218
Chi-Square for Two Variables
If the probability level is > _____, do
not reject Ho,
conclude________________
If the probability level is < _____ then
reject Ho, conclude ______________
AND specify the nature of the
relationship.
CAREFUL--Do not just assume that the
relationship you predicted is correct 219
If you Reject HO -How Strong is The Relationship?
Look at Phi
Phi < 0.10 is
______
Between 0.11 and
0.40 is __________
Phi > 0.40 is
_______
220
Do it yourself using SAS
Ho:
Ha:
Chi-square calculated:
Probability level:
Alpha level
Phi:
Conclusion:
221
Rank-Order Tests
It’s 12:00. Do you know what
your Ha is?
222
Rank-order Data & Chi-square
When to use:
number of variables ________
scaling of variable ________
Basic idea: compare the observed value
(________________) with the values
you would expect if NO PREFERENCE
was shown in the data
223
Hypotheses & Calculations
Ho: There is no ranking in the data-there is no preference.
Ha:
_____________________________,
specifically, ______________________.
Chi-square calculated:
Sum of (Observedi -Expectedi)2
Expectedi
224
Rank-order Chi-square
First, multiply the rank-order data for each
variable
Variable 1 score = 1 (___) + 2 (___) +3
(___) ...
Variable 2 score = 1 (___) + 2 (___) +3
(___) ...
Variable 3 score = 1 (___) + 2 (___) +3
(___) ...
Compute expected value
Add up the total scores and divide by the
number of variables
225
Example
Ranking of movies: Mulan, Private Ryan,
Titanic (n=20)
1
2
3
Mulan
10
4
PR
7
8
Titanic
3
8
6
5
9
Mulan ranking = 1 (___) + 2 (___) +3 (___)
PR ranking = 1 (___) + 2 (___) +3 (___)
Titanic ranking = 1 (___) + 2 (___) +3 (___)
Expected value:
226
Rank-order Chi-square
Degrees of freedom
Alpha level
Chi-square table
Graph chi-square and chisquare calculated
Conclude
Managerial implications
227
Rules
If the chi-square calculated is in the tail,
then _______ Ho, conclude that there
is a preference shown in the data.
EXAMINE THE DATA TO
DETERMINE PREFERENCE. (It may
not be what you hypothesized!)
If the chi-square calculated is in the
hump, then ___________ Ho.
Conclude there is no preference shown.
228
Do This With the Soda Rankings
Rank calculations
Expected value
Ho:
Ha:
229
Do This With the Soda Rankings
(continued)
Chi-square calculated:
Chi-square table:
Graph:
Conclusion: reject or do not reject Ho
Managerial implication
230
T-test for One Mean
When to use:
Basic idea
number of variables __________
scaling of variables
__________
Look at the confidence intervals. Any
numbers in the same confidence intervals
are considered the same.
Key question--If my sample mean
(xbar) is ___, can my population
mean (mu) be ___?
231
Hypotheses for T-test for One
Mean
Interested in the average number of
sodas drunk per day.
Ho: The opposite of Ha: The
population mean is equal or
(less/greater than or equal to) the the
number hypothesized.
Ha: What you need to be actionable.
The population mean is (less than/
greater than) _____.
232
Example Ho and Ha
Ho: × = µ
Ha: × ≠ µ (two-tailed test)
Ho: X ≥ µ
Ha: X < µ (one-tailed test – lower tail)
Ho: X ≤ µ
Ha: X > µ (one-tailed test – upper tail)
233
Calculations
T calculated =
xbar - mu
standard error
Where:
xbar = sample mean
mu = hypothesized population mean
234
More T Calculations
Degrees of
freedom=
n-1
Alpha level=
T-table value =
235
Now Graph
t-calculated and
t-table value
on a normal curve
236
Rules for T-test for One Mean
If the calculated t-value is in the hump,
________ Ho. Conclude that your Ha is
not correct.
If the calculated t-value is in the tail
then _____ Ho. Examine your data to
see if Ha or the opposite of Ha is
correct.
237
Practice Once
Ho:
Ha: The populations purchase intention for a
gumball machine is >4.
X-Bar: 4.5 SE= 0.15, n=60
T-calculated
Degrees of freedom
T-table
Graph
Conclusion: Reject or Do not Reject Ho
Managerial implication:
238
Practice Again!
Ho:
Ha: The populations purchase intention for a
gumball machine is >4.
X-Bar: 2.3 SE= 0.18, n=60
T-calculated
Degrees of freedom
T-table
Graph
Conclusion: Reject or Do not Reject Ho
Managerial implication:
239
Now Use SAS
Ho:
Ha: The average population rating for Coke when
consumers know it is Coke is >6.
T-calculated
Degrees of freedom
T-table
Graph
Conclusion: Reject or Do not Reject Ho
Managerial implication:
240
T-test for Two Means
When to use:
Number of variables = _______
One variable (the groups) is _______ scaled
One variable (the dependent variable) is ________
scaled
Basic idea:
See if the confidence intervals for the two different
groups overlap. If they do, then
_________________________________ .
241
Hypotheses for T-test for Two
Means
Is there a difference between the number of
sodas males drink per day and the number of
sodas females drink per day?
Ho: The two groups are the same with
respect to __________ .
Ha: The two groups are different with
respect to _______. Specifically,
______________.
242
More on T-tests for Two
Means
No calculations
Check to see if variances are equal or
unequal
Look at “Equality of Variances” –
Ho: variances are equal
Ha: variances are not equal
If p>.05 accept Ho and use equal variances
If p<.05 reject Ho and use unequal variances
243
More on T-tests for two means
Check the T-test table to see if you
should accept or reject your Ho:
•
T-value =
(either for equal or unequal variance)
•
P-value =
244
Rules
If the probability level is ________
0.05, then ________ Ho. Conclude that
the two groups are different. LOOK AT
THE DATA TO DETERMINE WHAT THE
DIFFERENCE IS.
If the probability level is ______ 0.05,
then __________ Ho. Conclude that
the two groups are the same.
245
Your Turn
Is there a difference in the number of sodas
drunk per day between people who drink
soda with breakfast, and people who do not?
Nominal variable= ___________
Interval variable = ___________
Ho:
Ha:
Probability level
Conclude--reject or do not reject ho
Managerial Implication
246
Your Turn again
Is there a difference in the number of sodas
drunk per day between people who drink
soda with breakfast, and people who do not?
Nominal variable= ___________
Interval variable = ___________
Ho:
Ha:
Probability level
Conclude--reject or do not reject ho
Managerial Implication
247
ANOVA
When to use:
Testing mean differences between groups
Have more than 2 groups
Want to test interactions between 2 variables
Same as a t-test except that you have more
than two groups
Number of variables = _______
Some variables (the groups) are _______ scaled
One variable (the dependent variable) is ________
scaled
248
Ho: all the means are equal
Ha: one of the means differs (specify
how the mean differs)
249
No Calculations: SAS does
this
Use Proc GLM
Class: the nominally scaled variable(s)
Model: specifies the dependent variable, the
dependent variable and interactions
e.g.,
class= age;
model liking= age;
mean = age;
250
Interpretation:
Dependent variable: liking
Source
DF
Sum of
Squares
Model
6
93.52
15.59
Error
75
433.02
5.77
Corrected Total
Source
Model
81
Mean Square
526.55
F Value
2.70
Pr>F
0.02
NOTE:to determine significance – check the p value (if p less than
.05 reject Ho)
251
Do it yourself using SAS
You want to test whether age has an
impact on the number of sodas
consumed per day
HO:
HA:
252
F-calculated
Alpha
P-value
Conclusion:
253
THE END!!!
254