Execute the research
Download
Report
Transcript Execute the research
Chapter Fourteen
Data Analysis: Testing for
Association
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-1
Learning Objectives
Understand and evaluate associations
between variables
Explain the concept of covariation
Discuss the differences in Chi-square
value, Pearson product moment
correlation and Spearman rank order
correlation
Understand when and how to use
analysis
Apply discriminant analysis and
conjoint analysis to examine marketing
research problems
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-2
Introduction
PHASE III:
Execute the research
Marketing Research
Analyse the data
Step 8:
The concept of relationship:
A relationship is a consistent and
systematic link between two or
more variables
Visit correlation and regression at:
<www.med.umkc.edu>
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-3
Introduction
PHASE III:
Execute the research
Marketing Research
Analyse the data
Step 8:
Data analysis also entails the
process of testing and assessing
associations between variables
Associations between variables
include:
Presence of association
Direction of association
Strength of association
Type of association
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-4
Using Covariation to Describe
Variable Relationships
Covariation is the amount of
change in one variable that is
consistently related to the change
in another variable of interest
A scatter diagram is a graphic plot
of the relative position of two
variables using a horizontal axis
and a vertical axis to represent the
values of the respective variables
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-5
Using Covariation to Describe
Variable Relationships
Nature of scatter diagrams
Changes in the value of Y are often
systematically related to changes in
the value of X
Relationships can be either positive
or negative
If patterns are not linear, they may be
curvilinear
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-6
Types of Association:
Linear—Positive and Negative
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-7
Types of Association:
Nonexistent and Curvilinear
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-8
The Chi-square Value
Chi-square (χ2)
Used to determine whether the presence of
one variable is systematically associated
with the presence of another
Does not assess direction or strength
A Chi-square test is often referred to as a
‘goodness-of-fit’ test
Has categorical questions and allows a research
team to assess how closely what’s ‘observed’ fits
the pattern of what was ‘expected’ or anticipated
For example, did the responses to a survey follow a
particular pattern?
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-9
Chi-square Analysis
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-10
Treatment and Measures of
Correlation Analysis
The Pearson product moment correlation
coefficient is a statistical measure of the
strength and direction of a linear
relationship between two interval- and/or
ratio-scaled (metric) variables
Ranges from –1.00 to 1.00. The higher the
correlation coefficient, the stronger the level of
association
The correlation coefficient can be either + or –
depending on the relationship between 2 variables
If the correlation coefficient is statistically
significant, the null hypothesis is rejected. If it is
not significant then it has no meaning
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-11
Pearson Correlation Coefficient
Formula
The Pearson correlation coefficient
formula makes several assumptions
about the nature of the data:
Two variables are assumed to have been
measured using interval- and/or ratio-scaled
measures
The nature of the relationship to be measured is
linear
Variables to be analysed come from a bivariate
normally distributed population
Null hypothesis states that there is no
association and that the correlation coefficient
is 0
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-12
Correlation Analysis
A correlation analysis is often
referred to as ‘plus or minus one’
by some practitioners of marketing
research
Correlation allows a research team to
determine the sturdiness of a linear
relationship between two variables
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-13
Rules of Thumb about the
Strength of Correlation
Coefficients
Range of Coefficient
Description of Strength
.81 to 1.00
Very strong
.61 to .80
Strong
.41 to .60
Moderate
.21 to .40
Weak
.00 to .20
None
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-14
Coefficient of Determination
When the Pearson product moment
correlation is strong and significant, then
the variables are associated in a linear
fashion
The coefficient of determination (Rsquare) is the square of the Pearson
product moment correlation
Between 0.0 and 1.0
Shows the proportion of variation in one
variable accounted for by another
For example, R-square = 0.36 means that 36%
of the variation in one variable is associated
with the related variable
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-15
Treatment and Measures of
Correlation Analysis
Statistical
significance
Correlation coefficient is
converted into a student’s t test
statistic.
Can only show association;
doesn’t allow researcher to make
causal statements.
Substantive
significance
Coefficient of determination is a
number measuring the proportion
of variation in one variable
accounted for by another.
The larger the number, the
stronger the linear relationship
between two variables under
study.
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-16
SPSS Application: Pearson
Product Moment Correlation
Coefficient Example
Descriptive Statistics
Mean
Std.
Deviation
N
Recommend to
Friend
4.68
.98
50
Satisfaction Level
4.78
.95
50
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-17
Correlations Example
Correlations
Recommend
to Friend
Recommend to Friend
Pearson
correlation
1.000
Sig. (2-tailed)
N
Satisfaction Level
Pearson
correlation
Satisfaction
Level
.601**
.000
50
.601**
Sig. (2-tailed)
.000
N
50
50
1.000
50
** Correlation is significant at the .01 level (2-tailed)
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-18
The Spearman Rank Order
Correlation Coefficient
The Spearman rank order correlation
coefficient is a statistical measure of
the strength and direction of a linear
relationship between two variables
where at least one is ordinal-scaled
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-19
Spearman’s Rank Order
Correlation
Correlations
Food Quality Rank
Food Variety Rank
Correlation Coefficient
Sig. (2-tailed)
N
Correlation Coefficient
Sig. (2-tailed)
N
Spearman’s Rho
Food Quality
Food Variety
Rank
Rank
1.000
-.495**
.000
50
50
-.495**
1.000
.000
50
50
**Correlation is significant at the .01 level (2-tailed).
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-20
Describing Relationships and
Making Predictions
Beyond describing relationships,
researchers also may wish to
predict potential effects of X on Y
A predictive model
A mathematical model based on
relationships found among Y and X
variables to make a prediction
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-21
Regression Analysis
Regression equations include 2
basic approaches:
Bivariate regression analysis—X1 and Y1
Multivariate regression analysis—X1, X1, X2
… Xn and Y1
Assumptions
•Two variables are assumed to have
been measured using interval- and/or
ratio-scaled measures
•Nature of the relationship to be
measured is linear
•Variables to be analysed come from a
bivariate normally distributed
population
•Error terms associated with making
predictions are normally and
independently distributed
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-22
Regression Analysis
A regression analysis is often
referred to as the R-square test
Regression analysis designates an
independent variable as a ‘predictor’,
even though few (if any) relationships
on the marketing landscape are truly
deterministic in nature.
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-23
What is Regression Analysis?
Marketing managers need to make
predictions about situations affecting
the company and they may use a
regression equation.
Bivariate
regression
analysis
A statistical technique which
analyses the linear relationship
between two variables by
estimating coefficients for an
equation for a straight line.
One variable is the dependent
variable and the other is the
independent variable.
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-24
Regression Analysis
Once a regression equation has been
developed to predict values of Y, the
interest is in trying to find out how
accurate that prediction is.
Look at the actual value collected in the
sample
Compare this with what has been
predicted and see what the difference is
This is the common way to judge the
accuracy of a regression equation or
model
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-25
Points About Comparing
Actual and Predicted Values
Point 1—Differences between actual and predicted Y
values are represented by the error term of the
regression equation—ei
Point 2—By squaring e for each observation, and
summing them, the total represents an aggregate or
overall measure of the accuracy of the regression
equation
Point 3—Error terms can also be used to diagnose
potential problems
The pattern of errors produced by comparing actual Y
values with predicted Y values can show as errors are
normally distributed and/or have equal variances
across the range of X values
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-26
Patterns of Residuals—
Three Examples
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-27
Patterns of Residuals—
Three Examples (continued)
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-28
Developing and Estimating
The Regression Coefficients
Statistical
significance
Is there a
relationship?
If so, we need to
know how strong
that relationship is.
So …
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
Substantive
significance
Coefficient of
determination describes
the percentage of the
total variation in the
dependent variable that
can be explained by
using the independent
variable.
Look at size of
R-square and
regression
coefficient.
14-29
Summary of Selected
Multivariate Methods
Multiple regression enables the
marketing researcher to predict a single
dependent metric variable from two or
more metrically measured independent
variables.
Discriminant analysis can predict a
single dependent non-metric variable
from two or more metrically measured
independent variables.
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-30
Summary of Selected
Multivariate Methods
Conjoint analysis is used to estimate
the value (utility) that respondents
associate with different product and/or
service features, so that the most
preferred combination of features can
be determined.
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-31
Multiple Regression Analysis
A statistical technique which analyses
the linear relationship between a
dependent variable and multiple
independent variables by estimating
coefficients for the equation for a
straight line
Regression coefficient is calculated for each
variable (X) that describes its relationship
with the dependent variable (Y)
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-32
Multiple Regression Analysis
Multiple regression analysis is a useful
technique because most information
problems (and market opportunities)
involve several independent variables
Decision makers benefit from multiple
regression, since management can explore
how a ‘batch’ of independent variables
influence a dependent variable from the
output
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-33
Multiple Regression Analysis
These coefficients allow the marketing
manager to examine the relative
influence of each independent
variable on the dependent variable
This relationship is still linear
However, with more than one
independent variable we have to think
in terms of multiple independent
dimensions
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-34
Multiple Regression Analysis
To investigate these relationships
Examine the regression coefficients
They describe the average amount of change
expected in Y given a unit change in the
value of X (whichever X you are considering)
Each regression coefficient describes the
relationship of that variable to the dependent
variable
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-35
Multiple Regression Analysis
When multiple independent variables
have different scales, is it not possible
to make relative comparisons?
Calculate the standardised regression
coefficient
Beta coefficient—an estimated regression
coefficient that has been recalculated to have a
mean of 0 and a standard deviation of 1
This allows for variables with different units to be
compared directly on their association with Y
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-36
Multiple Regression Analysis
What is the strength of association?
R-square—Coefficient of determination
A measure of the amount of variation in Y
associated with the variation in the independent
variables, considered together
The larger the R-square, the more the behaviour
of Y is associated with the independent measures
used to predict it
R-square = .78, means that 78% of the variation in
sales revenue in the Canon example is accounted
for, or explained by, the variation in sales force,
advertising budget and customer attitudes
towards the products
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-37
Examining Residuals:
Standardised Residuals vs Normal
Distribution
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-38
Multicollinearity
Multicollinearity is a very important
problem marketing researchers need to
address
When multicollinearity exists in a data
set, the research team is challenged
because it makes it difficult (sometimes
impossible) to estimate separate
regression coefficients for correlated
variables
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-39
Multicollinearity and Multiple
Regression Analysis
Multicollinearity is a situation in which
several independent variables are
highly correlated with each other
The effect makes it difficult for the
regression equation to separate out the
independent contributions of the
independent variables
This situation inflates the standard error of
the coefficient and lowers the t-statistic
associated with it
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-40
Discriminant Analysis
A research team uses discriminant
analysis to classify groups or objects
by a set of independent variables
Discriminant analysis enables a marketing
researcher to determine linear
combinations of the independent variables
of interest to the client
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-41
Discriminant Analysis
Discriminant score—the basis for
predicting which group an individual
belongs to:
Zi = b1X1i + b2X2i + b3X3i + . . . +
bnXni
Z
= ith individual’s discriminant score
bn = Discriminant coefficient of the nth variable
Xni = Individual’s value on the nth X variable
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-42
Discriminant Analysis
bn = Discriminant coefficient of the
nth variable
Estimates of the discriminant power of a
particular X variable
The size of the coefficient is determined by
the variance structure of the variables
X’s with a large discriminant power will have
large weights
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-43
Discriminant Analysis—An
Example to Consider
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-44
Discriminant Analysis
Critical discriminators of likelihood
Is about the prediction of a categorical
variable—group membership, likelihood
to buy, recommend, etc.
Is the direction of group differences
based on finding a linear combination of
X’s
The discriminant function shows large
differences in group means
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-45
Conjoint Analysis
Conjoint analysis is used to estimate
the value (utility) that respondents
associate with different product
and/or service features, so that the
most preferred combination
of features can be determined
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-46
Conjoint Analysis
Estimates the relative importance
placed on the different attributes of a
product or service
As well as the value that is attached to
the various levels of each attribute
The assumption is that consumers
form preferences for products by
evaluating the overall utility or value
of the product
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-47
Conjoint Analysis
Marketing researchers draw upon the
power of conjoint analysis to estimate
the value (utility) respondents associate
with different product and/or service
features
Conjoint analysis, then, allows a marketing
research team to communicate the most
preferred combination of features to a client
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-48
Conjoint Analysis
Collect data, then produce a partworth estimate for each level of each
attribute
Estimate the total worth of the product
profiles, then compare it to
consumer’s choice ranking
Attribute importance estimate—the
importance of an attribute of an object
as estimated by conjoint analysis
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-49
Conjoint Analysis
To quantify the value that people
associate with different levels of
product/service attributes
However, there are limitations
Suffers from artificiality:
Respondents may be more deliberate than
in a real situation
Respondents may have additional
information when compared with reality
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-50
Conjoint Analysis—An
Example to Consider
Attribute
Restaurant Profile A
Restaurant Profile B
Price level
Inexpensive ($6–$10)
Moderate ($10–$20)
Atmosphere
Family style
Smart
Menu type
Sandwiches
Salad, main, dessert
Service level
Self-service
Table service
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-51
Conjoint Analysis—An
Example to Consider
Copyright 2004 McGraw-Hill Pty Ltd.
PPTs t/a Marketing Research by Lukas, Hair, Bush and Ortinau
14-52