Transcript Data Entry

Chapter Twelve
Data Processing,
Fundamental Data
Analysis, and the
Statistical Testing of
Differences
Chapter Twelve
Chapter Twelve Objectives
•
Develop an understanding of the importance and
nature of quality control checks
•
Understand the process of coding
•
Understand the data entry process and data entry
alternatives
•
Learn how surveys are tabulated and cross tabulated
•
Learn basic descriptive statistics
•
Understand the concept of hypothesis development
and how to test hypotheses
Chapter Twelve
Data Analysis Overview
Validation
and
Editing
Coding
Data
Entry
Chapter Twelve
Machine Tabulation
Cleaning
and
of Data Statistical
Analysis
Data Analysis Overview
Step One:
• Validation: Confirming the interviews / surveys occurred
• Editing: Determining the questionnaires were completed correctly
Step Two:
• Coding: Grouping and assigning numeric codes to the question responses
Step Three:
• Data Entry: Process of converting data to an electronic form
• Scanning the questionnaire into a database
Step Four:
• Clean the Data: Check for data entry errors or data entry inconsistencies
• Machine cleaning: Computerized check of the data
Step Five:
• One-Way Frequency Tables, Cross Tabulations
Chapter Twelve
Editing and Skip Patterns
Editing:
The process of ascertaining that questionnaires
were filled out properly and completely
Skip Patterns:
Sequence in which later questions are asked,
based on a respondent’s answer to an earlier
question
Chapter Twelve
Coding
Coding:
Grouping and assigning numeric codes to every
potential response to a question
The Process:
• List responses
• Consolidate responses
• Set codes
• Enter codes
• Keep coding sheet
Chapter Twelve
Data Entry
Data Entry:
Converting information to an electronic format
Intelligent Data Entry:
A form of data entry in which the information
being entered into the data entry device is
checked for internal logic
Chapter Twelve
Tabulation
The most basic tabulation is the one-way
frequency table:
Chapter Twelve
Cross-Tabulation Data
Bivariate cross-tabulation:
• Cross tabulation two items: “Business
Category” and “Gender”
Multivariate cross-tabulation:
• Additional filtering criteria—“Veteran
Status”. Now filtering three items.
Race/Ethnicity
(All)
Are You a Veteran?
Yes
You Liked the Chamber's Services (All)
Count of Respondent
Business Category
Computers/Technology
Construction
Manufacturing
Other
Professional
Grand Total
Gender
Female Male
Grand Total
1
3
4
1
1
5
5
3
2
5
1
1
9
7
16
Are You a Veteran?
(All)
You Liked the Chamber's Services (All)
Race/Ethnicity
(All)
Count of Respondent
Business Category
Computers/Technology
Construction
General Services
Manufacturing
No Response
Other
Professional
Retail
Wholesale
#N/A
Grand Total
Chapter Twelve
Gender
Female Male
Grand Total
5
7
12
2
4
6
1
1
13
6
19
1
4
5
15
11
26
1
3
4
4
4
8
1
1
2
1
1
42
42
84
Descriptive Statistics
Effective means of summarizing large data sets.
Key measures include: mean, median, mode, standard deviation,
skewness, and variance.
Significant discrepancies in “Mean”
and Median” should cause you to
look further into this data.
Years in Business
Mean
Standard Error
Median
Mode
Standard Deviation
Sample Variance
Kurtosis
Skewness
Range
Minimum
Maximum
Sum
Count
Chapter Twelve
22.4
2.6
15.0
5.0
23.1
534.5
3.8
2.1
98.0
2.0
100.0
1770.5
79.0
Measure of Central Tendency
Mean:
• The sum of the values for all observations of a
variable divided by the number of observations
Median:
• In an ordered set, the value below which 50 percent
of the observations fall
Mode:
• The value that occurs most frequently
Chapter Twelve
Measures of Dispersion
Variance:
• Sums of the squared deviations from the mean divided by the
number of observations minus one
• Same formula as standard deviation
Range:
• Maximum value for variable minus the minimum value for that
variable
Standard Deviation: Calculate by
• Subtracting the mean of a series from each value in a series
• Squaring each result then summing them
• Dividing the result by the number of items minus 1
• Take the square root of this value
Chapter Twelve
Statistical Significance
1. Mathematical differences
2. Statistical significance
3. Managerially important differences
Chapter Twelve
Hypothesis Testing: Key Steps
Step One: Stating the hypothesis
• Null Hypothesis: status quo proven to be true
• Alternative Hypotheses: another alternative proven to the true.
Step Two: Choosing the appropriate test statistic
• Test of means, test or proportions, ANOVA, etc.
Step Three: Developing a decision rule
• Determine the significance level
• Need to determine whether to reject or fail to reject the null
hypothesis
Chapter Twelve
Hypothesis Testing: Key Steps
Step Four: Calculating the value of the test statistic
• Use the appropriate formula to calculate the value of the
statistic.
Step Five: Stating the conclusion
• Stated from the perspective of the original research question
Chapter Twelve
Types of Errors in Hypothesis Testing
Type I error:
• Rejection of the null hypothesis when, in fact, it is true
Type II error:
• Acceptance of the null hypothesis when, in fact, it is false
One- and
Two-Tailed
Tests
Tests are either one- or two-tailed. This decision depends on the
nature of the situation and what the researcher is
demonstrating.
One-Tailed Test:
• “If you take the medicine, you will get better”
Two-Tailed Test:
• “If you take the medicine, you will get either better or worse.”
Chapter Twelve
Issues With Type I and II Errors
Actual State of the
Null Hypothesis
Fail to Reject Ho
Reject Ho
Ho is true
Correct (1- )
no error
Type I error ( )
Ho is false
Type II error ( )
Correct (1- )
no error
Chapter Twelve
Commonly Used Statistical Hypothesis Tests
1. Independent samples
2. Related samples
3. Degrees of freedom
4. p Values and significance testing
Chapter Twelve