Transcript Powerpoint
STATISTICS
David Pieper, Ph.D.
[email protected]
Types of Variables
Categorical Variables
Organized into category
No necessary order
No quantitative measure
Examples
male,
female
race
marital
status
treatment
A and treatment B
Types of Variables
Continuous Variables
Have specific order
Examples:
weight
temperature
blood
pressure
Age
Test
score
May be converted to categorical
Descriptive Statistics
Measures of central tendency
mean
(average)
Measures of variability
range
standard
deviation
Results of Memory Test
Age
Gender
Age
Group
Student
or Parent
Total
Score
17
M
HS
S
52
16
M
HS
S
49
30
F
Adult
P
50
16
M
HS
S
47
43
F
Adult
P
41
36
M
Adult
P
51
16
F
HS
S
43
43
F
Adult
P
41
36
F
Adult
P
33
Descriptive Statistics for
Memory Test
Age
Total Score
245
245
Minimum
7
12
Maximum
72
54
25.2
37.3
16
8
Number of Cases
Mean
SD
Research Hypothesis
Null hypothesis: relationship among
phenomena does not exist
Example: Age does not have an
influence on memory
Probability and p Values
p < 0.05
1
in 20 or 5% chance groups are not
different when we say groups are
significantly different
p < 0.01
1
in 100 or 1% chance of error
p < 0.001
1
in 1000 or .1% chance of error
Type of Statistical
Test to Use
Continuous variable as end point
2
groups: t-test
3
or more groups: ANOVA
Relation between 2 categorical variables:
Chi-square
Fisher’s
test
Exact test (2 x 2)
Relation between 2 continuous variables:
Regression
analysis or correlation
T-test
When comparing 2 groups and endpoint variable is continuous
Purpose is determine if the difference
between the 2 groups is unlikely due to
chance
T-test
Examples:
Blood pressure before and after exercise
program
Would parents do better on a memory test
than students
Results of Memory Test
Age
Gender
Age
Group
Student
or Parent
Total
Score
17
M
HS
S
52
16
M
HS
S
49
30
F
Adult
P
50
16
M
HS
S
47
43
F
Adult
P
41
36
M
Adult
P
51
16
F
HS
S
43
43
F
Adult
P
41
36
F
Adult
P
33
T-test results comparing Parents
and Students Total Score
Number
Mean
SD
Students
140
36.4
7.9
Parents
105
38.5
8.1
p < 0.05
Parents had higher scores than students
Analysis of Variance
(ANOVA)
When comparing 3 or more groups and
end-point is continuous
Example: Compare score on memory
test among:
Grade
school students
Middle school students
High school students
Parents
Total Score
Total Score
40
38
36
34
32
30
28
26
24
22
20
Grade
School
Middle
School
High School
Adult
Analysis of Variance p < 0.001
High School Students and Adults scored better than Grade School or
Middle School Students
Middle School Students scored better than Grade School Students
Chi-square Test
When comparing 2 or more groups and
the end point is categorical
Chi-square Gender
and Parent vs Student
% Male
50
Female
Male
Total
Total
79
63
142
61
(44%)
42
(40%)
103
140
105
245
40
Percent
Student Parent
30
20
10
0
Students
P = 0.6
There was no significant gender
difference between students and
parents
Adults
Correlation or Regression
When determining if there is a linear
relationship between 2 continuous variables
Ranges from -1 to 1
Pearson’s Correlation Coefficient
Diastolic BP (mm)
Weight (kg)
90
82
140
114
68
56
110
62
100
83
95
110
Is Diastolic BP related to Weight?
r = 0.805 p < 0.01
Correlation of Age and Score on Memory Test
r = 0.4
No correlation of age and score on memory test
Illustrations: Use Graphs
p < 0.01
• Label axes
• Include brief
description
Patients that failed the exercise test had a
higher mortality than patients that passed
Free Statistics Software
Mystat: http://www.systat.com/MystatProducts.aspx
List of Free Statistics Software:
http://statpages.org/javasta2.html