American Intercontinental University

Download Report

Transcript American Intercontinental University

QMB-350
Leah Murray
 Independent

The variable we are basing a prediction on
 Dependent

Response variable
Name the independent and dependent variables
-Hours of studying and grade
-temperature and heating bill
- Age of computer and value
- Size of home and property tax bill
 Positive

Both variables either increase or decrease



Size of house and property tax bill
Temp and air conditioning use
Years of experience and salary
 Negatives

One increases and the other decreases



Age of car and value
Temperature and heating bill
Age and strength
 Strength
of relationship between two
variables
 Correlation coefficient- measures strength of
relationship



Strong: time spent on assignment and grade
Weak: number of sit ups performed and weight
None: Hair color and stats grade
A
graph with the independent variable on the
x axis and the dependent variable on the y
variable
 Visual way to describe the relationship
between two variables
 Measures
both the strength and direction of
the correlation



Close to +1=strong positive relationship
Close to -1=strong negative relationship
Close to 0: No relationship
**This is calculated in excel***
 WARNING:
Just because something has a
strong correlation does NOT mean that there
is causation
 Example:
Shoe size and spelling ability
 Line
placed on a scatter plot to minimize the
distance between the lines and the data
points
 The
line of best fit has an equation with two
components

Slope- how steep the line is and direction of
correlation


Number with the x
Y intercept-where the graph crosses the Y axis

Number all alone
 This
is calculated in excel
Minutes
Grade
20
Time v grade
120
68
30
75
35
72
40
85
60
88
73
90
80
100
Grade
100
80
60
Grade
40
20
0
0
20
40
60
80
Minutes spent on Assignment
100
SUMMARY
OUTPUT
Regression Statistics
Multiple R
0.94343
R Square
0.89006
Adjusted R Square 0.868072
Standard Error
4.129936
Observations
7
ANOVA
df
SS
Regression
1
690.4324
Residual
Total
5
6
85.28185
775.7143
Coefficie
nts
Standard
Error
Intercept
59.90572
3.889456
X Variable 1
0.469408
0.073779
MS
690.432
4
17.0563
7
F
40.4794
5
t Stat
15.4020
8
6.36234
6
P-value
2.09E05
0.00141
8
Significanc
eF
0.001418
Lower 95%
49.90755
0.279753
Upper
95%
69.9038
8
0.65906
3
Lower
95.0%
Upper
95.0%
49.90755
69.90388
0.279753
0.659063
 You
can find the coefficients in the output
 Y=b+ax

B-intercept
A=x variable
Y=59.906+0.469x
**If the x variable is negative we use a – not a + -***
Y

intercept
If the X variable was 0, this would be the value
of the y variable
 Slope

If x increased by 1 unit y would increase (or
decrease) by this many units
 Fill
in a value and solve for the other
variable
 What would you expect your grade to be if
you spent 65 minutes on the assignment
I
received an 90 on the assignment, how
long did I spend working on the assignment
**No predictions are absolute***
 Making
predictions outside of the bounds of
the data


Example if you have a bunch of data about
heights and weights of children 2-4 years old you
cannot use the equation to predict the weight of
a 30 year old based on height
If we have data that predicts heating bills based
on temperature for temperatures in the 0-30
range we cannot expect the equation to give us a
good prediction for a 70 degree day
 How
much of the dependent variable is
explained by the independent variable and
the regression line
 Symbol:
 Found in excel output
 R Square
 0.89
 What does it mean? 89% of your grade is
explained by the time you spent on it
 **We want values close to 1 or 100%**
 Standard
deviation of the actual y values
around the predicted y values
 Want this number to be small
Standard Error 4.129936
Smaller numbers mean more consistent
predictions