Introduction
Download
Report
Transcript Introduction
Introduction
What is Econometrics
Application of statistical methods to economics.
It is distinguished from economic statistics (statistical
data) by the unification of economic theory,
mathematical tools, and statistical methodology.
It is concerned with (1) estimating economic
relationships (2) confronting economic theory with
facts and testing hypotheses about economic
behavior, and (3) forecasting economic variables .
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Estimating Economic
Relationships
Examples include:
– d/s of various products and services
– firms wishes to estimate the effect of advertising on sales
and profits
– relate stock price to characteristics of the firm
– macro policy, federal, state, and local tax revenue forecasts
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Testing Hypotheses
Examples include:
– Has an advertising campaign been successful in increasing
sales?
– Is demand elastic or inelastic with respect to price-important for
competition policy and tax incidence, among other things.
– Effectiveness of government policies on macro policy.
– Have criminal policies been effective in reducing crime?
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Forecasting
Examples include:
– Firms forecast sales, profits, cots of production, inventory
requirements
– Utilities project demand for energy. Sometimes, these
forecasts aren’t very good, such as what is currently
happening in California.
– Federal government projects revenues, expenditures,
inflation, unemployment, and budget and trade deficits
– Municipalities forecast local growth.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Uncertainty in These Three Steps
The reason is that we generally base these steps on
sample data rather than a complete census.
Therefore, estimated relationships are not precise.
Conclusions from hypothesis tests may accept a
false hypothesis or reject a true one.
Forecasts are not on target.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
CODING.XLS
Represents responses from a questionnaire
concerning the president's environmental policies.
The data set includes data on 30 people who
responded to the questionnaire.
The data is organized in rows and columns.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Observations
An observation is a member of the population or
sample. Alternative terms for observations are cases
and records.
Each row corresponds to an observation. The
number of observations vary widely from one data set
to another, but they can all be put in this format.
In this data set, each person represents an
observation.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Variables
Each column represents a variable. An alternative
term for variable that is commonly used in database
packages is field.
In this data set, each piece of information about a
person represents a variable. The six variables are
person’s age, gender, state of residence, number of
children, annual salary and opinion of the president’s
environmental policies.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Variables -- continued
The number of variables can vary widely from one
data set to another.
It is customary to include a row that gives variable
names.
Variable names should obviously be meaningful - and
no longer than necessary.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Type of Data
There are several ways to categorize data.
– Numerical versus categorical
– Cross-sectional versus time series
Using this example we can look at the various types
of data.
On the next slide is an alternate way to represent the
data set.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Numerical versus Categorical
The basic distinction between the two is whether you
intend to do any arithmetic on the the data. It makes
sense to do arithmetic on numerical data.
Clearly, the Gender and State variables are
categorical and the Children and Salary variables are
numerical. Age and opinion variables are more
difficult to categorize.
Age is expressed numerically, and we might want to
perform some arithmetic on age such as the average
age of respondents. However, age could be treated
as categorical.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Numerical versus Categorical -continued
The Opinion variable is expressed numerically on a
1-5 Likert scale. These numbers are only codes for
the categories strongly disagree, disagree, neutral,
agree, and strongly agree. It is not intended for
arithmetic to be performed on these numbers; in fact,
it is not appropriate to do so.
The Opinion variable is best treated as categorical.
In the case of the Opinion variable there is a general
ordering of categories that does not exist in the
Gender and State variables.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Numerical versus Categorical -continued
We classify these types of variables as ordinal. If
there is no natural ordering , as with the Gender and
State variables, we classify the variables as nominal.
Both ordinal and nominal variables are categorical.
Categorical variable can be coded numerically or left
in uncoded form. This option is largely a matter of
taste.
Coding a truly categorical variable doesn’t make it
numerical and open to arithmetic operations.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Numerical versus Categorical -continued
Some options for this example are to:
– code Gender (1 for male and 2 for female)
– uncode Opinion variable
– categorize the Age variable as young (34 or younger),
middle aged (from 35-59) and elderly (60 or older).
The one performing the study often dictates if
variables should be treated numerically or
categorically; there is no right or wrong way.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Numerical versus Categorical -continued
Numerical variables can be subdivided into two types
- discrete and continuous.
The basic distinction between the two is whether the
data arises from counts or continuous
measurements.
The Children variable is clearly discrete whereas
Salary is best treated as continuous.
This distinction is sometimes important because it
dictates the type of analysis that is most natural.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12
Cross-sectional versus Time
Series
Data can be categorized as cross-sectional or time
series.The Opinion data is Example 2.1 is crosssectional. A pollster sampled a cross section of
people at one particular point in time.
In contrast, time series data occurs when we track
one or more variables through time. An example
would be the series of daily closing values of the Dow
Jones Index.
Very different type of analysis are appropriate for
cross-sectional and time series data.
2.2 | 2.3 | 2.4 | 2.5 | 2.6 | 2.7 | 2.8 | 2.9 | 2.10 | 2.11 | 2.12